I'm looking for some pointers on the best way to manage a Linux data server, with 20 hard drives and with new files always being added (around 30GB/day). Performance is not important, reliability is crucial. I don't like RAIDs (many issues with RAID5 and broken disks!) . For now, all drives are accessed individually, but this is a problem because I've to keep moving data around ...
I'm trying to understand if LVM/Hadoop/some-other-magic is the best for me.
I'm specially concerned with a situation it hardware failures and with recovery plans to get back the data and/or not to loose the data on other drives (when some sort of middleware) is involved.
I'm fluent in Linux, not so much in (large) data management.