Questions tagged [drive-failure]

116 questions
45
votes
10 answers

How should I burn in hard drives?

Google did a very thorough study on hard drive failures which found that a significant portion of hard drives fail within the first 3 months of heavy usage. My coworkers and I are thinking we could implement a burn-in process for all our new hard…
Phil
  • 1,003
  • 2
  • 11
  • 16
37
votes
5 answers

Mean Time Between Failures -- SSD

The Mean Time Between Failures, or MTBF, for this SSD is listed as 1,500,000 hours. That is a lot of hours. 1,500,000 hours is roughly 170 years. Since the invention of this particular SSD is post-Civil War, how do they know what the MTBF is? A…
OSE
  • 473
  • 1
  • 4
  • 5
20
votes
4 answers

What exactly is a URE?

I have been looking into RAID5 Vs RAID6 lately and I keep seeing that RAID5 is not secure enough anymore because of the URE ratings and increasing size of the drives. Basically, most of the content I found says that in RAID5, in case you have a disk…
Memes
  • 368
  • 2
  • 3
  • 10
19
votes
6 answers

Should I 'run in' one disk of a new RAID 1 pair to decrease the chance of a similar failure time?

I'm setting up a RAID1 array of two new 4TB hard drives. I heard somewhere that building a RAID1 array from new, identical hard drives bought at the same time increases the chance that they will fail at a similar point in time. I am…
a_henderson
  • 291
  • 1
  • 6
15
votes
6 answers

How to recover from a drive failure in a RAID 5 configuration?

This morning a drive failed on our database server. The drive array (3 disks) is set up in a RAID 5 configuration. While we wait for a replacement drive, we are preparing a recovery strategy. Users are continuing to work on the system, albeit very…
Philip Fourie
  • 537
  • 2
  • 6
  • 13
14
votes
2 answers

Is a UNC S.M.A.R.T. error serious? Do I need to take action?

I have a 300 GB Western Digital Raptor that recently started showing UNC errors in SMART. Does anyone with experience know whether I should replace it under warranty from WD? Output of smartctl -a follows: smartctl 5.41 2011-06-09 r3365 [FreeBSD 8.2-RELEASE-p6…
c2h2
  • 759
  • 2
  • 8
  • 20
11
votes
4 answers

Hard drive read errors that... stop?

My story starts out quite simply. I have a light-duty server, running Arch Linux, which stores most of its data on a RAID-1 composed of two SATA drives. It was working without any problems for about 4 months. Then, suddenly I started getting read…
Rick Koshi
  • 887
  • 3
  • 13
  • 22
10
votes
1 answer

How can I tell if a disk is failing on ESXi / what do these errors mean?

I have a server running VMware ESXi v4.1.0 348481. It has a hardware RAID10 and a SATA backup drive. I have a VM running which has its primary boot vmdk on the RAID10 datastore, and a 600 GB vmdk on the SATA backup drive's datastore. The VM runs…
Josh
  • 9,001
  • 27
  • 78
  • 124
8
votes
3 answers

Why Do Hard Drives Fail?

I'm interested in the reasons why hard drive failures occur. Some people say it's because the drive was handled poorly during shipping and transportation, while others say it is due to heat or prolonged intense usage, yet…
JFW
  • 209
  • 2
  • 4
7
votes
2 answers

What does it mean, if all LEDs of an LTO-6 drive are flashing?

All LEDs of a half-height LTO-6 drive are flashing at ~4 Hz. An LTO-5 tape is still inside, the drive does not react to commands, and the flashing continues after a power cycle. I could not find the error code in the Tandberg manual. The drive is…
Jonas Stein
  • 392
  • 4
  • 13
7
votes
2 answers

ext4 filesystem corruption -- maybe hardware error?

I'm getting these errors in dmesg about half an hour after I turn on the computer: [ 1355.677957] EXT4-fs error (device sda2): htree_dirblock_to_tree: inode #1318420: (comm updatedb.mlocat) bad entry in directory: directory entry across…
pts
  • 425
  • 1
  • 5
  • 15
6
votes
2 answers

HP P840 HDD RAID 5 many strange drive failures

I've been using RAID5 HDD storage (8x6TB) on my HP P840 for about 2 years now, and it has always had an unusually high number of drive failures. Everything was good for half a year, but now drives are failing in a strange way. For example 2 new drives failed a…
6
votes
1 answer

Dealing with CONFIG FAILURE on fresh drive (3ware / LSI RAID)

This is not about DRIVE failure. It's about drive CONFIG failure. I bought 3 brand new drives for my server, because existing ones have worked for over 4 years and one of them is failing (shows ECC ERROR or DEGRADE). I'm always able to rebuild array…
Kitet
  • 378
  • 2
  • 12
6
votes
1 answer

Replace HP Smart Array E200i without losing data

I've got a Smart Array 200i which seems to have some bad slots (slot 3 and slot 5). It doesn't matter what HD I put in these slots, it keeps telling me the drive is bad. My question is two-fold: Is it perhaps just something I'm doing wrong? I was…
6
votes
2 answers

Age replacement policy for hard disks and SSDs in servers

I'm planning an age replacement policy for our storage and servers. Most of them are for DBs and some for images (static content), so yes, they are under heavy I/O all the time. Also, we use Samsung 840 Pro SSDs for the RAID controllers (PERC H700i) as…
Masterl1nk
  • 147
  • 2
  • 12