I've just spotted an amber disk error message on my 2850: E0D76 BP drive 4 fail. The drives are Ultra 320 SCSI. It's been a while since this server was set up, so I cannot be absolutely sure my memory is accurate, but I think it was:

Drive 0 73GB
Drive 1 73GB paired as RAID 1

Drive 2 146GB
Drive 3 146GB paired as RAID 1

Drive 4 146GB as hot spare

(I had a dodgy 146GB drive that was flashing amber as a predicted failure, but I thought it was better than nothing to leave it as the hot spare in drive 4.)

I think I had the config as

   Raid Ch- 0

0 ONLIN A00-00
1 ONLIN A00-01
2 ONLIN A01-00
3 ONLIN A01-01

So on checking the config I now see:

[Two screenshots of the RAID configuration screen; images not available]

Seeing drive 4 as failed, I removed it, reseated it and rebooted, but it still showed as failed. So I rebooted without it in, which gave a POST warning but changed the LED from amber to blue.

My question is, can someone with a clue help me figure what has happened, and how can I recover it?

[EDIT] What's the best way to monitor hardware RAID failure? It's a PERC 4e/Di controller and the OS is Windows Web Server 2008 R2. Can the state of the RAID array be monitored from within Windows? Is there some error thrown in the event log that I can hook a warning event on to?

Logical drive 0 (RAID 1) has a failed hard drive or has not been rebuilt. Drive 4 appears to be the mirror of drive 1. Be very careful here and make sure you have a backup of all your data before proceeding. I'd consider placing drive 0 in slot 4 and seeing if it rebuilds. However, I can't verify from the screenshots which physical drives belong to which logical drives or what sizes they are. At this point, be very sure of what you're doing.

EDIT: Looking at the screenshots again, it appears that LD0 is using slots 1 and 4 and LD1 is using slots 2 and 3. Confirm the hard drive sizes in the slots and proceed accordingly. (Have a backup!)

  • I've got my data backed up already, but I would still like to get this right :) If I put a new 73GB drive into drive 0, it doesn't look like it would simply rebuild? I presume I would have to configure it as part of A00, and then it would? What I don't get is how the config seems to have migrated from what I remember to what I see now, unless my memory is worse than I thought – Saul Aug 13 '11 at 22:54
  • It really appears that LD0 is using slot 1 and 4. So I'd put the new 73GB drive into slot 4 and it should rebuild. Memory is the first thing to go with age :) – murisonc Aug 13 '11 at 23:04
  • I've got one on order, murisonc, and I'll update when I have put in the replacement – Saul Aug 14 '11 at 20:12
  • I moved the 73GB drive from drive 0 to drive 4. Will it simply rebuild over a few hours? Is there any way I can check from outside the SCSI config screen at bootup? – Saul Aug 15 '11 at 08:41
  • It should rebuild on its own, but I'd verify it's working. There is a tool called Dell OpenManage Server Administrator (free) that can be installed to manage the RAID without rebooting the server. – murisonc Aug 15 '11 at 13:36
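
On the monitoring question, one possible approach is to script the `omreport` command-line tool that ships with OpenManage Server Administrator and raise an event when a disk is not healthy. The following is only a rough sketch, assuming OMSA is installed, `omreport` is on the PATH, the PERC is controller 0, and the report uses `Key : Value` lines with `ID` and `Status` fields. The `SAMPLE` text is made up for illustration and is not output from this server; the function names and the event ID 100 are arbitrary choices.

```python
import subprocess

# Illustrative text only (assumed format, not captured from a real server).
SAMPLE = """\
ID      : 0:0:1
Status  : Ok
Name    : Physical Disk 0:0:1
State   : Online
ID      : 0:0:4
Status  : Critical
Name    : Physical Disk 0:0:4
State   : Failed
"""

def parse_pdisk_report(text):
    """Split 'Key : Value' lines into one dict per disk; each 'ID' line starts a new disk."""
    disks, current = [], None
    for line in text.splitlines():
        if ":" not in line:
            continue
        # Split on the first colon only, since the ID value itself contains colons.
        key, _, value = line.partition(":")
        key, value = key.strip(), value.strip()
        if key == "ID":
            current = {}
            disks.append(current)
        if current is not None:
            current[key] = value
    return disks

def failing_disks(disks):
    """Return the disks whose Status field is anything other than 'Ok'."""
    return [d for d in disks if d.get("Status", "").lower() != "ok"]

def read_report(controller=0):
    """Run omreport for one controller (assumes OMSA is installed and controller 0 is the PERC)."""
    cmd = ["omreport", "storage", "pdisk", "controller=%d" % controller]
    return subprocess.check_output(cmd).decode("utf-8", "replace")

def log_event(message):
    """Write an error to the Windows Application log using the built-in eventcreate tool."""
    subprocess.call(["eventcreate", "/T", "ERROR", "/ID", "100",
                     "/L", "APPLICATION", "/SO", "RaidCheck", "/D", message])

# Demonstration against the sample text; on the server, use read_report() instead.
for disk in failing_disks(parse_pdisk_report(SAMPLE)):
    print("Disk %s reports status %s" % (disk["ID"], disk["Status"]))
```

On the server itself, something like `for d in failing_disks(parse_pdisk_report(read_report())): log_event("Disk %s: %s" % (d["ID"], d["Status"]))` could be run from Task Scheduler, and a task or alert could then be attached to that event ID. Check the actual `omreport` output on your system first, since the field names here are assumed.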