I've RAID 5 array that started acting wonky lately. During POST it shows this error:
Slot 4 HP Smart Array P600 Controller (256MB, v2.04) 1 Logical Drive
1789-Slot 4 Drive Array Disk Drive(s) Not Responding
Check cables or replace the following drive(s):
Port 2I: Box 1: Bay 2
Select "F1" to continue - all logical drive(s) will remain disabled
Select "F2" to fail drive(s) that are not responding - Interim Recovery
mode will be enabled if configured for fault tolerance
Then it boots (after a 45 second delay) and starts rebuilding:
# hpacucli ctrl all show config
Smart Array P600 in Slot 4 (sn: P92B3AF9SXR018)
array A (SAS, Unused Space: 0 MB)
logicaldrive 1 (410.1 GB, RAID 5, Recovering, 62% complete)
physicaldrive 2I:1:1 (port 2I:box 1:bay 1, SAS, 146 GB, OK)
physicaldrive 2I:1:2 (port 2I:box 1:bay 2, SAS, 146 GB, Rebuilding)
physicaldrive 2I:1:3 (port 2I:box 1:bay 3, SAS, 146 GB, OK)
physicaldrive 2I:1:4 (port 2I:box 1:bay 4, SAS, 146 GB, OK)
(This takes about 40 minutes.)
We've tried replacing the disk in bay 2, with no perceptible difference.
We've added a spare disk into bay 5, added it to the array as a spare, but it was never used, so I removed it from spares.
Question: Is there a way to convince the RAID controller to drop drive 2I:1:2 from the array and use 2I:1:5 in its place? I've tried
hpacucli ctrl slot=4 array A modify drives=1:1,1:3,1:4,1:5
but this fails with Error: Cannot create array. Cannot add physical drive 1:1
.
(Update: smartctl -a -d cciss,$NR /dev/cciss/c0d0
tells me all five disks think they're self-assessing as OK.)