
I have (well, had: it's already been swapped out, asking this for future use) a drive that is indicating pending failure via its internal SMART tests and bad block remaps.

It is straightforward to mdadm --fail the soon-to-be-bad drive and rebuild to a hot spare, or to pull the drive, put a new one in, and then rebuild to that drive.
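
In other words, the usual sequence is something like this (the device names are placeholders):

mdadm /dev/md0 --fail /dev/sdbad     # mark the suspect drive failed; the array is now degraded
mdadm /dev/md0 --remove /dev/sdbad   # take it out of the array
mdadm /dev/md0 --add /dev/sdnew      # add the replacement (or let an existing hot spare kick in) and resync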

The problem is this takes the array to degraded state for the entire period of that resync, incurring both the additional failure risk and the performance overhead of running degraded. That's expected if you actually have a drive failure, but it is an unnecessary exposure if the drive hasn't actually failed yet.

How can I pre-emptively replace/rebuild that single drive to a hot spare without taking it out of service first?

Nathan Neulinger

1 Answer


I'm not sure how resilient this technique is, but it "should work". I'd want to give this procedure some test runs on other drives before doing it for real.

If you have a two-disk RAID-1, you can use mdadm --grow to transform it into a three-disk RAID-1. This is a triple mirror, not a RAID-1E. Then you can fail out the drive you're worried about and --grow it back to two disks. Something like this:

# add the good disk and grow the mirror from two devices to three:
mdadm --grow /dev/md0 --level=1 --raid-devices=3 --add /dev/sdgood
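
# (optional) watch the resync progress while it runs:
cat /proc/mdstat
mdadm --detail /dev/md0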

# wait for the resync to complete, then fail the drive out that's starting to go bad:

mdadm /dev/md0 --fail /dev/sdbad --remove /dev/sdbad

# then, set the RAID-1 back to two devices.
mdadm --grow /dev/md0 --raid-devices=2

If you do this, you'll always have at least one mirrored copy of your data.

Reportedly, you can --grow an array from RAID-5 to RAID-6, but I have never heard of anyone going back to a RAID-5 afterwards. At any rate, that approach is much riskier, because you'll have to rewrite all your data on all the disks.
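
If you wanted to try the forward conversion anyway, it would look something like this (a sketch only: it assumes a four-disk RAID-5 on /dev/md1 and a fresh disk /dev/sdnew, the backup-file path is arbitrary, and depending on the mdadm version the backup file may or may not be required):

mdadm /dev/md1 --add /dev/sdnew
mdadm --grow /dev/md1 --level=6 --raid-devices=5 --backup-file=/root/md1-reshape.bak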

Mike Andrews
  • For raid1 it's simple... and there's really nothing risky about making it a 3-drive vs. a 2-drive array. It's the raid-5/raid-6 (and raid-10) cases that are interesting. – Nathan Neulinger Jun 24 '18 at 16:41
  • I suspect there may be some device-mapper magic that could be used to turn the single drive into a raid1 mirror underneath, then fail it out and drop it back to a single disk, just not sure of the specifics of how to do it. Overall it just seems like there should be a better way to pre-emptively replace a failing drive in an arbitrary array than by it actually failing (or being removed). – Nathan Neulinger Jun 24 '18 at 16:43
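
One alternative worth checking on newer kernels and mdadm versions is mdadm --replace, which is intended for exactly this situation: it rebuilds onto a spare while the old member stays in service, so the array never runs degraded. A rough sketch, with /dev/sdbad and /dev/sdnew as placeholder names:

mdadm /dev/md0 --add /dev/sdnew                        # add the replacement as a spare
mdadm /dev/md0 --replace /dev/sdbad --with /dev/sdnew  # copy onto the spare while /dev/sdbad stays active
mdadm /dev/md0 --remove /dev/sdbad                     # after the copy completes and the old member is marked faulty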