SERVER: Dell T410 Raid Controller: Perc 6/i
What is the best way to diagnose root cause of failed drives?
Back story:
So I had a 4-drive Raid 10 + 1 hot spare setup. This setup worked for years, about 3 months ago, one drive died, and the hot spare took over. In the past week another drive failed, replaced it, and another failed during the rebuild. We lost data.
With 4 drives remaining, I created raid 6 and restored from a backup. I let the server run over the weekend and 3 more drives have failed.
- When the server was running, OMSA showed that the raid battery is dead. I plan on replacing it, but as far as I know, that shouldn't cause of my issues, perc 6 runs in a special mode, and we have a UPS backup. Ran a self-check on power-supply (pressing hardware button - led shows green).
All these drives are refurbs...but I have a hard time believing 6 drives went bad about the same time without some external cause. Looking for direction to diagnose. My guess is it's the power supply or raid controller - but how to best diagnose?