How to fix high Load_Cycle_Count laptop drive (TOSHIBA MK6006GAH in Vaio TX1XP)?

3

Hoping someone knows exactly what's going on here. It seems this drive has some combination of aggressive power saving settings and Ubuntu defaults that has massively increased the Load_Cycle_Count for the drive:

https://wiki.ubuntu.com/DanielHahler/Bug59695

So the drive is now so slow that it cannot boot because it takes long enough to access the data that the kernel will not recognise it properly. I'm not worried about the data on the drive, but would really like to keep the laptop functioning. There is some indication that this is possible because the figure is still low 200,000s and most drives supposedly go to 600,000. Additionally, SMART tests pass and consider the drive healthy and without errors. But the really surprising thing was when I ran mhdd...

Every single read came up red (slow) until I pressed 'R' for reset drive. I noticed the next read was normal speed, so held down 'R'. Magically the drive read perfectly for as long as I held the key BUT resumed slow (and noisy) seeking/reading after releasing. I don't think the source code to mhdd is available, so I'm not exactly sure what this means (besides, I don't know enough low-level HDD stuff either). It seems like the drive should be able to work, but is stuck trying to power save or something. There are no BIOS options on the laptop.

Does anyone know how I can stop the drive from doing extremely slow/noisy operations like this? Or is constantly resetting the drive also damanging, and only causing it to work well by luck (i.e. not a suggestion that it's fixable)?

Sam Brightman

Posted 2009-11-10T13:02:25.557

Reputation: 540

Answers

2

A High Load_Cycle_Count is not one of the pre-fail attributes. Even though your SMART status says the hard drive is OK... chances are very good that it is not ok based on your description of noise and speed. As for running at full speed while holding down R in mhdd... I have no idea.

Hard drives that are near the end of death can make many strange noises. Constantly power cycling the drive should not damage it.

I don't trust the pass/fail guidelines that the manufacturers offer with their SMART attributes. I have seen it happen many times that hard drives are corrupting data and acting flaky even though the SMART status has a big green OK next to it. I recommend looking at the raw data counts (using HDTune in windows or smartctl in linux). From there you can make a more educated decision about the health of your drive.

Google created the largest consumer hard drive failure study... so you might want to look at their findings from 2010:

We find, for example, that after their first scan error, drives are 39 times more likely to fail within 60 days than drives with no such errors. First errors in reallocations, offline reallocations, and probational counts are also strongly correlated to higher failure probabilities. Despite those strong correlations, we find that failure prediction models based on SMART parameters alone are likely to be severely limited in their prediction accuracy, given that a large fraction of our failed drives have shown no SMART error signals whatsoever.

http://static.googleusercontent.com/media/research.google.com/en//archive/disk_failures.pdf

You also might enjoy the wikipedia article which outlines the various SMART atributes and which attributes correlate with electro-mechanical failure.

http://en.wikipedia.org/wiki/S.M.A.R.T.

The only proper way to fix a hard drive is to replace it. Fixing the one you have is something you'd only want to do if you are trying to recover data from it... and even then, you'd want to migrate everything onto a new drive. I think it's safe to say you have a broken hard drive.

James T

Posted 2009-11-10T13:02:25.557

Reputation: 8 515

Really I was focussing on the fact that it seems to be capable of working properly, and isn't losing data. The convenience/cost saving of stopping it from being weird rather than sourcing and old/overpriced replacement was the aim. But all good points you've made. – Sam Brightman – 2010-05-26T11:17:47.397

So your looking for a script or something that you can start up with your system that essentially does the same thing as holding down the R button in mhdd so as to squeeze out the last bits of life out of a broken hard drive? – James T – 2010-05-26T20:47:57.003

I meant more that the mhdd R-button reset seems to cause it to behave normally, and perhaps this point to an underlying issue that could be fixed rather than irreversible failure. – Sam Brightman – 2011-06-24T02:15:52.373

1

I fixed this problem in my VAIO (TOSHIBA MQ01ABD075)

Load_Cycle_Count is not increased if you read any HDD S.M.A.R.T. option.

I use /usr/sbin/smartctl -g apm /dev/sda

#!/bin/bash

while :
  do
  sleep 3
  /usr/sbin/smartctl -g apm /dev/sda  > /dev/null
done

GoldMangust

Posted 2009-11-10T13:02:25.557

Reputation: 11

<previous comments wouldn't format well so I moved 'em to another answer.> – Matthew Elvey – 2018-03-08T21:12:02.887

0

GoldMangust's answer is good; this is similar and adds some info.

This showed an ability to stop the incrementing of load cycle count (on macOS 10.12 of a Seagate Laptop SSHD Hybrid drive, Model: ST1000LM014):

while : ; do sleep 9; smartctl -A /dev/disk0 | grep "Load_Cycle_Count\|FLAG\|smartctl" ; done

Every 9 seconds seems to be often enough.

Matthew Elvey

Posted 2009-11-10T13:02:25.557

Reputation: 419

0

I found a much better solution.

It installs a tool called hdapm in /usr/local/bin and a LaunchDaemon plist file to run it on boot, setting all connected drives to maximum performance mode.

Given how far beyond spec my drive is, this seems appropriate for sure. Others might want to modify the plist file to set their drives to a more of a middle of the road setting.

In my case, the output is quite scary - the drive is far over spec and quite possibly near failure:

smartctl 6.5 2016-05-07 r4318 [Darwin 16.7.0 x86_64] (local build) ID# ATTRIBUTE_NAME FLAG VALUE WORST THRESH TYPE UPDATED ... RAW_VALUE 193 Load_Cycle_Count 0x0032 001 001 000 Old_age Always ... 857614

.

I have smartctl installed, and by running

sudo  smartctl -g all  /dev/rdisk0,

I can see that the System Preferences setting "Put hard disks to sleep when possible" has no effect on the apm setting for this drive - it's 128 either way. I would say the drive has some unreasonable settings if a middle setting produces the result I'm seeing. Also, I wonder if even drives that come inside new Macs display/come with crazy apm behavior/settings. I would bet not, but wouldn't be that surprised if I lost such a bet.

Matthew Elvey

Posted 2009-11-10T13:02:25.557

Reputation: 419