Solved

Hard Disk Failure: read-error timeouts take too long

Posted on 2014-01-22
Last Modified: 2014-01-24
I have a failing hard disk (Hitachi HDT721032SLA380 - 320G, SATA). I know there are 3000 or more sectors that error on reading.
I'm running ntfsclone --rescue on Linux to make a copy (to a file on an external hard disk), but it reads in 512-byte chunks and takes 30 seconds to time out on each failure. (From what I can see, the errors are in 4096-byte chunks, so it actually fails 8 times (4 minutes) for each failing chunk.)
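
To make the arithmetic above concrete, here is a rough sketch of the delay estimate. The function names are invented for illustration, and the total assumes that each of the roughly 3000 failing sectors is a separate 512-byte read that waits out the full 30-second timeout once, which may not match how the drive actually behaves.

```python
SECTOR_BYTES = 512   # size of each ntfsclone read, per the post
CHUNK_BYTES = 4096   # apparent size of each failing region, per the post
TIMEOUT_S = 30       # observed delay per failed read, in seconds


def delay_per_chunk_s(chunk_bytes=CHUNK_BYTES, read_bytes=SECTOR_BYTES,
                      timeout_s=TIMEOUT_S):
    """Seconds lost on one failing chunk when it is read in small pieces."""
    reads_per_chunk = chunk_bytes // read_bytes
    return reads_per_chunk * timeout_s


def total_delay_hours(failing_reads, timeout_s=TIMEOUT_S):
    """Hours lost if every failing read waits out the full timeout once."""
    return failing_reads * timeout_s / 3600


if __name__ == "__main__":
    print(delay_per_chunk_s())       # 8 reads * 30 s = 240 s (4 minutes)
    print(total_delay_hours(3000))   # 3000 reads * 30 s = 25.0 hours
```

Reading each failing 4096-byte chunk in a single request would, under the same assumptions, cut the total delay by a factor of 8.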

I've tried changing /sys/block/sda/device/timeout from 30 to 5, but that hasn't sped anything up (should it?).

Does anyone know a way to reduce the timeout with or without stopping ntfsclone?
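
For anyone wanting to check the value in /sys/block/sda/device/timeout from a script, a minimal sketch follows. The function name and the sysfs_root parameter are invented for illustration, and the code assumes the file holds a single integer number of seconds, as the values 30 and 5 above suggest. As far as I understand, this setting controls how long the kernel waits for a command to complete, not how long the drive itself keeps retrying, which may be why lowering it made no visible difference here.

```python
from pathlib import Path


def read_command_timeout(device="sda", sysfs_root="/sys/block"):
    """Return the timeout (in seconds) stored in <sysfs_root>/<device>/device/timeout.

    sysfs_root is a parameter only so the function can be tried against a
    scratch directory; on a real system the default should be the right path.
    """
    path = Path(sysfs_root) / device / "device" / "timeout"
    return int(path.read_text().strip())


if __name__ == "__main__":
    # Assumes a disk named sda exists on this machine.
    print(read_command_timeout("sda"))
```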
Question by:Beneford
5 Comments
 
LVL 92

Assisted Solution

by:nobus
nobus earned 200 total points
ID: 39799724
I would stop ntfsclone right away and try to repair the drive.
I would connect it to a working Windows PC and run HDD Regenerator on it; it has saved many drives for me: http://www.dposoft.net/hdd.html

**but that is only my way of doing things
 
LVL 7

Author Comment

by:Beneford
ID: 39799773
I appreciate your comment.

Right now, this is a non-cost project, but I may give HDD Regenerator a trial later.
I'm also keen to see which files are potentially corrupted, which ntfsclone is reporting (slowly!).
 
LVL 47

Accepted Solution

by:dlethe
dlethe earned 300 total points
ID: 39799778
If this were a SAS drive, the answer would be that this is a programmable setting that can be changed with mode page editor software. The SATA spec doesn't let this setting be changed. This timeout is the HDD going into deep recovery: it is trying every trick in the firmware to recover those blocks.

Let it run. There is nothing you can or SHOULD do about this. It may take days, a week, or longer. Sorry.

Next time buy ENTERPRISE class SATA.  The recovery will give up in just a few seconds.  But you have a cheap desktop drive that is not designed for 24x7 use, and this is just the "no-free-lunch" downside of buying a consumer class disk drive.

Note also enterprise class drives have more ECC bits, so it is unlikely that you would have more than just a few unrecoverable blocks had you bought one of those drives instead.
 
LVL 7

Author Comment

by:Beneford
ID: 39799804
Thanks, dlethe,

It's nice to know there's no free lunch.

This is a home PC rather than an enterprise system (so no RAID, no expensive disks, no backups(!)). But lost data is sad rather than catastrophic.

I've already completed a full copy with dd_rescue; the ntfsclone run is an additional pass over the partition that has the failures, and I'm as interested in the listed "Can't read sector at nnnnn" messages as in the actual data it recovers.
dd_rescue let me specify the minimum block size, which meant only one delay per failing group of 8 sectors (one 4096-byte chunk). ntfsclone doesn't, and is reading 512 bytes at a time.
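
As an illustration of why the block size matters, the sketch below collapses a list of failing 512-byte sector numbers into the aligned 8-sector groups that contain them, so each group counts as one failure rather than eight. The function name and the sample numbers are made up, and it assumes the sector numbers have already been pulled out of the "Can't read sector at nnnnn" messages by some other means.

```python
def group_failed_sectors(sectors, sectors_per_group=8):
    """Return sorted (first_sector, last_sector) ranges of the aligned
    groups that contain at least one failing sector."""
    group_ids = sorted({s // sectors_per_group for s in sectors})
    return [
        (g * sectors_per_group, (g + 1) * sectors_per_group - 1)
        for g in group_ids
    ]


if __name__ == "__main__":
    # Hypothetical sector numbers: three fall in one group, one in another.
    failed = [16, 17, 23, 40]
    for first, last in group_failed_sectors(failed):
        print(f"sectors {first}-{last}")
```

With the sample input, four failing sectors reduce to two groups, which is the kind of saving a larger minimum block size gives on the timeouts.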
 
LVL 7

Author Comment

by:Beneford
ID: 39805994
Thanks guys for your thoughts.

The ntfsclone failed (returning 100s of errors per second) so I abandoned it.
The dd_rescue I had previously run seems to have worked, so all's well.
