We help IT Professionals succeed at work.
Get Started

Failing hard disk?

DanJourno
DanJourno asked
on
1,143 Views
Last Modified: 2012-07-23
Hi,

Im running Centos on an old HP DL140 1U server.

Before installing centos, I added a brand new WD Black 1TB drive for data. THe OS is installed on a 160GB drive that was already in the server.

3 months on, and the WD drive has started to develop problems. Centos keeps putting it into read-only mode and I have to totally power down the server to get the drive to work again. A standard reboot doesnt seem to help, and when i do try rebooting, i get a bios warning that the drive is faulty.

However, when i power the server down fully, it boots with no problem.

Here is the log from centos:-

Jul 10 00:47:58 backupserver kernel: end_request: I/O error, dev sdb, sector 614323705
Jul 10 00:47:58 backupserver kernel: end_request: I/O error, dev sdb, sector 614325649
Jul 10 00:47:58 backupserver kernel: end_request: I/O error, dev sdb, sector 1430681601
Jul 10 00:47:58 backupserver kernel: end_request: I/O error, dev sdb, sector 1430681705
Jul 10 00:47:58 backupserver kernel: end_request: I/O error, dev sdb, sector 1430683593
Jul 10 00:47:58 backupserver kernel: end_request: I/O error, dev sdb, sector 1430685545
Jul 10 00:47:58 backupserver kernel: JBD: Detected IO errors while flushing file data on sdb1
Jul 10 00:47:58 backupserver kernel: EXT3-fs (sdb1): error: ext3_journal_start_sb: Detected aborted journal
Jul 10 00:47:58 backupserver kernel: EXT3-fs (sdb1): error: remounting filesystem read-only
Jul 10 00:49:49 backupserver named[1118]: zone keshercomms.com/IN: refresh: non-authoritative answer from master 37.128.190.75#53 (source 0.0.0.0#0)
Jul 10 00:52:01 backupserver named[1118]: zone averwood.co.uk/IN: serial number (2012051611) received from master 37.128.190.75#53 < ours (2012051803)
Jul 10 00:52:15 backupserver named[1118]: zone essaproperties.co.uk/IN: serial number (2012051611) received from master 37.128.190.75#53 < ours (2012051803)
Jul 10 00:53:31 backupserver named[1118]: zone cdd.uk.com/IN: serial number (2012051615) received from master 37.128.190.75#53 < ours (2012051803)
Jul 10 00:53:48 backupserver ntpd[4315]: synchronized to 178.79.150.93, stratum 3
Jul 10 00:54:45 backupserver named[1118]: dumping master file: tmp-CNUzHVGo43: open: permission denied
Jul 10 00:55:22 backupserver named[1118]: zone tamid.co.uk/IN: serial number (2012051615) received from master 37.128.190.75#53 < ours (2012051803)
Jul 10 00:55:30 backupserver kernel: __ratelimit: 27 callbacks suppressed
Jul 10 00:55:30 backupserver kernel: sd 1:0:1:0: [sdb] Unhandled error code
Jul 10 00:55:30 backupserver kernel: sd 1:0:1:0: [sdb] Result: hostbyte=DID_BAD_TARGET driverbyte=DRIVER_OK
Jul 10 00:55:30 backupserver kernel: sd 1:0:1:0: [sdb] CDB: Read(10): 28 00 6f f6 3e c1 00 00 08 00
Jul 10 00:55:30 backupserver kernel: __ratelimit: 27 callbacks suppressed
Jul 10 00:55:30 backupserver kernel: EXT3-fs error (device sdb1): ext3_find_entry: reading directory #58695697 offset 0
Jul 10 00:55:53 backupserver kernel: sd 1:0:1:0: [sdb] Unhandled error code
Jul 10 00:55:53 backupserver kernel: sd 1:0:1:0: [sdb] Result: hostbyte=DID_BAD_TARGET driverbyte=DRIVER_OK
Jul 10 00:55:53 backupserver kernel: sd 1:0:1:0: [sdb] CDB: Read(10): 28 00 28 01 ff 19 00 00 08 00
Jul 10 00:55:53 backupserver kernel: EXT3-fs error (device sdb1): ext3_find_entry: reading directory #20971528 offset 0
Jul 10 00:56:00 backupserver kernel: sd 1:0:1:0: [sdb] Unhandled error code
Jul 10 00:56:00 backupserver kernel: sd 1:0:1:0: [sdb] Result: hostbyte=DID_BAD_TARGET driverbyte=DRIVER_OK
Jul 10 00:56:00 backupserver kernel: sd 1:0:1:0: [sdb] CDB: Read(10): 28 00 00 00 6e e1 00 00 08 00
Jul 10 00:56:00 backupserver kernel: EXT3-fs error (device sdb1): ext3_find_entry: reading directory #11 offset 0
Jul 10 00:56:11 backupserver named[1118]: zone jemsmanchester.co.uk/IN: serial number (2012051615) received from master 37.128.190.75#53 < ours (2012051803)
Jul 10 00:56:32 backupserver kernel: sd 1:0:1:0: [sdb] Unhandled error code
Jul 10 00:56:32 backupserver kernel: sd 1:0:1:0: [sdb] Result: hostbyte=DID_BAD_TARGET driverbyte=DRIVER_OK
Jul 10 00:56:32 backupserver kernel: sd 1:0:1:0: [sdb] CDB: Read(10): 28 00 33 d2 b4 d1 00 00 08 00
Jul 10 00:56:32 backupserver kernel: EXT3-fs error (device sdb1): ext3_find_entry: reading directory #27164879 offset 0
Jul 10 00:57:01 backupserver kernel: sd 1:0:1:0: [sdb] Unhandled error code
Jul 10 00:57:01 backupserver kernel: sd 1:0:1:0: [sdb] Result: hostbyte=DID_BAD_TARGET driverbyte=DRIVER_OK
Jul 10 00:57:01 backupserver kernel: sd 1:0:1:0: [sdb] CDB: Read(10): 28 00 28 02 be b1 00 00 08 00
Jul 10 00:57:01 backupserver kernel: EXT3-fs error (device sdb1): ext3_find_entry: reading directory #20971822 offset 0
Jul 10 00:57:01 backupserver kernel: sd 1:0:1:0: [sdb] Unhandled error code
Jul 10 00:57:01 backupserver kernel: sd 1:0:1:0: [sdb] Result: hostbyte=DID_BAD_TARGET driverbyte=DRIVER_OK
Jul 10 00:57:01 backupserver kernel: sd 1:0:1:0: [sdb] CDB: Read(10): 28 00 6f f5 be e9 00 00 08 00
Jul 10 00:57:01 backupserver kernel: EXT3-fs error (device sdb1): ext3_find_entry: reading directory #58695700 offset 0
Jul 10 00:57:01 backupserver kernel: sd 1:0:1:0: [sdb] Unhandled error code
Jul 10 00:57:01 backupserver kernel: sd 1:0:1:0: [sdb] Result: hostbyte=DID_BAD_TARGET driverbyte=DRIVER_OK
Jul 10 00:57:01 backupserver kernel: sd 1:0:1:0: [sdb] CDB: Read(10): 28 00 28 08 41 49 00 00 08 00
Jul 10 00:57:01 backupserver kernel: EXT3-fs error (device sdb1): ext3_get_inode_loc: unable to read inode block - inode=20989173, block=83951697
Jul 10 00:57:01 backupserver kernel: sd 1:0:1:0: [sdb] Unhandled error code
Jul 10 00:57:01 backupserver kernel: sd 1:0:1:0: [sdb] Result: hostbyte=DID_BAD_TARGET driverbyte=DRIVER_OK
Jul 10 00:57:01 backupserver kernel: sd 1:0:1:0: [sdb] CDB: Read(10): 28 00 28 5c 3f b1 00 00 08 00
Jul 10 00:57:01 backupserver kernel: EXT3-fs error (device sdb1): ext3_get_inode_loc: unable to read inode block - inode=21160397, block=84639774
Jul 10 00:57:01 backupserver kernel: sd 1:0:1:0: [sdb] Unhandled error code
Jul 10 00:57:01 backupserver kernel: sd 1:0:1:0: [sdb] Result: hostbyte=DID_BAD_TARGET driverbyte=DRIVER_OK
Jul 10 00:57:01 backupserver kernel: sd 1:0:1:0: [sdb] CDB: Read(10): 28 00 33 d9 ff 29 00 00 08 00
Jul 10 00:57:01 backupserver kernel: EXT3-fs error (device sdb1): ext3_find_entry: reading directory #27181070 offset 0
Jul 10 00:57:01 backupserver kernel: sd 1:0:1:0: [sdb] Unhandled error code
Jul 10 00:57:01 backupserver kernel: sd 1:0:1:0: [sdb] Result: hostbyte=DID_BAD_TARGET driverbyte=DRIVER_OK
Jul 10 00:57:01 backupserver kernel: sd 1:0:1:0: [sdb] CDB: Read(10): 28 00 28 54 40 89 00 00 08 00
Jul 10 00:57:01 backupserver kernel: EXT3-fs error (device sdb1): ext3_get_inode_loc: unable to read inode block - inode=21144443, block=84574265

Open in new window


Is this a hardware problem (faulty hard disk) or is this something software related (since power cycling seems to temporarily fix it)?

I've tried running fsck which doesnt show any problems at all.

Also, is the WD Black ok in a server that runs 24/7? Someone on IRC commented that I'm having these problems because its a desktop hard disk and therefore shouldnt be used in a server.

If its not ok, what SATA hard disk should I be looking at for a server?
The server only supports SATA.

Thanks
Dan
Comment
Watch Question
This problem has been solved!
Unlock 5 Answers and 16 Comments.
See Answers
Why Experts Exchange?

Experts Exchange always has the answer, or at the least points me in the correct direction! It is like having another employee that is extremely experienced.

Jim Murphy
Programmer at Smart IT Solutions

When asked, what has been your best career decision?

Deciding to stick with EE.

Mohamed Asif
Technical Department Head

Being involved with EE helped me to grow personally and professionally.

Carl Webster
CTP, Sr Infrastructure Consultant
Ask ANY Question

Connect with Certified Experts to gain insight and support on specific technology challenges including:

  • Troubleshooting
  • Research
  • Professional Opinions
Did You Know?

We've partnered with two important charities to provide clean water and computer science education to those who need it most. READ MORE