  • Status: Solved
  • Priority: Medium
  • Security: Public
  • Views: 4966

HP server: RAID 5 - predictive failure

HP ProLiant ML350 G5 running Windows Server 2003 with RAID 5 (5 hard disks). I checked the System Management Homepage and saw that the drive at port 1 | box 1, bay 3 is in predictive failure, and its light was actually flashing orange. I replaced the hard disk, but it is still doing the same thing and showing the same error. It also shows SCSI Bus Faults: 7, Hard Read Errors: 28536.

I checked bay 4 and bay 5. Even though the status says OK, I also see some errors there:
Bay 4: Hard Read Errors: 80, Recovery Write Errors: 15

Bay 5: Hard Read Errors: 49, Recovery Write Errors: 9
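
For a side-by-side view, the counters quoted above can be put into a small structure and ranked. This is only a sketch: the dictionary layout and the function name are illustrative, the values are copied from this post, and no HP threshold is assumed because none is quoted here.

```python
# Counters copied from the post above. None means the post did not quote
# that counter for that bay.
counters = {
    "Bay 3": {"hard_read_errors": 28536, "recovery_write_errors": None},
    "Bay 4": {"hard_read_errors": 80, "recovery_write_errors": 15},
    "Bay 5": {"hard_read_errors": 49, "recovery_write_errors": 9},
}


def rank_by_hard_read_errors(data):
    """Return bay names sorted from most to fewest hard read errors."""
    return sorted(
        data,
        key=lambda bay: data[bay]["hard_read_errors"],
        reverse=True,
    )


ranking = rank_by_hard_read_errors(counters)
for bay in ranking:
    print(bay, counters[bay]["hard_read_errors"])
```

Bay 3 stands out by orders of magnitude, which is why it is worth treating separately from bays 4 and 5.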

I also ran HP Insight Diagnostics and it found errors on hard disks 3, 4 and 5. Please check the screenshots:

Any ideas?
Attachments: system homepage, insight 1, insight 2, insight 3
okamon
Asked:
1 Solution
 
rajkumartech Commented:
Try updating the firmware and the BIOS to the latest versions.
 
sosinc3 Commented:
Before you update firmware or BIOS of any kind on a questionable drive system, I would look at the backplane (where the drives connect to the system) and your RAID array controller. I would shut the system down, pull the drives out one by one (or mark them so you can keep them in order), blow some air into the drive cage, and reseat them. Take the array controller out and reseat it. Better yet, if you can get a replacement RAID array controller of the same model, replace it. Then power the system back up and see what you are up against. If the array starts to rebuild, let it finish before you do anything else. Once you have a healthy RAID, do any firmware / BIOS updates. If you update the firmware on your RAID array controller, BE SURE to also update the drivers in Windows before booting up with the new firmware. Otherwise, your system may not boot up.
 
andyalder Commented:
Hard Read Errors: 28536 is unacceptable; the other two aren't so bad. But if that is the replacement drive then it doesn't make much sense, since it should have started off at a count of zero unless it was replaced with an equally faulty drive. DF072A9844 doesn't come up as having any firmware apart from the initial release HPD0, but strangely it's only listed under Integrity servers, not ProLiants, and then only in one single document on HP's site. It's got 20 months of service hours on it, so it isn't new. Maybe it's the stats for the old one that you've posted.

If you search the link below you'll see that this drive model isn't listed at all, whereas if you just search it for DF072A you'll find 6 other drives that must be pretty similar, and they are at firmware HPD7 or above.

I would question where this replacement disk came from. I suspect it may be a model that HP has pulled or forgotten existed; there just aren't enough references to that model number on HP's website to give me confidence in it.

http://h20000.www2.hp.com/bizsupport/TechSupport/SoftwareIndex.jsp?lang=en&cc=us&prodNameId=3182562&prodTypeId=329290&prodSeriesId=1157688&swLang=8&taskId=135&swEnvOID=1005 lists all the drive firmware for Proliants and that disk isn't there.
 
okamon (Author) Commented:
The hard disk was ordered from HP, and I have already updated the firmware and driver... it didn't help.

Now I just noticed that all the other hard disks are model DF072A8B56, but the one in bay 3 is DF072A9844. Can a different hard disk model cause the problem?
1.JPG
2.JPG
3.JPG
4.JPG
5.JPG
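
A quick way to spot a mismatched drive like this is to compare each bay's model number against the most common one in the array. The sketch below is only illustrative: the model numbers are the two from this post, the function name is made up, and bays 1 and 2 are assumed to carry DF072A8B56 because the post says all the other disks do.

```python
from collections import Counter

# Model numbers per bay as described in the post. Bays 1 and 2 are an
# assumption based on "all the other hard disks are DF072A8B56".
models_by_bay = {
    1: "DF072A8B56",
    2: "DF072A8B56",
    3: "DF072A9844",
    4: "DF072A8B56",
    5: "DF072A8B56",
}


def odd_models(models):
    """Return {bay: model} for drives that differ from the most common model."""
    most_common_model, _count = Counter(models.values()).most_common(1)[0]
    return {bay: model for bay, model in models.items() if model != most_common_model}


print(odd_models(models_by_bay))
```

A different model number is not a fault in itself; it is just a prompt to check that firmware exists for that model.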
 
andyalder Commented:
It's not so much the different part number as the fact that that particular disk doesn't have any firmware available for it. I suspect HP has sent you one they've "tested" rather than a new one. They could have just made a mistake by fixing it but forgetting to reset the S.M.A.R.T. data to zero. Phone them up first thing and tell them they've sent you a pup.
 
okamon (Author) Commented:
I will order another new hard disk, try again, and post the result here.
 
okamon (Author) Commented:
Hi, I have replaced the hard drive (hard disk 3), and this time it shows as passed in the Array Configuration Utility and the System Management Homepage. In HP Insight Diagnostics the new hard disk now shows as passed too, but hard disks 4 and 5 still show errors. Is the error normal? Can I ignore it?
1.JPG
2.JPG
 
sosinc3 Commented:
It seems the errors are pointing toward an expired warranty, which is not something to worry about. However, it is also reporting errors beyond threshold. Where are these drives coming from? Are they used? If it says they are beyond warranty, I can't imagine they came direct from HP. You may need to replace these other drives as well, but one at a time, giving the array time to rebuild fully after each one.
 
andyalder Commented:
4 and 5 will show as bad until they are replaced, since it says the hard read and write error counts are above threshold.
 
okamon (Author) Commented:
So are you guys saying the 2 hard disks are going to fail soon?
 
andyalder Commented:
They won't necessarily fail, but they have bad blocks that the drive hasn't mapped out, so you may get read failures. If you value your data I would replace them (but not at the same time). You may just get away with replacing them under the server's warranty if you're lucky.
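
On the "not at the same time" point: RAID 5 keeps one parity block per stripe, so it can reconstruct any single missing member but not two at once. The toy sketch below shows this with XOR parity on one stripe. It is not how the HP array controller is implemented; the block contents and function names are made up for illustration, and a real array also rotates the parity block across the disks.

```python
from functools import reduce


def xor_blocks(blocks):
    """XOR equal-length byte blocks together, byte by byte."""
    return bytes(reduce(lambda a, b: a ^ b, column) for column in zip(*blocks))


# One stripe on a 5-disk RAID 5 set: four data blocks plus one parity block.
data = [b"AAAA", b"BBBB", b"CCCC", b"DDDD"]
parity = xor_blocks(data)
stripe = data + [parity]


def rebuild(stripe, missing):
    """XOR all surviving blocks; 'missing' is a set of failed disk indexes."""
    survivors = [blk for i, blk in enumerate(stripe) if i not in missing]
    return xor_blocks(survivors)


# With one disk missing, the survivors reproduce its block exactly.
one_lost = rebuild(stripe, {2})

# With two disks missing, the survivors only yield the XOR of the two lost
# blocks, which is not enough to recover either of them.
two_lost = rebuild(stripe, {2, 3})

print(one_lost == stripe[2], two_lost in (stripe[2], stripe[3]))
```

This is why each replacement should be allowed to finish rebuilding before the next drive is pulled.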
Question has a verified solution.
