Best option for predictive failure of hard drive in array

I have a HP Proliant DL380 with ESXi and a RAID-5 array. One of the drives is reporting a predictive failure. After reading the various ways to address this, I was planning to wait for it to actually fail, then swap it out with another drive. However, I am going on vacation in a week and I realized I'd like to take care of this before then, to avoid an actual failure while I"m away.

Options seems to be the shut down the server and swap it then, just pull out the predictive drive while online and active and swap it, or take the drive offline in the ACU. The first two seem to have potential issues from array failure to corruption, and I haven't read much about the third.

Any recommendations appreciated. Thanks.
ruhkusAsked:
Who is Participating?

[Product update] Infrastructure Analysis Tool is now available with Business Accounts.Learn More

x
I wear a lot of hats...

"The solutions and answers provided on Experts Exchange have been extremely helpful to me over the last few years. I wear a lot of hats - Developer, Database Administrator, Help Desk, etc., so I know a lot of things but not a lot about one thing. Experts Exchange gives me answers from people who do know a lot about one thing, in a easy to use platform." -Todd S.

65tdRetiredCommented:
Is there room in the disk shelf for hot spare drive?

Should be able to hot swap the drive, the ACU should rebuild the array.
Se HP document:
https://support.hpe.com/hpsc/doc/public/display?docId=mmr_kc-0128606
0
ruhkusAuthor Commented:
No space for a hot spare.

With regard to your link, my understanding is that since the drive didn't actually fail yet, it can cause corruption, which was why I was waiting for it to actually fail. The article (unless I missed it), didn't seem clear on my scenario.
0
andyalderSaggar maker's framemakerCommented:
There is no option to offline a drive with the ACU unless they have added it recently.  You just have to unplug it live and fit a replacement.

If you generate an ADU report either with the dianostic tab of the ACU or seperate ADU program I may be able to tell you why it's predicted to fail, Bear in mind old firmware often gives false predictive failures.
0
Python 3 Fundamentals

This course will teach participants about installing and configuring Python, syntax, importing, statements, types, strings, booleans, files, lists, tuples, comprehensions, functions, and classes.

ruhkusAuthor Commented:
I read about the false reports of failure, which was why I figured waiting was my best option initially. I've swapped failed drives before without issue, but since nothing was writing to that drive at the time, I wasn't as worried. I'm not clear how likely corruption would be in this case, but I've seen it mentioned. Would you suggest I shut down the VMs first, or any other precautions (besides a good backup of course)?
0
andyalderSaggar maker's framemakerCommented:
Assuming there is a minute risk of corruption there's no harm in shutting down the VMs, you can even shut down the hypervisor. The requirement to swap the disks hot doesn't mean you have to do it with an OS running, so long as it is powered on. It's really just a best practice anyway, you can swap them cold but if the controller sees old data on the replacement disk it'll disable it in case that data is important.

So saying I've never shut the OS down to replace a pre-failure disk on ProLiants and I've replaced hundreds of them without problem.

You say there are VMs on it, if it's VMware you can install HP offline bundle to get the status and ADU reports via CLI, Hyper-V the normal Windows GUI is used of course.
0

Experts Exchange Solution brought to you by

Your issues matter to us.

Facing a tech roadblock? Get the help and guidance you need from experienced professionals who care. Ask your question anytime, anywhere, with no hassle.

Start your 7-day free trial
mbkitmgrCommented:
Are your array member disks hot plug.  If so you need only remove the faling drive, fit the new, and the array controller will take care of the rest.

Good idea to do it now rather than when dead.
0
ruhkusAuthor Commented:
So I ended up just doing it like any other hot swap and it worked out fine. Thanks for the assurances.
0
andyalderSaggar maker's framemakerCommented:
Bear in mind that it's HPE specific, with Dell controllers you offline the drive first, other more advanced options are to mirror to spare before offlining but these controllers don't have those options.
0
It's more than this solution.Get answers and train to solve all your tech problems - anytime, anywhere.Try it for free Edge Out The Competitionfor your dream job with proven skills and certifications.Get started today Stand Outas the employee with proven skills.Start learning today for free Move Your Career Forwardwith certification training in the latest technologies.Start your trial today
Server Hardware

From novice to tech pro — start learning today.