HP MSA disk errors

We've been having issues with our HP MSA 2312sa: we're seeing parity errors, and the enclosure is in a degraded status. I've contacted HP, and after we updated the firmware on the controllers and hard drives, a verify disk process failed again. They now want me to run verify vdisk VHD1 fix yes from the CLI. My understanding is that this command will go through and fix the parity issues.

HP recommends backing up everything on the MSA before running this command. If this command just fixes parity, is backing up the 1.5 TB worth of VHDs on the MSA necessary before running this process? Additionally, HP informed us we need to stop all I/O to the MSA prior to and during this process. This involves shutting down all Hyper-V guests and hosts. Is this required or just recommended?

I tried to get an estimated time frame for scheduling downtime, but HP wasn't much help. The regular verify took about 5 hours before erroring out. How long would the fix yes option take? The vdisk is about 1.5 TB, with total space on the MSA being about 1.8 TB.
georgedschneiderAsked:
 
Andrew Hancock (VMware vExpert / EE MVE^2)VMware and Virtualization ConsultantCommented:
After weeks of firmware upgrades, console sessions and log downloads back to HP, that was their fix for us. Sorry for the bad news here.

Yes, this is the standard HP Support fix, with no understanding of 24x7 production! (Even the on-site engineers, after escalation to 3rd-line HP Storage support, were disgusted with the solution!)

We've had issues with customers' MSA2324i's with this problem; the verify vdisk takes days or weeks to complete. We had to back up just to be safe, and it was a good job we did, because the verify caused the virtual disk to be corrupted and destroyed.

We had to supply our clients with loan kit, and eventually we trashed the MSA2324i, rebuilt from scratch, and restored from backups and the loan MSA.

 
Andrew Hancock (VMware vExpert / EE MVE^2)VMware and Virtualization ConsultantCommented:
1. Yes, I would back up!
2. I would schedule it for a weekend (start Fri eve, 6pm).
3. Yes, you need to shut down the controllers, which means all your VMs.
4. Good Luck!
 
DavidPresidentCommented:
It is good form to do a backup, but not for the reason HP gave you. Verify is a safe operation. However, when you go through the process you are reading every block on every disk and are effectively running a stress test on every disk in your system at once. If a disk is going to fail, it will be during this time.

So the backup is not because the verify isn't safe, it is because you have a high risk of drive failure.
 
Gerald ConnollyCommented:
Just look at it this way: a bit of inconvenience to do a full backup versus major trauma if the "fixes" crash the system and/or corrupt your data.

Of course you should do a backup before doing this. You should always do a backup before any kind of change.
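
A backup only helps if the copy is readable and identical to the original. As a minimal sketch, and not anything HP or the posters in this thread prescribed, the following Python compares SHA-256 hashes of a source file and its copy. The function names and the example paths are hypothetical.

```python
import hashlib
from pathlib import Path


def sha256_of(path: Path, chunk_size: int = 1024 * 1024) -> str:
    """Hash a file in 1 MiB chunks so large VHDs are never loaded into memory at once."""
    digest = hashlib.sha256()
    with path.open("rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def copies_match(source: Path, backup: Path) -> bool:
    """Return True when both files have the same SHA-256 digest."""
    return sha256_of(source) == sha256_of(backup)


# Hypothetical usage; both paths are placeholders, not taken from the thread:
# copies_match(Path(r"D:\VHDs\guest1.vhd"), Path(r"E:\Backup\guest1.vhd"))
```

Hashing reads every byte of both files, so on 1.5 TB of VHDs it adds noticeable time to the backup window.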
 
georgedschneiderAuthor Commented:
I agree with you there. It was really a matter of the downtime to take down an entire site. The MSA is our storage for the location and hosts the VHDs. What is the best way to copy the VHDs off? I was thinking that with the Hyper-V servers turned off I would just copy the VHDs to an external USB drive via USB 3.0. There's about 1.5 TB of data.
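
For scheduling the downtime, the copy time for 1.5 TB can be roughed out from the sustained write speed of the external drive. The Python below is only a back-of-the-envelope sketch: the throughput figures are assumed placeholders, not measurements from this MSA or any particular USB 3.0 drive, and the function name is hypothetical.

```python
def estimate_copy_hours(size_tb: float, throughput_mb_s: float) -> float:
    """Return the hours needed to copy size_tb terabytes at a sustained rate in MB/s."""
    if throughput_mb_s <= 0:
        raise ValueError("throughput_mb_s must be positive")
    size_mb = size_tb * 1_000_000  # decimal units: 1 TB = 1,000,000 MB
    return size_mb / throughput_mb_s / 3600


if __name__ == "__main__":
    # Assumed sustained rates in MB/s; measure the real drive before planning.
    for rate in (60, 100, 150):
        hours = estimate_copy_hours(1.5, rate)
        print(f"{rate} MB/s: about {hours:.1f} hours")
```

Real throughput is limited by the slowest link in the chain (the MSA, the host, or the external drive), so copying one large file first and timing it gives a better number to plug in.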
 
georgedschneiderAuthor Commented:
We just had the power supply fail again, and HP finally agreed to replace it versus having us reseat it, which had made the PSU error go away for a little while. Funny thing is, after replacing the power supply the parity errors went away. A verify vdisk even completed error-free. What do you make of this?
 
Andrew Hancock (VMware vExpert / EE MVE^2)VMware and Virtualization ConsultantCommented:
HP have replaced and re-designed the PSUs for the MSA range.

Does your new supply have 1 LED or 2 LEDs on the rear? Later-revision supplies just have one LED now, not two. (Also, the catches are a different colour, from what I remember!)

The older ones had 2 LEDs, one indicating Power ON and one indicating Fault!
 
georgedschneiderAuthor Commented:
Why would the PSU clear up disk errors?
 
Andrew Hancock (VMware vExpert / EE MVE^2)VMware and Virtualization ConsultantCommented:
It's odd that it fixed your issue; maybe the supply was failing and unable to provide enough current.
 
georgedschneiderAuthor Commented:
Very strange indeed. But since then I haven't seen any disk errors.
 
Andrew Hancock (VMware vExpert / EE MVE^2)VMware and Virtualization ConsultantCommented:
Does this new disk have two LED lights or one?
 
georgedschneiderAuthor Commented:
New disk?  Both the PSU and Disks have green lights on them.
 
Andrew Hancock (VMware vExpert / EE MVE^2)VMware and Virtualization ConsultantCommented:
Sorry, new PSU.

Old PSUs have two LEDs.

Newer versions only have one.
 
georgedschneiderAuthor Commented:
This comment resolved the issue I originally had and asked about in my question.
Question has a verified solution.
