• Status: Solved
  • Priority: Medium
  • Security: Public
  • Views: 1233
  • Last Modified:

HP MSA 1000 controller failure - beyond useful life?

The end question is, specifically what would be the expected useful life of an HP MSA 1000 controller board?  I had one "go nuts" recently, and it was suggested that it was past its expected life.  I think the MSA 1000 is about six years in service.  Yes, hard drives fail, but a controller board that is only electronics, in a fairly dust free, static and temperature controlled server room, on a dedicated power circuit, connected via a UPS (standby, with over and under voltage protection) shouldn't "wear out" in about six years should it?

A single drive in the array went bad, not an unusual event.  A hot spare was present and configured.  The controller did not pick up the failed drive status, and did not pull the hot spare online and rebuild the array automatically as it should have.
Instead, the O/S (Windows Server 2003) ended up seeing the volume as heavily corrupted, and resulted in many errors as end users tried to access shares and the shared files on the volume.  The volume could be seen, but the longer things went the more corruption seemed to occur.  A rebuild of the volume was a final result, along with replacing the controller board in the MSA 1000.

For more info, the array was one LUN, with RAID6 (ADG).  Ten physical disks.  Rebuild priority was not set to zero.  Everything I know of should indicate the array should have easily survived a single drive failure.
0
JoesCat
Asked:
JoesCat
  • 4
  • 4
  • 3
5 Solutions
 
Andrew Hancock (VMware vExpert / EE MVE^2)VMware and Virtualization ConsultantCommented:
I would not have though so, I still have BBC Microcomputers running, that are almost 30 years old in a few weeks, and they mostly run on TTL logic!

We've got MSA500, 1000, 1500, DEC HSG80s running with no issues.

Plenty of spares on eBay to be had, if short.
0
 
andyalderSaggar makers bottom knockerCommented:
All digital electronics suffer from brain farts occasionally and it doesn't necessarily mean they're past their useful life. All it takes is a cosmic ray to flip a bit in a register and they start running with corrupt data, if you're lucky they crash and reboot, but there's no guarantee of that.
0
 
JoesCatAuthor Commented:
Thanks for your responses thus far, appreciated!

Any other thoughts anyone?  Although it pertains specifically to a HP MSA 1000, it's a bit general also, whether electronics should "wear out" in a arguably short timeframe.
0
Concerto's Cloud Advisory Services

Want to avoid the missteps to gaining all the benefits of the cloud? Learn more about the different assessment options from our Cloud Advisory team.

 
Andrew Hancock (VMware vExpert / EE MVE^2)VMware and Virtualization ConsultantCommented:
capacitors break down quickly, by logic devices, pga, TTL, CMOS devices should last a long time.
0
 
andyalderSaggar makers bottom knockerCommented:
The cache battery is really the only thing that wears out.
0
 
JoesCatAuthor Commented:
Thanks once again for the comments.  I don't recall any capacitors or other components that might wear on the MSA1000 controller board.  Makes sense about the battery, as it is a rechargeable.  I had no indication that the battery failed in this instance.  

Getting ready to shutdown the topic.  Any further comments anyone, particularly for a high performance, server grade HP storage array?
0
 
Andrew Hancock (VMware vExpert / EE MVE^2)VMware and Virtualization ConsultantCommented:
there are caps on backplane, in power supply, and generally decoupling caps.

but these should be good for 10/15 years! unless left in storage with non stable temperatures
0
 
andyalderSaggar makers bottom knockerCommented:
Replacement for MSA1000 is P2000 G3 range, or for higher performance with more disks or SSDs the P6000 range.
0
 
Andrew Hancock (VMware vExpert / EE MVE^2)VMware and Virtualization ConsultantCommented:
We had more failures with the P2000 range than we ever had with the MSA1000 series.

6-8 complete failures last year, which HP claimed were overheating backplane faults or psu faults something weird.

all purchased at different times, only problem visiting Engineers do not seem know these baby units.
0
 
andyalderSaggar makers bottom knockerCommented:
Yeah, I don' treally like them either, couldn't understand why HP didn't use a pair of Smart Array controllers plus a management chip like they did with the MSA1000 rather than OEM'ing something from Dot Hill.
0
 
JoesCatAuthor Commented:
Thanks everyone.  I split the points widely, as this is a rather general discussion.  It does provide valuable feedback to support my case that this MSA1000 unit should not be determined "beyond useful life" at its current time-in-service.
0

Featured Post

Independent Software Vendors: We Want Your Opinion

We value your feedback.

Take our survey and automatically be enter to win anyone of the following:
Yeti Cooler, Amazon eGift Card, and Movie eGift Card!

  • 4
  • 4
  • 3
Tackle projects and never again get stuck behind a technical roadblock.
Join Now