Solved

MSA 1000 - 2 Cabinets ... Disk Failure ...

Posted on 2006-07-10
Last Modified: 2013-11-15
I came in from the weekend and found the following event log on a few of my servers.

Physical Drive in Box 1, Bay 12 of the Array Controller \Device\FibreArray1, HBA Slot 4, Chassis: 9J3xxxxxxxxx, has failed. Failure Code: 0x30.

Immediately following this entry are three more ...

Logical Drive 1 of Array Controller \Device\FibreArray1, HBA Slot 4, Chassis: 9J3xxxxxxxxx, has changed status from OK to INTERIM RECOVERY.

Logical Drive 2 of Array Controller \Device\FibreArray1, HBA Slot 4, Chassis: 9J3xxxxxxxxx, has changed status from OK to INTERIM RECOVERY.

Logical Drive 3 of Array Controller \Device\FibreArray1, HBA Slot 4, Chassis: 9J3xxxxxxxxx, has changed status from OK to INTERIM RECOVERY.

I'm assuming this is telling me that one of the drives in the array went bad. However, when I look at the cabinet, I don't see an amber light on drive 12. The controller does show INTERIM RECOVERY on the LCD panel. How should I proceed to make sure that my disks are in good condition?
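
Since the same entries showed up on several servers, it can help to pull the fields out of each message and compare them side by side. The following is a minimal Python sketch, assuming the events have been copied out as plain text with exactly the wording quoted above. The function name `parse_event` and the dictionary keys are made up for illustration and are not part of any HP tool. Note that "Box" and "Bay" locate the physical drive in the storage enclosure, while "HBA Slot" refers to the host bus adapter in the server that reported the event.

```python
import re

# Patterns follow the wording of the event log entries quoted above.
# Other driver or firmware versions may word these messages differently.
PHYSICAL_RE = re.compile(
    r"Physical Drive in Box (?P<box>\d+), Bay (?P<bay>\d+) of the Array Controller "
    r"(?P<controller>[^,\s]+), HBA Slot (?P<hba_slot>\d+), Chassis: (?P<chassis>\w+), "
    r"has failed\. Failure Code: (?P<code>0x[0-9A-Fa-f]+)"
)
LOGICAL_RE = re.compile(
    r"Logical Drive (?P<drive>\d+) of Array Controller (?P<controller>[^,\s]+), "
    r"HBA Slot (?P<hba_slot>\d+), Chassis: (?P<chassis>\w+), "
    r"has changed status from (?P<old>.+?) to (?P<new>.+?)\.\s*$"
)


def parse_event(message):
    """Return a dict describing one event message, or None if unrecognised."""
    m = PHYSICAL_RE.search(message)
    if m:
        return {
            "kind": "physical_failure",
            "box": int(m.group("box")),
            "bay": int(m.group("bay")),
            "hba_slot": int(m.group("hba_slot")),
            "failure_code": m.group("code"),
        }
    m = LOGICAL_RE.search(message)
    if m:
        return {
            "kind": "logical_status",
            "drive": int(m.group("drive")),
            "hba_slot": int(m.group("hba_slot")),
            "old": m.group("old"),
            "new": m.group("new"),
        }
    return None


if __name__ == "__main__":
    samples = [
        r"Physical Drive in Box 1, Bay 12 of the Array Controller \Device\FibreArray1, "
        r"HBA Slot 4, Chassis: 9J3xxxxxxxxx, has failed. Failure Code: 0x30.",
        r"Logical Drive 1 of Array Controller \Device\FibreArray1, HBA Slot 4, "
        r"Chassis: 9J3xxxxxxxxx, has changed status from OK to INTERIM RECOVERY.",
    ]
    for line in samples:
        print(parse_event(line))
```

Running this over the exported log of each server makes it easy to see whether every host reports the same box and bay, or whether only some of them do.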
Question by:Chadwhite
13 Comments
 
LVL 88

Accepted Solution

by:
rindi earned 84 total points
ID: 17072835
It looks as if this was a temporary disk problem, but the "Interim Recovery" has fixed the problem again. If the problem shows up again on the drive in HBA slot 4, it may help to replace that HD. If, on the other hand, you get the same problem on another slot, it may help to replace the cables or the array controller, or to update the controller's firmware.
 
LVL 55

Assisted Solution

by:andyalder
andyalder earned 83 total points
ID: 17073377
You'd think "Interim Recovery" would mean it's rebuilding, but it doesn't; it means the array is running with a disk down. Run the Array Diagnostic Utility (ADU) on a server connected to the storage and see what it says. Also run the Array Configuration Utility (ACU); both programs will give you more information.
 
LVL 3

Author Comment

by:Chadwhite
ID: 17074144
I ran ADU and I think I see the drive in slot 12 showing no errors logged, but the output is a bit confusing (hex) and very long. I also ran ACU, where it's easier to isolate drive 12 in box 1, but it shows up as OK. Am I missing something? Does anyone have any pointers for interpreting the ADU information?

Thanks!
 
LVL 55

Expert Comment

by:andyalder
ID: 17075358
Is Interim Recovery the last thing in the MSA log? I'm not sure whether an error still makes it into the event log if someone uses the up button and scrolls past it, so make sure the MSA controller display is at the last message.
 
LVL 15

Assisted Solution

by:mcp_jon
mcp_jon earned 83 total points
ID: 17091255
Try updating the HP ACU utility and check again!

Maybe something got corrupted and is causing that odd reading. (I've seen it happen.)

Best Regards !
 
LVL 20

Expert Comment

by:brwwiggins
ID: 17091269
Check the firmware on your MSA. I had a similar problem where my MSA reported a drive as bad on a few servers (but not all). HP recommended that I update the firmware (which is usually their canned solution), and it worked for me in this case.
 
LVL 15

Expert Comment

by:mcp_jon
ID: 17091292
For Windows, it's the ACU GUI (graphical user interface), version 7.50.23.0, dated 13 Apr 06.

Best Regards !
 
LVL 15

Expert Comment

by:mcp_jon
ID: 17599283
I'd suggest a Split !

Best Regards !
 
LVL 15

Expert Comment

by:mcp_jon
ID: 17649814
Fine by me !

Best Regards !

Question has a verified solution.
