HP  Smart Array 532 Controller stuck in interim recovery mode

Posted on 2010-11-10
Last Modified: 2012-06-27
We are running an older HP Proliant ML370 server and for some time now it's had 1 drive in our 4 drive RAID 5 show as bad.  This behavior started some time ago and we used to be able to reboot the server, and it would recover the RAID and then be fine for a few months.  It finally got to the point where no matter how many times we would reboot, it would not come back up, and now the drive is always amber.  We got a replacement drive from our HP supply reseller, and had no change.  We thought we had gotten a bad drive, but the replacement's replacement did the same thing.

If we go into the RAID configuration utility in the BIOS, it does not see that there is a drive present in that slot and has the RAID in interim recovery mode.  In the HP Array Configuration Utility also lists it in interim recovery mode and has "???" instead of a GB size of the drive.  The drives status is failed.

We are getting to the point of wondering if we either have a bad port, or a bad RAID Controller.  The server is old enough to where we don't really want to put any money into it, but we can't phase it out just yet.  Please let us know if we are looking at a physical problem, or if we are missing that magical "rebuild array" button buried deep somewhere.
Question by:genequip
Welcome to Experts Exchange

Add your voice to the tech community where 5M+ people just like you are talking about what matters.

  • Help others & share knowledge
  • Earn cash & points
  • Learn & ask questions
  • 2
  • 2
  • 2

Accepted Solution

wwakefield earned 125 total points
ID: 34105968
The rebuild is definately automatic.

Sounds like the Cage may be going bad.   Do you have any other drive cages you can put in to test?

Since it is older, assume not a core piece of equipment, have you considered rebuilding the server or upgrading the HP Management tools?   Prhaps a newer SmartStart version will give more information.
LVL 56

Assisted Solution

andyalder earned 125 total points
ID: 34106106
Ditto the above, unlikely to be the controller since a fault would affect more than one drive on parallel SCSI. If you haven't got a spare server to try it out in do you have a spare disk slot in the current one?

Author Comment

ID: 34106269
The server is important enough we need to keep it working, but not important enough to go through the effort of rebuilding it.  We need to keep it limping along for a little while yet till we can phase it out with new equipment.

On your suggestion we will try out the SmartStart Tools, we had not actually tried those yet.  (Perhaps we should have.  :) )

We have an identical Proliant ML370 server currently not in production so we have some spare parts to work with.

In response to andyalder, we could have a spare disk slot if needed.  Would there be a way to rebuild the array using a different slot?
The Eight Noble Truths of Backup and Recovery

How can IT departments tackle the challenges of a Big Data world? This white paper provides a roadmap to success and helps companies ensure that all their data is safe and secure, no matter if it resides on-premise with physical or virtual machines or in the cloud.


Assisted Solution

wwakefield earned 125 total points
ID: 34106396
Swap the cage out.....

Note I know physically it is easy, but unsure what you have to do logically.

LVL 56

Assisted Solution

andyalder earned 125 total points
ID: 34109666
I was thinking more of just putting the disk in the spare slot to see if it was properly recognised. Annoyingly you can't use it in the spare slot as part of the array because I don't think it will let you make it a hot spare while the array is degraded.

Swapping the cage as suggested above would be better, procedure is in the maintenance and service guide which you can get grom HP's support and download area, I can't give you a link because I don't know which generation ML370 you have.

Author Closing Comment

ID: 34111233
We were unsure where to go on our problem, and these guys gave us some good ideas to try going forward on our issue.  

Featured Post

Optimize your web performance

What's in the eBook?
- Full list of reasons for poor performance
- Ultimate measures to speed things up
- Primary web monitoring types
- KPIs you should be monitoring in order to increase your ROI

Question has a verified solution.

If you are experiencing a similar issue, please ask a related question

Moving your enterprise fax infrastructure from in-house fax machines and servers to the cloud makes sense — from both an efficiency and productivity standpoint. But does migrating to a cloud fax solution mean you will no longer be able to send or re…
Compliance and data security require steps be taken to prevent unauthorized users from copying data.  Here's one method to prevent data theft via USB drives (and writable optical media).
This video teaches viewers how to encrypt an external drive that requires a password to read and edit the drive. All tasks are done in Disk Utility. Plug in the external drive you wish to encrypt: Make sure all previous data on the drive has been …
This tutorial will walk an individual through the process of installing the necessary services and then configuring a Windows Server 2012 system as an iSCSI target. To install the necessary roles, go to Server Manager, and select Add Roles and Featu…

623 members asked questions and received personalized solutions in the past 7 days.

Join the community of 500,000 technology professionals and ask your questions.

Join & Ask a Question