How to recover from Intel SRCZCR RAID Controller Multiple Drive "Failure" causing Array to go into Error state

Posted on 2009-04-24
Last Modified: 2013-11-14
After a power outage (yes, we have a functional power backup, and it worked in this instance), we experienced an apparent array failure. I have worked with this RAID controller and others like it (specifically the SRCZCRX and other Intel RAID controllers). I have also worked with older Adaptec RAID controllers (ARO-1130U2, 2100S, 3200S, 2000S, 2010S). The Adaptec 3200S has "secret" commands like "Make Optimal". You would only use these commands if you knew the data was there but the stupid RAID controller couldn't figure it out. Such is the case with the Intel SRCZCR that I am dealing with. The data is all there, the drives are NOT defective, and now I seek the secret keys to push to make this RAID adapter play nice. I have actually used these settings on the Intel SRCZCRX, but I do not remember what they are. I have patched the drive to the fail state at this point. I'm in a fairly big hurry to get this fixed, as the entire organization will be running on paper in about 7 hours if I can't get this thing running. Intel does not provide ANY support (free or paid) for hardware past the end of their "interactive support" date. If need be, I am more than willing to contact an Intel RAID controller guru for expert assistance; however, I don't even know where to look for that!
Question by:dsasc

    Expert Comment

    I'd toss in new drives, set up the RAID, and restore your backup. It's faster than recovery.

    I've fought this type of battle before and ended up doing data recovery on the RAID array to get the data back, as there was no support (none) available from Intel.

    Author Comment

    I agree; however, this problem has resolved itself due to a 12-hour watchdog timeout on the controller. Evidently I had already performed the necessary tasks to bring the array back online in a degraded state and left the controller at the array failure message. When the server booted up all by itself in the middle of the day, I was quite surprised. I copied the important files for the day ahead of the end-of-day backup, then began the rebuild process.

    I was able to obtain assistance from Intel through my Intel rep, and we (the support person and I) had a good conversation about the need for support after three years, even if payment is required. To simply have no support (not even a third-party referral) is NOT reasonable. The support person appeared to agree with the premise, and I moved on with my day. Thanks for your input. I still hope to attract the attention of someone with specific experience related to this RAID controller, so I will leave the question open for a while longer.

    Expert Comment

    Glad it worked itself out.

    Where was it sitting: in the controller screen, or stuck during boot-up?

    Accepted Solution

    When the machine booted, it stopped and waited for user input. At that point I had to instruct the controller to Patch/Fail the array, which allows the controller to mark the logical drive that failed first as the only failed drive (this way there is only one defective drive). It will then attempt to use the array. With parallel SCSI hardware, other devices connected to the same channel can simultaneously appear to fail when they really are not defective, so you have to tell the controller to ignore the problem and move on. I am very thankful for SAS, since that particular problem is no longer possible. I have seen the same problem with other brands of controllers in the past. Thanks again for your input.
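
For readers who want the Patch/Fail idea spelled out, here is a minimal Python sketch of the logic as described above. It is a toy model, not the controller's firmware or any Intel tool: the `Drive` class, the `patch_fail` function, and the `failed_at` timestamps are all hypothetical names made up for illustration. It also assumes an array that tolerates exactly one failed member, which the thread does not state.

```python
from dataclasses import dataclass
from typing import List, Optional, Tuple


@dataclass
class Drive:
    """Hypothetical record of one array member (illustration only)."""
    slot: int
    failed_at: Optional[float]  # time the drive was reported failed; None if never


def patch_fail(drives: List[Drive]) -> Tuple[List[Drive], str]:
    """Keep only the earliest-failed drive marked failed and clear the rest.

    Assumes the later "failures" were side effects on a shared channel,
    and that the array survives with one member missing.
    """
    failed = [d for d in drives if d.failed_at is not None]
    if not failed:
        return list(drives), "optimal"
    first = min(failed, key=lambda d: d.failed_at)
    patched = [
        Drive(d.slot, d.failed_at if d is first else None) for d in drives
    ]
    return patched, "degraded"


# Two drives on the same channel dropped out during the outage.
drives = [Drive(0, None), Drive(1, 10.0), Drive(2, 10.5), Drive(3, None)]
patched, state = patch_fail(drives)
still_failed = [d.slot for d in patched if d.failed_at is not None]
print(state, still_failed)
```

In this model only the drive in slot 1 stays marked failed, so the array can come back in a degraded state and be rebuilt. On the real controller this is done through its setup screen, and it is only safe when you are confident the other drives are actually healthy.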

    Expert Comment

    Thank you for the clarification.
