Solved

failed raid. server will not boot. help.

Posted on 2014-02-28
13
1,038 Views
Last Modified: 2016-10-27
I need help urgent. Have a failed server. Two failed drives. Raid 5.

8 total disks. Can I recover?
IBM server. Mega raid m 5015 adapter.

Please help
0
Comment
Question by:nflnetwork29
[X]
Welcome to Experts Exchange

Add your voice to the tech community where 5M+ people just like you are talking about what matters.

  • Help others & share knowledge
  • Earn cash & points
  • Learn & ask questions
  • 5
  • 4
  • 3
  • +1
13 Comments
 
LVL 88

Accepted Solution

by:
rindi earned 500 total points
ID: 39895581
If 2 disks have failed in a RAID 5 array I'm afraid you are out of luck. RAID 5 only allows for one disk to go bad. For that reason you should always change any bad disks as soon as they fail, and not wait until the next one fails too (it isn't that likely that 2 fail simultaneously, although of course it is possible).

Replace the bad disks, if there is enough space in the server for an additional one, get an extra disk, then create a new array, with a hot spare (the additional disk). Then restore your server from your backups, and in future monitor it so you can replace any failed drives as soon as they fail. The hot spare will allow for some extra time as it will be added to the array as soon as a disk fails.
0
 
LVL 47

Expert Comment

by:dlethe
ID: 39895602
Turn off ALL the hardware, and contact a data recovery firm.  Even applying power to the HDDs risk damage.  You have an excellent chance of getting all your data back with professional recovery.

Be prepared for something in the $5000 - $10000 range.  Sorry.
0
 
LVL 30

Expert Comment

by:pgm554
ID: 39895621
Not looking good.

My advice is to reseat the drives if they are hotswap and hope that maybe it was a connection problem.

What type (make and model ) drives?

Some manufactures have diags that can fix some errors on their hard disks and make them usable until you can do a swap and rebuild.
0
Simplifying Server Workload Migrations

This use case outlines the migration challenges that organizations face and how the Acronis AnyData Engine supports physical-to-physical (P2P), physical-to-virtual (P2V), virtual to physical (V2P), and cross-virtual (V2V) migration scenarios to address these challenges.

 
LVL 47

Expert Comment

by:dlethe
ID: 39895661
I write such diagnostic software professionally.  Some things can be fixed entirely with software and damage can be limited, then again if you have chunks of metal flying around inside the canister at 10K RPM then you need the guys in the bunny suits.

It all depends on what is wrong.  the problem YOU have is that you need somebody like me with the hardware and software to determine the failure, let alone the training.

If the data is valuable, then forcing the drives online may result in additional corruption if you force the WRONG drive online, even if the disks are healthy and condition is transient.

Maybe the system ran degraded when drive A died, then drive B died a few days later.  If you force A online, then your data is going to be corrupted with ZERO chance of getting it back the moment it boots up.

If you force A&B online, you corrupt data.
If you force B online, then you'll have minor corruption, or very little, but don't have the means to find out.

Log files will help. If you don't have them, then personally I would forget it, unless you have nothing to lose, then guess and hope for the best.    
(But I would use another PC to take a binary image of all 8 disks for safe keeping ... but even that is risky because the two disks that failed might have the only copy of data you need and the mere stress of reading will do more damage).
0
 

Author Comment

by:nflnetwork29
ID: 39895712
Should i try reseating the drives? 1 drive or both drives ?
0
 
LVL 47

Expert Comment

by:dlethe
ID: 39895736
That won't hurt. Go for it.  Both drives. The controller won't arbitrarily choose to do the wrong thing ... that is what "forcing" is for, to force the controller to make a decision that would put your data at risk, in situation where the controller is wrong.

Or more correctly, when controller is being overly conservative.
0
 
LVL 30

Expert Comment

by:pgm554
ID: 39895744
I would reseat all the drives.

Troubleshooting 101.

I've had slots in backplanes go bad just as often as drives.

Their is a fair chance that the back plane could be the issue.

I would exhaust the simple fixes first.
0
 

Author Comment

by:nflnetwork29
ID: 39895813
tried reseating all 8 drives. same result.
0
 
LVL 47

Expert Comment

by:dlethe
ID: 39895819
That controller also does support drive roaming, so you could leave the system on, but stay at the megaraid BIOS, and hot swap disks to different bays.  This will tell you if the problem is with the HDD or the Bay.

BUT REMEMBER - You do not know the nature of the problem with the drives.  Applying power could cause further damage.  So again, if the data is worth significantly more than the price of recovery, then the correct course of action is to call in a pro.
0
 
LVL 47

Expert Comment

by:dlethe
ID: 39895826
P.S. it is maybe 1000:1 against the backplane or controller failing unless somebody is inside that box a lot and swapping out disks.
0
 

Author Comment

by:nflnetwork29
ID: 39895827
tried spinning up a live recovery vm and now am getting the following error.

“STOP: c00002e2 Directory Services could not start because of the following error: A device attached to the system is not functioning.

Error Status: 0xc0000001. Please shutdown this system and reboot into Directory Services Restore Mode, check the event log for more detailed information.”
0
 
LVL 30

Expert Comment

by:pgm554
ID: 39895861
live recovery vm?

Who's?
Storagecraft ,Vmware,M$,Acronis?
0
 

Author Comment

by:nflnetwork29
ID: 39895927
storagecraft
0

Featured Post

The Eight Noble Truths of Backup and Recovery

How can IT departments tackle the challenges of a Big Data world? This white paper provides a roadmap to success and helps companies ensure that all their data is safe and secure, no matter if it resides on-premise with physical or virtual machines or in the cloud.

Question has a verified solution.

If you are experiencing a similar issue, please ask a related question

The business world is becoming increasingly integrated with tech. It’s not just for a select few anymore — but what about if you have a small business? It may be easier than you think to integrate technology into your small business, and it’s likely…
When it comes to protecting Oracle Database servers and systems, there are a ton of myths out there. Here are the most common.
This video Micro Tutorial explains how to clone a hard drive using a commercial software product for Windows systems called Casper from Future Systems Solutions (FSS). Cloning makes an exact, complete copy of one hard disk drive (HDD) onto another d…
This video teaches viewers how to encrypt an external drive that requires a password to read and edit the drive. All tasks are done in Disk Utility. Plug in the external drive you wish to encrypt: Make sure all previous data on the drive has been …

696 members asked questions and received personalized solutions in the past 7 days.

Join the community of 500,000 technology professionals and ask your questions.

Join & Ask a Question