Link to home
Start Free TrialLog in
Avatar of VaiSatchAtrucci
VaiSatchAtrucci

asked on

Dell PowerEdge 2950 Raid 5 Drive in Foreign State will not boot Server 2003 OS

There was a previous question asked by someone stating the following and I am having the same exact problem:
-------start---------
Today morning we've found an server that has somehow crashed over the weekend.

It's an Dell Poweredge 2950 with an Perc5/i adapter and 2 attached 70 GB SAS drives  for raid1,  and 4x 130gb SAS for Raid5.

If we try to boot the server the Perc5 adapter shows an error:
"Foreing configuration found. Press "c" for config utility, any other key to continue..."

"Continue" will result in try to boot from PXE-dhcp and repeat. Win2003 will not boot.
--------end---------
The difference in my issue is I have a 4 disk array and one of the disks is showing up in a foreign state.  The resolution to that particular issue for that guy was he deleted the virutal disks and recreated them and everything fixed itself... I am afraid to try that without at least posing the question, will that really work?  The drives appear to not be corrupted in any shape form or fashion when I go into the configuration manager - it allows me to "blink" all the drives so connectivity is good and it's reporting the drive sizes correctly and other than the "foreign" state of the one drive, everything else appears to be ok?

Any ideas on how to proceed from here?

Thanks in advance!
ASKER CERTIFIED SOLUTION
Avatar of gardmanIT
gardmanIT

Link to home
membership
This solution is only available to members.
To access this solution, you must be a member of Experts Exchange.
Start Free Trial
Avatar of Hoard
Hoard

The system boots now currently right?
If so; either use Openmanage Server Administrator to clear forgeign or go into the CTRL R for the RAID as the system is coming up.
CTRL R will probably be the easiest.
Hit that as the system is coming up.
Once in there; on the first page; highlight the controller (It'll be the top option in the tree)
Hit F2. You have an option to Clear Foreign.
Say yes.
That'll take care of that problem.
At that point I'd recommend setting this problem drive as a hotspare and see if it will rebuild. If it does; you're good to go (Update RAID firmware after the rebuild is finished)
If it doesn't you're problly looking at a bad disk.
Good luck!
I have the same problem that seems to have started when the owner removed one of the drives and it showed up as foreign. During the eval process, another drive was removed and it too came up as foreign. I tried the "clearing" outlined above. That cleared the "foreign" status but now they show up as "missing". I'm reluctant to rebuild in the event I might lose critical data. Any suggestions.
Hi Grey Wolf,

If you have two "missing" drives in a raid 5 set you have lost the data and a rebuild will not work.

RAID 5 utilisies disks as follows and allows for 1 disk in the set to fail without loss of data.
you can have any number of disks from 3 upwards (there is often an upper limit due to controller capacity or slots) These disks are used by the RAID controller as follows. Each block of data is split across (using a 3 disk example) 1/2 the data on one disk, the other half on the next and whats known as parity data on the thrid. In a four disk raid set its 1/3 of the data on the first 3 disks and parity on the fourth. This is why in RAID 5 you get the capacity of the number of disks -1 as one disks worth of space is used to store parity info.
In effect the parity info is a calculation of a checksum, a mathmatic sum of all the bytes of data.
If you lose a disk it will either contain the parity info or 1/2 of the data, if the parity info is lost no problem the data is still on the other two disks, if half the data is missing the RAI controller can take half the data and the parity info and do the maths to reconstruct the missing half of the data.

So RAID 5 lets you lose a sinlgle didk and recover or "rebuild" it onto a replacement disk, if while one disk is down however another fails (or was removed in your case) you have lost the data. Rebuilding won't work as too much of the data / parity is missing.

Trying a rebuild will not do any harm but it will probably just confirm that you have lost the data when it fails.
If you have a disk fail in RAID 5 the normal process is....
Remove it (having made sure you get the right failed drive - blink the drive if possible from the RAID controller or its software (say dell open manage)
Insert a new drive - it wil show as foreign which simply means the controller sees it as a disk it hasn't seen before, clear the foreign status.
Select the virtual disk (the whole raid 5 set) and select rebuild - this intitiates the process of calculating all the missing data and or parity data that was stored on the failed disk and writes it to the new one.

For info, if the RAID controller had had access to a spare disk already installed in the server as a "hot spare" when the first disk was removed it would have automatically rebuilt the missing data on this spare so when the second disk was removed it would not have been ternminal. Another option is RAID6 (not supported on the Perc 4 in the 2950 from memory. This uses an extra disk in the RAID set for parity info so in effect needs 4 disks min but allows two disks to fail (be removed) without failing. Down side is your capacity drops to number of disks -2