Solved

PowerEdge 2500 died - BSOD followed by 'NTdetect failed'

Posted on 2008-10-28
4
776 Views
Last Modified: 2012-06-27
On Sunday afternoon, I popped into the office to change a few email settings on our Dell PowerEdge 2500 running Windows SBS 2000 and Exchange 2000.

I switched on the monitor to be greeted with the blue screen of death.

I powered it down and started it booting back up, which it did, although one of the RAID 5 drives is flashing the 'X' LED, suggesting that drive has had it.

After booting, the Exchange mail stores were all offline and wouldn't come back online.

After a lot of messing around and a restore of the mail stores, I got them back online with no loss of data.  However, the routing tables weren't working and as such, no email could be sent or received.  I followed an article on Microsoft's support site which said to reinstall Exchange which would repair the routing tables.

I did this and it asked for a reboot to replace the files that were in use.  I hit OK and after running through the BIOS and RAID settings, instead of booting into Windows, it came up with 'NTdetect failed' and nothing else.  It just sat there.

I searched the net, which turned up results suggesting the boot files were corrupted and that I should boot using the Win2K CD and use the repair option to replace them.  I tried this, but although the LED flashed on the CD-ROM drive when a CD is inserted, the disk isn't actually recognised and it won't boot from it, so I assume the drive has also had it.

I built a bunch of boot floppies and booted into Win2K setup from those.  When I chose R to repair, it told me there was no hard drives available with a Win2K installation and quit.

This was at 2am yesterday morning, 2 hours before I had to leave to go to Holland, so I had to leave the office with webmail only and no historical email.

I'm now in Holland and have managed to get net access, so I thought I'd do some digging and see if I could formulate a plan of action before I get back - I'm back on Friday night and need to spend the weekend getting the email back up and running before Monday!

Thinking about it, I think the late hour and stress had made me forget to load any RAID drivers when booting into Windows setup, which is why it couldn't find the installation.  So, my first plan is to download the drivers and stick them on a floppy to load in during the boot process.  Hopefully that will allow me to do the repair and get back into the server.

I'm hoping it's then just a case of replacing the hard drive and allowing the RAID to work it's rebuilding magic!

The C: drive is a partition on the RAID array, so I assume that the hard drive failing has somehow buggered that?  If so, I guess it's a partition error?  Is there any way of recovering from that?

Has anybody got any good advice on recovering from this NTdetect error?

I desperately need to recover this server - not just because of the email (which is backed up and, worst case, would only lose a few days worth of email); but because there is a very important database on there - this should also be backed up, but I'm not sure how recent the last back up of it is.

Help!?
0
Comment
Question by:adfield
  • 2
4 Comments
 
LVL 3

Assisted Solution

by:Darksied9
Darksied9 earned 100 total points
ID: 22821192
This is typical of a PERC failure as the PERC Controllers that Dell uses are not exactly robust.  First thing to try is to flash the PERC with an updated version. Check the DELL DRAC to see if there are any errors.  You can try replacing the disk and letting the PERC Controller do it's magic as well, but check the PERC first, even if you see it reporting a faulty drive.  If the RAID rebuild fails or does not work I would certainly go with replacing the PERC Controller next.  Be sure that when you replace the PERC controller that you dis-engage the drives and slide them out of their bays just a bit BEFORE you power them on -- DELL TECHS OFTEN MISS THIS -- because the default configuration for the PERC is not raid and it will wipe all your drives on power up.  Go in and re-enable raid, and them connect your drives and power back up.  Good luck.
0
 
LVL 26

Accepted Solution

by:
lnkevin earned 400 total points
ID: 22823011
You have two separated issues: HD failed and Windows ntdetect is corrupted. In RAID 5, if you lose 1 drive, your system and OS should be running OK with degraded status. Your windows does not come up because its system file is corrupted. Here is what you should do.

Do not power your server down when replacing a failed drive. The Dell drives should be hot swappable so all you have to do is replace the drive when your server power is on with equal or bigger capacity, but make sure it has the same speed. RAID 5 will rebuild itself in a few hours.

After the drive replacement, you need to run repair install again. Repair install is a bit tricky so follow this instruction step by step:
http://www.microsoft.com/windowsxp/using/helpandsupport/learnmore/tips/doug92.mspx

K
0
 
LVL 2

Author Comment

by:adfield
ID: 22823139
Thanks for the replies.

lnkevin, how can I replace the drive without it booting into Windows?  Should I use the configuration utility accessible when the server loads the BIOS?  And then once the rebuild is finished, use the repair function?  That link you posted references Win XP, I assume SBS 2000 is the same process?

Again, thanks!
0
 
LVL 26

Expert Comment

by:lnkevin
ID: 22823854
how can I replace the drive without it booting into Windows?

If your drives are already in RAID 5, you don't have to to much. First you need to get in the Perc Bios by hitting ctrl - M when it appears.  Follow the following instruction from Dell to rebuild a failed drive. Find the key word: Rebuilding Failed Hard Drives in this document for quick access

http://www.cs.uwaterloo.ca/~brecht/servers/docs/PowerEdge-2600/en/Perc4scdc/UG/bios.htm

Repair install should be the same on Windows all version.

K
0

Featured Post

PRTG Network Monitor: Intuitive Network Monitoring

Network Monitoring is essential to ensure that computer systems and network devices are running. Use PRTG to monitor LANs, servers, websites, applications and devices, bandwidth, virtual environments, remote systems, IoT, and many more. PRTG is easy to set up & use.

Question has a verified solution.

If you are experiencing a similar issue, please ask a related question

The problem of the system drive in SBS 2003 getting full continues to be an issue, even though SBS 2008 and SBS 2011 are both in the market place.  There are several solutions to this, including adding additional drive space or using third party uti…
Data center, now-a-days, is referred as the home of all the advanced technologies. In-fact, most of the businesses are now establishing their entire organizational structure around the IT capabilities.
Along with being a a promotional video for my three-day Annielytics Dashboard Seminor, this Micro Tutorial is an intro to Google Analytics API data.
Migrating to Microsoft Office 365 is becoming increasingly popular for organizations both large and small. If you have made the leap to Microsoft’s cloud platform, you know that you will need to create a corporate email signature for your Office 365…

864 members asked questions and received personalized solutions in the past 7 days.

Join the community of 500,000 technology professionals and ask your questions.

Join & Ask a Question

Need Help in Real-Time?

Connect with top rated Experts

22 Experts available now in Live!

Get 1:1 Help Now