We help IT Professionals succeed at work.

Dell Servier HDD out

Dell Poweredge 710
Windows server 2003
Running Citrix Xenserver

My school network stopped working and the message on the front of the case is saying there is a drive fault.  I tried to run the systems diagnostic utility and got a message saying the diagnostics platform is empty.  When I tried to update the platform I got a message saying the network is not configured.

I believe the HD that the message referenced is toast... am I right or are there other things I need to check on first?
Comment
Watch Question

Commented:
how are the HDDs setup?

single disk? RAID? Hot swap?

Thanks,
Dekkar

Author

Commented:
I didn't set it up but believe we do not have it set up with raid.
PowerEdgeTechIT Consultant
CERTIFIED EXPERT
Top Expert 2010
Commented:
What controller do you have?
How many drives do you have?
Which drive is indicated as having failed?

Author

Commented:
It is configured with RAID and I ran the perc utility and it says drive 4 has failed.  I have reseated the drive and it is still showing as failed. I called support and I need to run the dset tool on it.  So I need to figure out how to do that.  I am not able to sync with it using the Xencenter management tool so am not sure how I will download it and run it on the server.

I was trained as a programmer so am trying to learn the server side as I go.
PowerEdgeTechIT Consultant
CERTIFIED EXPERT
Top Expert 2010
Commented:
I don't know about accessing hardware from a Windows OS through a Hyper-Visor.  I'm not sure if it translates everything to OMSA/DSET that it should, as Windows doesn't work directly with the hardware.

There are a couple of ways you can go about this outside of the OS, but that would require taking the server down for a little longer than the length of a reboot.

In the CTRL-R utility during POST, go to the PD MGMT screen (CTRL-N for Next) to see if your drive is listed as Foreign.  If so, go back to the VD MGMT screen (CTRL-P), highlight the controller, hit F2, Foreign Config, Clear.  You should then be able to set this drive as a Hot Spare to begin the rebuild.  If it is not Foreign (Missing or not listed), it most likely needs to be replaced.

If Dell cannot tell you how to run a DSET Report or install OMSA from a VM (if it is possible), then your only other option to get the information would be using OMSA Live! ... a bootable CentOS disk with OMSA loaded on it.  From there you will be able to look at the hardware and controller logs, which is probably what Dell Support is after anyway.

http://linux.dell.com/files/openmanage-contributions/omsa-65-live/
Commented:
If the server is still under maintenance, you should be able to request a technician come out and have a look at it.....

usually if you reseat the drive, and it still fails, then its a drive problem. The issue is it depends who you get on the phone, as to how the issue is resolved.


So the server is still up and running? and the RAID array is running in a degraded mode, with 1 of the 4 disks failed.

If it was me, I would call dell and tell them that the test just failed, and ask them to send someone out to fix it.
IT Consultant
CERTIFIED EXPERT
Top Expert 2010
Commented:
Dell won't send a technician out to "take a look" unless you paid LOTS of money for a "dispatch-first, trouble-shoot later" contract.  Dell will only send out a technician to replace faulty parts, after required troubleshooting has been performed to determine the faulty part.  Dell onsite technicians will not troubleshoot the issue.

If the drive is listed as "foreign", you can't rebuild it until you clear that foreign config.

Dell support should be able to walk you through all this if you need assistance.  
Commented:
I have over 30 dell servers here..... all on their standard 4 x 7 response maintenance...

When a drive fails, and the light flashes on the disk, you call them up, they ask you to reseat it... if this doesnt work, they send out a new disk.... + ask you if you need a technician to install it.

Every now and again, you will get someone on the phone who wants to troubleshoot it further, but most of the time reseating the disk is more than enough troubleshooting....
PowerEdgeTechIT Consultant
CERTIFIED EXPERT
Top Expert 2010
Commented:
Reseating is great for re-establishing connections, but if the drive has a foreign config, it will not rebuild on a reseat ... the foreign config must be addressed.  If they are not having you check for and address the foreign config on the drive, they are not doing themselves or you any favors, as it could be another problem - the disk could even be fine.

Commented:
You are right..... but how often does a server spit out one of those errors from out of nowhere?

You need to have moved disks from one server or controller to another for this to happen.

Either way, the machine is running server 2003 on it correct? You should be able to run the support tools on it and get the what dell wants.
PowerEdgeTechIT Consultant
CERTIFIED EXPERT
Top Expert 2010
Commented:
It doesn't need to have come from another server to get flagged as "foreign" ... it just needs "a" configuration present that doesn't match the metadata (including timestamp) of the other member drives ... in a number of scenarios, the drive will go offline and when the controller sees that the config on the drive is outdated, it will assume the other three drives are right, make that the active VD, and mark the other drive as foreign - just in case - and let you make the determination on what to do with it.  If you haven't seen this, that's a good thing ... I hope that continues for you.

All I'm saying is the foreign config could be preventing the drive from rebuilding ... it takes seconds to check and to take care of, if that's the case.  If it is not flagged as foreign, then continue on as usual toward the idea that the drive is bad.

Author

Commented:
Okay thanks,  I will try some of your suggestions on Sunday.  I was trained as a programmer and hired to maintain the network for a 300 student school.  I am desperately learning as fast as I can as I go along.

Author

Commented:
Sorry it took so long to complete this question.  I got a utility from Dell and it shows bad blocks so we will try to repair that.  Thanks all for the answers.

Explore More ContentExplore courses, solutions, and other research materials related to this topic.