Question

Proliant DL380G3 won't boot, RAID or drive issue issue

Asked by: williameott

Problem:  I have a Proliant DL380G3 that went off-line and would not restart.  The machine functions primarily as a SQL Server.

Equipment:  

Proliant DL380G3, dual Pentium Xeon 2.80GHz processors
Windows Server 2003
Proliant BIOS P29
Integrated Lights Out 1.64
HP Smart Array 5i version 2.56
Four 36.4GB 10krpm drives configured as RAID 5 with one logical drive ( Windows Server is then divided into a C and D drive )

I was getting an error message about a logical disk being disabled.  It gave me two options F1 to leave the current state of logical drive as disabled.  F2 to enable logical disk but risk data loss.  After choosing F1 it would give the error non system disk.  Drive 2 had the orange light on when the machine was still powered, drives 0,1, and 3 were green..  I powered down the server, disconnected power and pulled each drive out and then reseated them.  I received the same result when I powered back up.  

I restarted the machine and opened the RAID controller. This confirmed that drive 2 was a problem. All other drives were fine.

I powered off and inserted a new, identical drive into slot 2 with the notion that the RAID would recognize the new drive and write the data back to it. It did seem to initialize the drive and in a matter of minutes the new drive was green.   Inspecting the logical drive showed all four drives healthy.

When attempting to boot now, the server acts properly right up to the point it attempts to boot from CD, Floppy, then Hard Drive and at this point the machine just sits there as if waiting to start Windows.

I have 6 brand new 36.4GB 10Krpm drives available on the shelf along with one 72.8GB drive. I had visions of just replacing all of the drives one by one allowing the RAID to rebuild them. This would give the server all new drives.  I would also like to add a hot spare to the existing setup.

I hope what I have is recoverable.  I'm not well versed in dealing with the RAID issues. I'm primarily a network and security person, but this issue has fallen to me today.

I would greatly appreciate any ideas or assistance in getting this server online again.

Thank you
Bill

This Question has been solved and asker verified All Experts Exchange premium technology solutions are available to subscription members.

Subscribe now for full access to Experts Exchange and get

Instant Access to this Solution

  • Plus...
  • 30 Day FREE access, no risk, no obligation
  • Collaborate with the world's top tech experts
  • Unlimited access to our exclusive solution database
  • Never be left without tech help again

Subscribe Now

Asked On
2009-11-04 at 18:27:50ID24873310
Tags

Proliant DL380G3

,

RAID

,

drive

,

won't boot

Topic

Computer Servers

Participating Experts
2
Points
500
Comments
17

Trusted by hundreds of thousands everyday for fast, accurate and reliable tech support.

  • "The time we save is the biggest benefit of Experts Exchange to Warner Bros. What could take multiple guys 2 hours or more each to find is accessed in around 15 minutes on Experts Exchange." Mike Kapnisakis, Warner Bros.
  • "Our team likes having a resource that is more secure than just using Google and most experts using this service really know their stuff. It's nice to look here first versus using Google." Dayna Sellner, Lockheed Martin
  • "Anytime that I've been stumped with a problem, 9 out of 10 times Experts Exchange has either the accepted solution or an open discussion of the potential solution to the problem." Kenny Red, eBay Inc.

See what Experts Exchange can do for you.

Got a question?

We've got the answer.

Experts Exchange has been collecting answers to technology questions since 1996…3 million and counting! If you have a question, chances are we already have your answer.

Screenshot of Experts Exchange Knowledgebase

Need individual assistance?

Our experts are ready to help.

If you can't find the exact answer you're looking for, ask our exclusive community of 50,000 experts. You’ll get a personalized answer from a trusted professional.

Screenshot of Experts Exchange Knowledgebase

Want to learn from the best?

Read articles from industry experts.

Thousands of free tech tips, tricks, how-to’s and tutorials are available in our peer reviewed articles section. See for yourself how smart our experts are, no login required.

Screenshot of an Article

Working on a long term project?

Store your work and research.

Save solutions to your questions, answers you’ve discovered through searching plus helpful articles in your personal knowledgebase for easy future access.

Screenshot of Experts Exchange Knowledgebase

Access the answers to your technology questions today.

Subscribe Now

30-day free trial. Register in 60 seconds.

What Makes Experts Exchange Unique?

Members of the expert community talk about why the experience at Experts Exchange is different than what you will find anywhere else.

Trusted by the world's most respected brands.

image of each brand's logo

Faithfully serving IT professionals since 1996.

Experts Exchange Logo

Try it out and discover for yourself.

Subscribe Now

30-day free trial. Register in 60 seconds.

Related Solutions

  1. Raid
    What is better Raid 5 or Raid 10?
  2. Compaq Proliant DL380 with dead Raid 5I
    Our exchange server went down this morning. We are running a Compaq Proliant DL380 with the internal onboard Raid 5I, 5 total hard drives in the hot swap bays. 2 configured Raid 0, and the other 3 Raid 5. Replacement part is not available this weekend which leaves email...
  3. RAID 5 with hot spare or hot swap
    I have a DELL PE2650 with 5 Drives. RAID controller PERC 3 planning to do RAID with 2 disks for OS and RAID 5 with other 3 for DATA. Can I add another same size disks and use that as hot spare for RAID 5. Also what is the difference between hot spare and hot swap. If hot swa...
  4. RAID questions
    I was in a discussion with one of my colleagues regarding RAID during lunch. The scenario was a RAID 1 setup and if one of the harddisk actually failed and we unplug it, the system should continue to run without problems. The question he had was what happens when we plug in ...
  5. Extending RAID Drives in Proliant Servers?
    We need to find a way to increase disk space in an existing system. We have an HP Proliant DL360 G4 with (2) 72.8 GB drives running RAID 1. Is there a way to install 146.8 GB drives without losing anything? Can I just eject one of the drives and pop in a 146.8 and have it ...

Free Tech Articles

  1. WARNING: 5 Reasons why you should NEVER fix a computer for free.
    It is in our nature to love the puzzle. We are obsessed. The lot of us. We love puzzles. We love the challenge. We thrive on finding the answer. We hate disarray. It bothers us deep in our soul. W...
  2. SCCM OSD Basic troubleshooting
    SCCM 2007 OSD is a fantastic way to deploy operating systems, however, like most things SCCM issues can sometimes be difficult to resolve due to the sheer volume of logs to sift through and the dispe...
  3. Migrate Small Business Server 2003 to Exchange 2010 and Windows 2008 R2
    This guide is intended to provide step by step instructions on how to migrate from Small Business Server 2003 to Windows 2008 R2 with Exchange 2010. For this migration to work you will need the fo...
  4. Create a Win7 Gadget
    This article shows you how to create a simple "Gadget" -- a sort of mini-application supported by Windows 7 and Vista. Gadgets can be dropped anywhere on the desktop to provide instant information, ...
  5. Outlook continually prompting for username and password
    There have been a lot of questions recently regarding Outlook prompting for a username and password whilst using Exchange 2007. There are a few reasons why this would happen and I will try to cover t...
  6. Backup Exchange 2010 Information Store using Windows Backup
    There seems to be quite a lot of confusion around the ability to backup Exchange 2010 using the built in Windows Backup feature. This stems from the omission of this feature prior to Exchange 2007 s...

Cloud Class Webinars

  1. Avoiding Bugs in Microsoft Access
    Alison Balter takes and in-depth look at avoiding bugs in Access. In this webinar you will learn about using the immediate window to debug your applications, invoking the debugger, using breakpoints to troubleshoot, stepping through code, setting the next statement to execute, ...
  2. Top 10 Best New Features in Visio 2010
    Scott Helmers gives live demonstrations of the top 10 new features in Visio 2010. This webinar will teach you how to create compelling diagrams by adding shapes to the page with a single click, linking the shapes in a diagram to data in Excel (or SQL Server, or SharePoint), ...
  3. IT Consultant Business Secrets Revealed
    Michael Munger, Experts Exchange tech pro and IT consultant, pulls back the curtain on his very successful businesses and answers question on every IT consultant and business owner should know about. He shares secrets on what he did to solve the 5 most common problems in IT, ...
  4. Disaster Recovery and Business Continuity
    Quest CTO, Mike Billon, gives an overview of the steps involved in building a dunamic disaster recovery plan. Through case studies and an examination of software/hardware tooles for monitoring and testing, you'll gain a better understandin of where you are, where you want ...
  5. Organize Your Visio Diagrams with Containers and Lists
    Scott Helmers uses cross functional flowcharts, wireframe diagrams, data graphic legends and seating charts to teach you: how to ustilize all three new structured diagram components in Visio 2010, the best practices for organizeing shapes in previous version of Visio, how to organize ...
  6. How to Us Objects, Properties, Events and Methods in Microsoft Access
    Alison Dalter gives an in-depbth look at objects, properties, events and methods in Microsoft Access. In this webinar you will learn about using the object browser, referring to objects, working with properties and methods, working with object variables, understanding the ...

Join the Community

Give a Little. Get a Lot.

Join the community of experts here and help other tech pros by answering question in your area of expertise. You can earn FREE access to all Experts Exchange's premium features and resources.

Join the Community

Answers

 

by: dlethePosted on 2009-11-04 at 18:46:52ID: 25746327

First, do you run regular data consistency check/repairs?   If not, then you *could* have a catastrophic problem where you have a bad block at the beginning of one of the surviving disk drives. If that is the case, (1 dead drive, and an unrecovered read error), then part of stripe #0 is unrecoverable.

Anything of use in any event logs?   Does drive #2 even spin up?  If you plug that drive into another PC (don't leave the BIOS, just see if the bios in another PC sees the disk).

Are you 100% sure you configured RAID5 and not RAID0?  I only ask because I've seen this problem before.

The RAID should not have disabled the LUN on a drive failure.  It will do that on a double drive failure, or if you didn't have RAID5 configured.

No matter what it should be recoverable, even if it means spending $500+ on data recovery firm to get the  #2 disk data back, assuming it isn't a full media loss.

 

by: williameottPosted on 2009-11-04 at 19:12:55ID: 25746453

I do not believe any data consistency checks are performed on this server. The SQL data is backed up daily but there are a lot of other files on the server that are likely backed up somewhere.  I'm not the normal caretaker for this machine, it has fallen to me because the caretakers are out of town for a few weeks.  Naturally the server has run fine for four+ years until yesterday.

The machine indicates it is RAID 5 when viewing the logical drive configuration area.

Drive two does spin up and is green like the others and the logical drive config area shows it being okay.  I'd entered the F2 option ( which warned there could be data loss ) where the machine was originally stuck after power cycling several times. The machine seemed to drive two back to life.  The RAID config area now shows all drives being okay.

I've learned that drive two had turned orange early in the day that it died. It was several hours after this was first noticed that the machine stopped answering its SQL duties and I was contacted to come over and see what I can do with it.

 

by: dlethePosted on 2009-11-04 at 21:36:43ID: 25747015

If no data consistency checks are ever run, you can pretty much bet your life that you have some of the double-error situations.   Blocks die all of the time, and unless the host computer has a reason to read a certain logical block on the RAID, then the corresponding physical block(s) that make up the stripe & parity could be dead and nobody will know ... until a drive fails and it has to rebuild.

So I can not tell you for sure if this is the case, only that it does explain things and is highly likely you have some unrecoverable data.

If you are a business and the data is valuable enough to pay somebody, then contact one of the data recovery people like Ontrack.   They can reconstruct RAID from dead disk drives.  If you want to experiment (but recognize that it stresses your disks and could make recovery more difficult or even next to impossible) ... then you can run some diagnostics on the disks.  The diagnostics won't fix anything, but it will save you time in that if you have bad blocks and confirmed drive failures .. then you'll learn your only hope is to spend $1000+ to recover the raid set.


 

by: andyalderPosted on 2009-11-05 at 00:41:27ID: 25747629

A background parity check is automatically run whenever the server is idle for more than 15 seconds, but for once that hasn't helped. Looks like it's restore time.

 

by: williameottPosted on 2009-11-05 at 06:07:16ID: 25749540

Thank you all so much for the insight.  Yes, this is a business server, for a public safety organization actually.  I've either got to make it run again or rebuild it, need to do one or the other today.  The data records are accumulating on portable devices in the field.  I do have solid backup of the SQL data.  I've also learned this morning that this server also functions as a domain controller for a small set of personnel in one of the offices.  There are some miscellaneous files, word documents and such that probably are not backed up. The folks in the office used this server as a bit of a backup and file share for their desktop PCs. The SQL data is the main thing and I've restored a backup to a different server to ensure that the backup was good, it is.

So I gather that the thinking is I should reconfigure the server from scratch..?  I'll put all new drives in it.  I have no idea where the HP CDs are. This server was handed down from another department and I am thinking there is no hope for finding the CDs.  What do I need driver-wise and is all of this downloadable from the HP site?

Thank you,
Bill

 

by: williameottPosted on 2009-11-05 at 06:13:18ID: 25749601

Should I use RAID 5 or RAID 0 in a rebuild?

 

by: andyalderPosted on 2009-11-05 at 06:13:48ID: 25749607

Smartstart CD is on HP's web - http://www.hp.com/#Support enter machine details, then OS and download firmware CD (as you may as well get it up to date now) and then smartstart CD. Smartstart will put the right drivers on for you so don't have to mess about downloading individual ones.

 

by: andyalderPosted on 2009-11-05 at 06:36:38ID: 25749840

RAID 0 is asking for trouble, there's no parity. Use RAID 10 if you have enough capacity since it's faster than RAID 5.

 

by: williameottPosted on 2009-11-05 at 06:53:03ID: 25750039

Good heavens, there are seemingly hundreds of downloadable drivers an firmware for this machine at HPs site.  I'm downloading the HP SmartStart CD x32 version 8.25 from May of this year.  Will this CD image provide all that I need or do I need to grab other things as well?

Looks like RAID 5 is what I need to do based on the storage space requirements.

Everyone agree that using new drives is the way to go?  I'm thinking 4 like are currently in the server and adding one for a hot spare. What should I do with the sixth slot, add a second hot spare?

Thank you
Bill

 

by: dlethePosted on 2009-11-05 at 07:22:24ID: 25750397

Technically, brand spankin' new drives are higher risk of failure then ones that have been in service for a few months to several years .. due to high infant mortality.  Don't be so concerned with new vs old.  I'll take 4-year-old enterprise SCSI class disks over 1TB consumer-class SATA disks any day of the week.

Read the specs on the disks,  make sure 24x7 duty cycle & look at the ECC correction.   Go with RAID5 if you must, but I would partition it into 2 logical disks so when time allows you could add a pair of disks, mirror (RAID1) them, and migrate the O/S + scratch table space + log files to the RAID1.

 

by: andyalderPosted on 2009-11-05 at 09:25:10ID: 25751760

Smartstart CD for install, it has all the individual drivers listed on the web inbuilt so don't need any of the others. You boot that and go to "deploy server" and it copies the drivers then asks for the Windows (or other OS) CD when it needs it.

Recommend also download firmware CD to bring it up to date before rebuilding. Again it has all the individual firmware files so don't need the individual ones.

On a brand new server the Smartstart and firmware CDs are not up to date with the most recent fixes but with a G3 that's not a worry since the bugs were ironed out ages ago.

HP universal 1" SCSI disks are all 100% duty cycle, no worry there. I'd agree with dlethe about the current disks being burnt in and tested so less chance of failing than brand new ones.

 

by: dlethePosted on 2009-11-05 at 09:44:48ID: 25751938

On a related note on HP quality, am doing a cross compile on an ancient HP VMS (alpha) based system for some disk diagnostics.  Look how many cumulative reads/writes they have over the years, and the UNRECOVERED read/write totals are all zero.  That is almost unheard of.
===========================
The COMPAQ SCSI disks were manufactured in 2000!  - they don't make them like this any more
Discovered COMPAQ BD0096349A S/N "3BV120SH" on LAB153$DKC0: (Not  Enabling SMART)(8678 MB)
 Inquiry Text Page Data - ANSI defined fields
   Device Type:                         disk
   Removable Device:                    NO
   Vendor Identification:               COMPAQ
   Product Identification:              BD0096349A
   Firmware Revision:                   3B12
   SAF-TE Enclosure services available: NO
   Total Capacity (In Bytes):           9100044288
   Total grown defects:                 0
   Total Primary (factory) defects:     308
   Board serial number:                 H0000V1230T45
   Servo RAM Release number:            83470910
   Servo ROM Release number:            00000001
   Servo RAM Release date:              512A
   Servo ROM Release date:              2000
   ETF Log date MMDDYYYY:               12/16/2000
  Write errors corrected with possible delays: 0
   Total Write errors: 0
   Write errors corrected: 0
   Times correction algorithm processed (on Writes): 0
   Bytes processed (on Writes): 1857795739648
   Unrecovered errors (on Writes): 0
   Read errors corrected without substantial delay: 6304116
   Read errors corrected with possible delays: 0
   Total Read errors: 0
   Read errors corrected: 6304116
   Times correction algorithm processed (on Reads): 6304116
   Bytes processed (on Reads): 403237462176768
   Unrecovered errors (on Reads): 0
   Verify errors corrected without substantial delay: 6
   Verify errors corrected with possible delays: 0
   Total Verify errors: 0
   Verify errors corrected: 6
   Times correction algorithm processed (on Verifys): 6
   Bytes processed (on Verifys): 9100044288
   Unrecovered errors (on Verifys): 0

 

by: williameottPosted on 2009-11-05 at 09:54:19ID: 25752031

Well, I've been running some of the diagnostic tests with SmartStart.  For the drive array is is telling me that drive 1 has failed, yet drive two originally had the amber fail light on.  It tells me I should replace drive one.  I suppose I should replace drive 1 on the chance that it brings the machine back to life..?  Can I swap the drives while in SmartStart?  Curious if it is safe to pull on out and insert another in its place with the server on and SmartDrive running or should I power off and allow the server to try and boot normally?

The test show all of these drive have been running for 61 months.

Thank you for all of your assistance.

Bill

 

by: andyalderPosted on 2009-11-05 at 10:19:06ID: 25752274

You can swap while it's running SmartStart, the controller doesn't know you're running from CD, it'll rebuild just like if it was running an OS from disks.

 

by: williameottPosted on 2009-11-05 at 10:40:26ID: 25752478

I've swapped the disk, let it rebuild and I'm right back to were I was.  Server get to the point were it should boot into Windows and it just sits.   The diagnostics from SmartStart now show all four drives passing the test but all four have a yellow textbox indicating that some information wasn't written to the logs.

So I am guessing this means it is time to do the complete rebuild..?

 

by: andyalderPosted on 2009-11-05 at 11:18:07ID: 25752865

Yup, that'll be safest.

20120131-EE-VQP-002

3 Ways to Join

30-Day Free Trial

The Experts

98% positive feedback on 31,087 answers since March 2000. angeliii is a Microsoft Most Valuable Professional for his work with MS SQL Server & Develoment.

He has also proven his knowledge of Visual Basic Programming, PHP Scripting and Oracle Databases.

The Experts

97% positive feedback on 10,752 answers since July 2000. lrmoore has more than 18 years experience in the networking industry.

The six-time Mircosoft MVPs specialties include firewalls, virtual private networking, and network management.

Testimonials

"...and excellent source for support... Kind of like having your very own IT dept." Electriciansnet

Testimonials

"I was apprehensive at signing up at first. However... it has already made my life as an IT administrator much easier." JaCrews

Testimonials

"WOW! You guys have great, active, and knowledgeable people on here." moore50

Business Clients

Business Clients

In the Press

"If you’ve got a question... Experts Exchange can supply an answer.”

In the Press

"...an invaluable aid for both IT professionals and those who require tech support."

In the Press

"where IT professionals provide quick answers on just about any topic"

Business Account Plans

Loading Advertisement...