server crashing

Posted on 2007-09-28
Last Modified: 2010-05-18
we have a windows 2003 server SP1 running exchange.  It is an IBM x236 server.  We have tried to run the IBM diagnostics but can not view the output file because it is corrupted.  I am looking for any suggestions to try to get to the root of what is going on with this server.  we will be taking the server offline to run the diag via a bootable CD but not until next week.  In the meantime, does anyone have a suggestion for getting to the bottom of the problem.  

A chkdsk was run on the server with no arguements.  Windows decided to fix the security descriptors on several files and propogate them to the mirrored drive.  I think there is more going on here that just a hardware issue.  Now i think there is file corruption as well.  We have backups, but i am not confident that they are any good either.

Thank you.
Question by:egrotjahn
    LVL 1

    Expert Comment


    Is the volume a RAID 5 per chance?  You could backup all relevant data and then go into the SCSI firmware utilities and rebuild the volume.  This could take several hours depending on the size of the volume, but I have had a lot of luck doing this when volumes start to get weird.  Besides, sometimes it takes longer to troubleshoot than rebuild. This should not delete any data but backup anyway  just in case...


    Author Comment

    it is a mirror, not a RAID5.  I think a rebuild is in order once we determine if there is a hardware issue.
    LVL 8

    Expert Comment

    If you look at the array controller utility, does it show the array as healthy?  Are either of your drives showing as degraded?  Also, anything showing up in your event logs (system or application) that's related?
    LVL 1

    Accepted Solution

    Rebuilding the mirror may or may not resolve the issue.  Since ideally both drive should have the same data on them.  However, if the drives are possibley not healthy JjcampNR's suggestion may work.  The the arrays is not healthy and you have the option in the controller's firmware to rebuild or repair the array, you may need to run a thorough disk-check in Windows once you reboot as many files may get re-indexed during the array repair.  You may also need to defrag the drive before it starts working properly again.

    LVL 8

    Assisted Solution

    I wouldn't suggest running a defrag on the drives until you're comfortable that your array is back in a healthy state.  Don't start moving a ton of data around until you know if there are bad blocks on the drives or if there may be other mechanical problems with the drives.  It would be awful to run a defrag on the drives only to find you've  moved a bunch of critical system data into an entire sector full of bad blocks making the entire system go down for good.

    Also, there's no reason to kick off a rebuild of the array since you're on a RAID 1 mirror.  If you were no a RAID 5 and had some verified disk issues, I'd say that may help, however on a RAID 1 only one of yoru two disks is active so there's no way that some sort of array inconsistency (in the data between your drives) is causing what you're seeing.  What I would suggest is once we get this issue straightened out that you do a rebuild on the array to ensure the spare disk has a completely up to date copy of what's on the active disk.

    Once you give us info on the state of the array and if any messages are showing up in the event logs we'll be able to tell you how to move forward.

    Author Comment

    a complete rebuild and restore was necessary.  IBM replaced the motherboard and did not enable the host raid controller before windows started leading to a lot of problems and windows corruption. After a reinstall and restore I am up and running.  Thanks for the suggestions.

    Write Comment

    Please enter a first name

    Please enter a last name

    We will never share this with anyone.

    Featured Post

    How to run any project with ease

    Manage projects of all sizes how you want. Great for personal to-do lists, project milestones, team priorities and launch plans.
    - Combine task lists, docs, spreadsheets, and chat in one
    - View and edit from mobile/offline
    - Cut down on emails

    Suggested Solutions

    Title # Comments Views Activity
    Selecting one CPU or two 4 67
    VMware PSoD 2 54
    How to virtualize old server? (2003) 7 62
    Dentrix G4 1 11
    Scenerio: You have a server running Server 2003 and have applied a retail pack of Terminal Server Licenses.  You want to change servers or your server has crashed and you need to reapply the Terminal Server Licenses. When you enter the 16-digit lic…
    The 6120xp switches seem to have a bug when you create a fiber port channel when you have a UCS fabric interconnects talking to them.  If you follow the Cisco guide for the UCS, the FC Port channel will never come up and it will say that there are n…
    Need more eyes on your posted question? Go ahead and follow the quick steps in this video to learn how to Request Attention to your question. *Log into your Experts Exchange account *Find the question you want to Request Attention for *Go to the e…
    Here's a very brief overview of the methods PRTG Network Monitor ( offers for monitoring bandwidth, to help you decide which methods you´d like to investigate in more detail.  The methods are covered in more detail in o…

    779 members asked questions and received personalized solutions in the past 7 days.

    Join the community of 500,000 technology professionals and ask your questions.

    Join & Ask a Question

    Need Help in Real-Time?

    Connect with top rated Experts

    15 Experts available now in Live!

    Get 1:1 Help Now