Windows 2003 Domain Controller restarting multiple times a day unexpectedly

I have two Windows 2003 Domain Controllers here, and one of them is restarting unexpectedly about 2-3 times a day.  I can't find anything in the logs that points to the cause, and virus scans cone up totally clean.  The only reason I even found out is because I was browsing the logs on my Exchange Server and kept seeing a cluster of errors saying that <mydomaincontroller> was shut down or not available for the reboot period.  Where do I start troubleshooting THIS one!?
tenoverAsked:
Who is Participating?
I wear a lot of hats...

"The solutions and answers provided on Experts Exchange have been extremely helpful to me over the last few years. I wear a lot of hats - Developer, Database Administrator, Help Desk, etc., so I know a lot of things but not a lot about one thing. Experts Exchange gives me answers from people who do know a lot about one thing, in a easy to use platform." -Todd S.

adiloadiloCommented:
can you post you memory dumb file ?
0
tenoverAuthor Commented:
Where would I find the file?
0
Has Powershell sent you back into the Stone Age?

If managing Active Directory using Windows Powershell® is making you feel like you stepped back in time, you are not alone.  For nearly 20 years, AD admins around the world have used one tool for day-to-day AD management: Hyena. Discover why.

adiloadiloCommented:
0
adiloadiloCommented:
They are usually located in %systemroot%/minidump (%systemroot%/minidump).
0
tenoverAuthor Commented:
It is set to write a kernel dump to %systemroot%\MEMORY.DMP, however when looking in that directory or doing a search, for *.DMP on the hard drive, nothing is there.
0
adiloadiloCommented:
hmm then its is not crashing it is manually shutting down or restarting , it must be a software which causing  that . can you check you event viewr for reboot or system errors on both application and system logs and post any erros ?
0
tenoverAuthor Commented:
There's NOTHING running on this domain controller besides DNS services......And the reboots are pretty random.  Here's a screenshot of the log:



Picture-1.png
0
tenoverAuthor Commented:
Where it is highlighted is when I start seeing "Services started", etc....
0
simonlimonCommented:
Do you have an IPMI connection to it, is it an HP server or IBM. HP calls is integrated lights out can't remember how does IBM name it. It could be a hardware issue -Power supply unit? It would help if you could get the IPMI logs as well.

0
tenoverAuthor Commented:
Just noticed after checking the logs, there are quite a few 3011 and 3012 errors.  I see that I can rebuild the Performance Index logs by following the instructions in this article:
http://support.microsoft.com/kb/300956

1.) Is there a good chance that these errors :
Unloading the performance counter strings for service WmiApRpl (WmiApRpl) failed. The Error code is the first DWORD in Data section.
could be causing the reboots?

2.) Is it pretty safe to rebuild the performance counters on a domain controller (my PDC emulator)?
0
tenoverAuthor Commented:
Anyone?  
Even these errors show up AFTER the reboots, I'm thinking they must be related somehow.  Should I try to rebuild the performance counters?
0
Steve_RogersCommented:
I've had servers randomly reboot and when there is nothing useful in the logs, it is almost always a hardware problem.  I'd download the latest manufacturer's offline diagnostics and run them.  

I recommend the offline over the online diagnostics only because the last time this happened, HP's online found nothing, while the offline found it to be a memory/motherboard issue.

Good luck.
0
simonlimonCommented:
I would second that. What kind of hardware is it, is it HP or IBM server. Or Supermicro? No name?

What kind of a power supply does it have?  

Can you try replacing with a another one, just to exclude that possibility?

Did you recently add any hardware to it?
0
tenoverAuthor Commented:
We are a Dell shop.  It is a Poweredge 860 with a single power supply.  There is absolutely NOTHING in the logs BEFORE the reboots, just a gaggle of events afterwards that always starts with the "The system shutdown on ----- was unexpected", and then all the services starting, etc....I will try to track down the Dell offline diagnostics.
0
simonlimonCommented:
0
tenoverAuthor Commented:
Everything comes up clean.  I've been doing some research and found some other folks on the Dell forum that had the same issue with other PE models.  Most of them reloaded the OS and were good to go, however this REALLY makes me nervous, as the server in question is my MAIN DC (First in the organization) and holds some FSMO roles, DNS and DHCP.......How easy/hard would it be for me to "reload" the OS and what is involved?
0
tenoverAuthor Commented:
Dell replaced the motherboard yesterday late afternoon.  It then rebooted 3 times overnight.  Dell came out again this morning and replaced the CPU, Controller board and Power Supply.  So far so good.  Is there any way to have an alert sent via email if/when the machine reboots itself again?  
0
tenoverAuthor Commented:
I'm going to close this question.  After having Dell replace the CPU and Power Supply, the DC stayed up for over 72 hours without rebooting, which was unheard of for the past two weeks, so I'm attributing the reboots to one of those things.  Thanks for all the help.
0

Experts Exchange Solution brought to you by

Your issues matter to us.

Facing a tech roadblock? Get the help and guidance you need from experienced professionals who care. Ask your question anytime, anywhere, with no hassle.

Start your 7-day free trial
It's more than this solution.Get answers and train to solve all your tech problems - anytime, anywhere.Try it for free Edge Out The Competitionfor your dream job with proven skills and certifications.Get started today Stand Outas the employee with proven skills.Start learning today for free Move Your Career Forwardwith certification training in the latest technologies.Start your trial today
Windows Server 2003

From novice to tech pro — start learning today.