Random Restart

Hi Experts,

I logged into my server this morning only to find it had recovered from an unexpected shutdown. We are running an SBS 2008 system with all the usual services: WSUS, File, Mail, Print, DHCP, DNS, AV, IIS and Backup.

I have looked through the event logs and all I can find is the event below. There were no power failures and no other events that raise serious alarm bells.

Log Name:      System
Source:        Microsoft-Windows-WHEA-Logger
Date:          4/09/2013 3:40:03 AM
Event ID:      1
Task Category: None
Level:         Error
Keywords:      WHEA Error Event Logs
User:          LOCAL SERVICE
Computer:      SBS2008.domain.local
Description:
An uncorrected hardware error occurred. A record describing the condition is contained in the data section of this event.
Event Xml:
<Event xmlns="http://schemas.microsoft.com/win/2004/08/events/event">
  <System>
    <Provider Name="Microsoft-Windows-WHEA-Logger" Guid="{c26c4f3c-3f66-4e99-8f8a-39405cfed220}" />
    <EventID>1</EventID>
    <Version>0</Version>
    <Level>2</Level>
    <Task>0</Task>
    <Opcode>0</Opcode>
    <Keywords>0x8000000000000002</Keywords>
    <TimeCreated SystemTime="2013-09-03T17:40:03.048Z" />
    <EventRecordID>3172895</EventRecordID>
    <Correlation ActivityID="{05F7EF24-FE0B-4B0E-8287-FE7ABD0E4B75}" />
    <Execution ProcessID="1476" ThreadID="2640" />
    <Channel>System</Channel>
    <Computer>SBS2008.domain.local</Computer>
    <Security UserID="S-1-5-19" />
  </System>
  <EventData>
    <Data Name="Length">224</Data>
    <Data Name="RawData">435045520102FFFFFFFF01000100000002000000E00000000E27110003090D140000000000000000000000000000000000000000000000000000000000000000BDC407CF89B7184EB3C41F732CB57131FE6FF5E89C91C54CBA8865ABE14913BB5548AC76CCA8CE01020000000000000000000000000000000000000000000000C8000000180000000102000001000000E85412E7B9C14049AB76909703A4320F0000000000000000000000000000000001000000000000000000000000000000000000000000000089C20201327E2020000423036FC104FF0000000000000000</Data>
  </EventData>
</Event>

There are no memory.dmp files or minidumps.

Look forward to reading your expert responses!
Asked by: isdd2000

N-W commented:
What hardware are you running here? HP, Dell, IBM, whitebox, etc.?

Depending on what the hardware is, you may be able to run diagnostics to confirm all of your hardware is functioning correctly (HDDs, RAM, CPUs, everything).

isdd2000 (Author) commented:
Hi N-W,

It's an IBM server!

Sanjay Santoki commented:
Hello,

I suggest going through the suggestions below; I am sure you will find a clue:


Update the BIOS and the drivers for the motherboard chipset.
Update all the hardware drivers, if updates are available from your manufacturer.
Check the temperature inside the computer to make sure your processor and related peripherals are not overheating.
Check the fan on your CPU to make sure it is properly attached to the CPU.
If you have overclocked your CPU, reset your settings to the defaults.
Make sure your power supply fan is working correctly.

Regards,
Sanjay Santoki

Mohammed Khawaja (Manager - Infrastructure: Information Technology) commented:
IBM servers also have built-in diagnostics which you can access on the server physically.  Run the diagnostics as I have a feeling this issue might be CPU or memory related.
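
One thing that can be checked without touching the hardware: the RawData field in the WHEA event above starts with the bytes for "CPER", so it appears to follow the Common Platform Error Record layout. The Python sketch below is only a rough illustration. It assumes the packed header layout documented for WHEA_ERROR_RECORD_HEADER (signature, revision, signature end, section count, severity, valid bits, length, timestamp) and assumes the timestamp bytes are plain binary rather than BCD. It decodes only the first 32 bytes of the record, and the function and variable names are made up for this example.

```python
import struct
from datetime import datetime

# First 32 bytes of the RawData value from the event (header fields up to the timestamp).
RAW_PREFIX_HEX = "435045520102FFFFFFFF01000100000002000000E00000000E27110003090D14"

# Severity codes as defined for CPER records (assumed to apply to this WHEA record).
SEVERITY = {0: "recoverable", 1: "fatal", 2: "corrected", 3: "informational"}


def parse_record_header(raw: bytes) -> dict:
    """Decode the leading fields of a CPER/WHEA error record header."""
    # Little-endian, no padding: 4s H I H I I I = 24 bytes.
    (signature, revision, signature_end,
     section_count, severity, valid_bits, length) = struct.unpack_from("<4sHIHIII", raw, 0)
    if signature != b"CPER" or signature_end != 0xFFFFFFFF:
        raise ValueError("not a CPER record header")

    # Timestamp: seconds, minutes, hours, flags, day, month, year, century (one byte each).
    sec, minute, hour, _flags, day, month, year, century = struct.unpack_from("<8B", raw, 24)
    # Assumption: values are binary, not BCD (0x0D would not be valid BCD).
    timestamp = datetime(century * 100 + year, month, day, hour, minute, sec)

    return {
        "revision": revision,
        "section_count": section_count,
        "severity": SEVERITY.get(severity, f"unknown ({severity})"),
        "valid_bits": valid_bits,
        "length": length,
        "timestamp": timestamp,
    }


header = parse_record_header(bytes.fromhex(RAW_PREFIX_HEX))
for name, value in header.items():
    print(f"{name}: {value}")
```

Under those assumptions the header reports a fatal severity, a record length of 224 bytes (matching the Length field in the event), and an error time of 17:39:14 UTC on 3 September 2013, shortly before the event was written to the log at 17:40:03 UTC. It does not say which component failed; that information would be in the section data after the header, which this sketch does not decode.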

isdd2000 (Author) commented:
Hi Guys,

I will go with your recommendation on performing hardware updates.

mnkhawaja,

How do I access these diagnostics?

David Atkin (Technical Director) commented:
Hello,

You can download the diagnostic tools from the IBM website.  

http://www-947.ibm.com/support/entry/portal/docdisplay?lndocid=tool-center

Type in your serial number if you know it.
(You can find it by typing the following into cmd:     wmic bios get serialnumber)
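
If you would rather script the lookup (for example, to collect serial numbers from several servers), the command above can be wrapped in a few lines of Python. This is just a sketch: it assumes `wmic` prints a header row followed by the value, the helper names are made up, and the sample output and serial number below are hypothetical, not taken from a real machine.

```python
import subprocess


def parse_wmic_value(output: str) -> str:
    """Return the first data row from `wmic ... get <property>` output."""
    # Drop blank lines and surrounding whitespace; row 0 is the column header.
    lines = [line.strip() for line in output.splitlines() if line.strip()]
    if len(lines) < 2:
        raise ValueError("no value found in wmic output")
    return lines[1]


def read_serial_number() -> str:
    """Run the command from the post and parse its output (Windows only)."""
    result = subprocess.run(
        ["wmic", "bios", "get", "serialnumber"],
        capture_output=True, text=True, check=True,
    )
    return parse_wmic_value(result.stdout)


# Hypothetical sample output, used here so the parser can be tried on any OS.
sample_output = "SerialNumber  \r\nKQ12345  \r\n\r\n"
serial = parse_wmic_value(sample_output)
print(serial)
```

On the server itself you would call `read_serial_number()` instead of parsing the sample text.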

Have you made any hardware changes recently? Any new USB drives, etc.?

isdd2000 (Author) commented:
Hi Scorpeo,

No hardware changes at all!

David Atkin (Technical Director) commented:
Have you found and installed the diagnostic tools?

Do you have warranty on the server?

isdd2000 (Author) commented:
Hi Scorpeo,

I could not find the right diagnostic tool. The server is about 6 months out of warranty.

David Atkin (Technical Director) commented:
What is the model number of your server?  I will see if I can find it for you.

Any more restarts since then?

Mohammed Khawaja (Manager - Infrastructure: Information Technology) commented:
IBM servers have a little panel (usually above the CD-ROM drive; on some models you have to remove the cover to reach it). The panel has LEDs and a test button that you press. Under the cover, there are instructions on how to interpret the results.

isdd2000 (Author) commented:
Hi Scorpeo,
It is an IBM System x3650. Strangely, it has not happened again.

mnkhawaja,

Does the server have to be off for this?

Mohammed Khawaja (Manager - Infrastructure: Information Technology) commented:
I don't believe so, but you should refer to the IBM documentation. I have only run it with the server off.