• Status: Solved
  • Priority: Medium
  • Security: Public
  • Views: 760
  • Last Modified:

Potential Thermal Shutdown on HP Proliant DL380 G4

Last week we received an alert that the server would be shutting down due to a thermal condition.  After looking at the logs, event ID 1135 warning showing System Information Agent: Health: A Temperature Sensor Condition has been set to degraded.  The system may or may not shutdown depending on the state of the thermal degraded action value.  This indicated that the heat problem was with one of the CPUs.  Subsequent events said the server would shut down, but the next event stated that the sensor had cooled below the threshold and the server would not shutdown,  We did not experience a shutdown and the server has been operational.  Upon inspecting the server, the top LED on the front panel is Red and the overtemperature LED on the motherboard is amber.

I am looking for recommendations on what action we should take to prevent a potential future thermal/shutdown issue or should we consider this a false alert.  The server is running SBS 2008 and is approx. 9 years old.
0
Jeff Edelstein
Asked:
Jeff Edelstein
1 Solution
 
epichero22Commented:
I would leave the HWMonitor software running on the server, periodically check what the temperature is during certain periods of the day, and pay attention to what the max temps it's reporting are.  You can download it for free from here:

http://www.cpuid.com/softwares/hwmonitor.html

If the temps are consistently high, then you might consider the following:

*Dusting the computer.
*Removing the heatsink and re-applying thermal paste.
*Purchase an after-market CPU fan.
*Purchase additional case fans.
*Add an AC unit to where the server is located.

You don't have to do all of these, but try them in order and they should do the trick.
0
 
rindiCommented:
If a server that old hasn't seen proper maintenance in it's many years of service it would certainly have accumulated tons of dust. Dust effectively blocks the airflow and so heat accumulates. So first open the server and clean out all the dust (like has been mentioned already). Also check all the fans that they are running smoothly. You should really do such elementary service on servers regularly, at least twice an year, or if you have a very clean environment, once an year. Besides, think of replacing the server, they don't last forever.
0
 
diperspCommented:
I'm with rindi - 9 years is asking for trouble.  Schedule replacement ASAP.  No sense spending time and money on a server that old.
0
Question has a verified solution.

Are you are experiencing a similar issue? Get a personalized answer when you ask a related question.

Have a better answer? Share it in a comment.

Join & Write a Comment

Featured Post

Network Scalability - Handle Complex Environments

Monitor your entire network from a single platform. Free 30 Day Trial Now!

Tackle projects and never again get stuck behind a technical roadblock.
Join Now