Potential Thermal Shutdown on HP Proliant DL380 G4

Last week we received an alert that the server would be shutting down due to a thermal condition.  After looking at the logs, event ID 1135 warning showing System Information Agent: Health: A Temperature Sensor Condition has been set to degraded.  The system may or may not shutdown depending on the state of the thermal degraded action value.  This indicated that the heat problem was with one of the CPUs.  Subsequent events said the server would shut down, but the next event stated that the sensor had cooled below the threshold and the server would not shutdown,  We did not experience a shutdown and the server has been operational.  Upon inspecting the server, the top LED on the front panel is Red and the overtemperature LED on the motherboard is amber.

I am looking for recommendations on what action we should take to prevent a potential future thermal/shutdown issue or should we consider this a false alert.  The server is running SBS 2008 and is approx. 9 years old.
Jeff EdelsteinAsked:
Who is Participating?

[Product update] Infrastructure Analysis Tool is now available with Business Accounts.Learn More

x
I wear a lot of hats...

"The solutions and answers provided on Experts Exchange have been extremely helpful to me over the last few years. I wear a lot of hats - Developer, Database Administrator, Help Desk, etc., so I know a lot of things but not a lot about one thing. Experts Exchange gives me answers from people who do know a lot about one thing, in a easy to use platform." -Todd S.

epichero22Commented:
I would leave the HWMonitor software running on the server, periodically check what the temperature is during certain periods of the day, and pay attention to what the max temps it's reporting are.  You can download it for free from here:

http://www.cpuid.com/softwares/hwmonitor.html

If the temps are consistently high, then you might consider the following:

*Dusting the computer.
*Removing the heatsink and re-applying thermal paste.
*Purchase an after-market CPU fan.
*Purchase additional case fans.
*Add an AC unit to where the server is located.

You don't have to do all of these, but try them in order and they should do the trick.
0

Experts Exchange Solution brought to you by

Your issues matter to us.

Facing a tech roadblock? Get the help and guidance you need from experienced professionals who care. Ask your question anytime, anywhere, with no hassle.

Start your 7-day free trial
rindiCommented:
If a server that old hasn't seen proper maintenance in it's many years of service it would certainly have accumulated tons of dust. Dust effectively blocks the airflow and so heat accumulates. So first open the server and clean out all the dust (like has been mentioned already). Also check all the fans that they are running smoothly. You should really do such elementary service on servers regularly, at least twice an year, or if you have a very clean environment, once an year. Besides, think of replacing the server, they don't last forever.
0
diperspCommented:
I'm with rindi - 9 years is asking for trouble.  Schedule replacement ASAP.  No sense spending time and money on a server that old.
0
It's more than this solution.Get answers and train to solve all your tech problems - anytime, anywhere.Try it for free Edge Out The Competitionfor your dream job with proven skills and certifications.Get started today Stand Outas the employee with proven skills.Start learning today for free Move Your Career Forwardwith certification training in the latest technologies.Start your trial today
Server Hardware

From novice to tech pro — start learning today.