Solved

HP Proliant DL580 G2 with Windows Server 2003 Enterprise Edition SP2 reboots periodically

Posted on 2008-10-01
6
1,103 Views
Last Modified: 2010-04-21
I have a Proliant DL580 G2 that keeps going into ASR shutdowns.  I've dealt with Microsoft Premier Support and they state it's not their issue as I can't get the server to generate a memory dump when it locks up.  As per their line of reasoning, the issue is hardware-related.  The SmartStart diagnostics reveal no issues on any hardware in the server, and I ran the full test.  All drivers/firmware are current as well.  In the process of attempting to fix this issue, I've replaced everything in the server save for the SCSI backplane, array controller and the power supplies.  The server's temperature is stable and not elevated at this point, nor is it anywhere close to being elevated.  Additionally, the WMI Performance Adapter service, which is stopped and the startup type is set to manual, keeps stopping and restarting.  The server in question is a DR box; the production server has identical hardware to the problem box but doesn't exhibit the same problems.  Both servers have VMWare GSX installed on them and run two VMWare guest server sessions apiece.  I'm also looking into what could be causing the WMI Performance Adapter service to keep stopping and starting, but the production (working) server has the same symptoms but it doesn't lock up.  Last, I've updated the symevent.sys file as the server has Symantec Antivirus installed.  If anyone could offer some insight as to what I could try next, I'd appreciate it.
0
Comment
Question by:jgerstner74
[X]
Welcome to Experts Exchange

Add your voice to the tech community where 5M+ people just like you are talking about what matters.

  • Help others & share knowledge
  • Earn cash & points
  • Learn & ask questions
6 Comments
 
LVL 7

Accepted Solution

by:
BogdanSUA earned 84 total points
ID: 22619293
Before you try anything, verify that the BIOS and firmware levels on all your components are good.

During a scheduled maintenance window, shutdown both boxes.

Remove the disks from each box (but label them and notate their position).
Take the disk from your DR box and pop it into your production box.  
See if it reboots, crashes, etc.
If so, you have a software related problem.
If not, it could be a hardware related problem.

If the above didn't (and it shouldn't) cause any problems when booting up, pop in the drives from your production server into your DR box.

Does the problem reoccur?  
If so, you have a hardware problem.  Maybe a bad power supply....do you have dual power supplies?  Maybe its the juice coming from the wall.....Is at least one of them plugged into a UPS?
0
 

Author Comment

by:jgerstner74
ID: 22619373
The servers are approximately 300 miles apart from each other in different data centers, so I don't think I can try swapping the hard drives.  I apologize for not mentioning this previously.  The BIOS as well as all drivers and firmware are current on the problem server.  I will look into replacing the power supplies as your suggestion is in line with suggestions I've found elsewhere as well.  Thank you for the suggestion.
0
 
LVL 7

Expert Comment

by:BogdanSUA
ID: 22619687
No problem, but you still have some other options while you wait for the power supplies.  

Download the memtest.iso, burn it to a CD (or use ILO if you've got it) and let it go to town on your hardware.  If it reboots/crashes, then you've got a HW problem.

BTW, do you have a UPS connected to the server?  It could be that the power supply(s) are fine, but the electricity dips down for a split second to unacceptable voltage levels.  Do you have any other equipment in the room connected to the same source (power strip/PDU) that might tell you if it rebooted.

LOL - Some humor - Dust off that VCR, set it's time, plug it in, and see if it starts blinking 12:00.  :)
0
Simplifying Server Workload Migrations

This use case outlines the migration challenges that organizations face and how the Acronis AnyData Engine supports physical-to-physical (P2P), physical-to-virtual (P2V), virtual to physical (V2P), and cross-virtual (V2V) migration scenarios to address these challenges.

 
LVL 3

Assisted Solution

by:omic_admin
omic_admin earned 83 total points
ID: 22619766
check the UPS and make sure that it is also conditioning the lines. I've seen issues with equipment rebooting or shutting down periodically when the power is not clean.
0
 
LVL 8

Assisted Solution

by:markzz
markzz earned 83 total points
ID: 22645279
Are you sure this is an ASR. As in HP ASR??
If so it will log info into the hardware logs indicating what has happened.
You can view this via https://hostname:2381/
You can also set your ASR timeout in BIOS to increase or decrease the time out limit. A true ASR will occur if the hardware doesn't revieve a poll from the software driver.
The exception to this is if you have a CPU issue with CPU1 (or 0 depending on how you count them) but the first CPU as this CPU is used by the system for monitoring etc..
0
 

Author Closing Comment

by:jgerstner74
ID: 31502143
After checking with the asset group at my organization, we've decided t replace the server with a new one.  Nonetheless, thank you all for your input.  I've tried to split the points as evenly as I can, but as 250/3=83.33 and I can't award points in decimal points, I gave the extra point to BogdanUSA.  
0

Featured Post

Is Your DevOps Pipeline Leaking?

Is your CI/CD pipeline a hodge-podge of randomly connected tools? You’ve likely got a tool to fix one problem & then a different tool to fix another, resulting in a cluster of tools with overlapping functionality. Learn how to optimize your pipeline with Gartner's recommendations

Question has a verified solution.

If you are experiencing a similar issue, please ask a related question

In this step by step tutorial with screenshots, we will show you HOW TO: Enable SSH Remote Access on a VMware vSphere Hypervisor 6.5 (ESXi 6.5). This is important if you need to enable SSH remote access for additional troubleshooting of the ESXi hos…
This article provides a convenient collection of links to Microsoft provided Security Patches for operating systems that have reached their End of Life support cycle. Included operating systems covered by this article are Windows XP,  Windows Server…
Teach the user how to configure vSphere Replication and how to protect and recover VMs Open vSphere Web Client: Verify vsphere Replication is enabled: Enable vSphere Replication for a virtual machine: Verify replicated VM is created: Recover replica…
This Micro Tutorial steps you through the configuration steps to configure your ESXi host Management Network settings and test the management network, ensure the host is recognized by the DNS Server, configure a new password, and the troubleshooting…

724 members asked questions and received personalized solutions in the past 7 days.

Join the community of 500,000 technology professionals and ask your questions.

Join & Ask a Question