Solved

HP ProLiant ML350 G3 - SBS2000 freezing - Blue Screen Trap

Posted on 2007-04-02
17
3,887 Views
Last Modified: 2013-12-01
Hi All,

We have a server that locks ups and then reboots at random times regardless of time and load, it was every 6 to 7 days but then it when again after 2 days the most recent. (7 times now)

It is a HP ProLiant ML350 G3 with 2000 SBS SP4, running the HP System Managemet v2.1.6.156 we have also applied the latest firmware and HP drivers "After" the lockups and made no difference.

Nothing is reported in the Windows event logs,

We also turned off Windows auto re-boot on crash, unfortunately HP have a function built into the ROM and their software that will reset the system after 10 minutes as I wanted to leave it on a blue screen for error codes and I can’t disable it.

The only error I get is below (ignore the ASR, that is the rom rebooting the server after 10mins)

ASR Detected by System ROM

Blue Screen Trap (BugCheck, STOP: 0x000000D1 (0xDEADFB06, 0x00000002, 0x00000000, 0xF234B51E))

The blue screen trap is always the same message and Google brings back a blank.

The only software installed prior (I believe) was powerchute 7.0.5 then the first reboot was 2 days later, I will take this off just incase while I try and find the solution.

Any more questions please ask,
0
Comment
Question by:nmxsupport
  • 6
  • 4
  • 2
  • +2
17 Comments
 

Expert Comment

by:Philip
Comment Utility
Hmmm.......
And the event viewer doesn't provide any more information?
You also mention powerchute which implies a UPS. You don't by chance have a printer plugged into the UPS. Powerchute logs?
Can I ask are you using genuine ram. I only use HP for my servers and only with genuine ram and am yet to have one ever fail. If I had some spare ram available I would consider at least a temporary swap and test.
Good Luck!
Philip

PS. Cant hurt to run error checking on the hard drives too
0
 
LVL 21

Expert Comment

by:dan_blagut
Comment Utility
Hi
This can be a hardware thing (overheating processor, faulty disk system)... I think of that because you said that now the BSoD apear often...

Dan
0
 
LVL 2

Accepted Solution

by:
abissa earned 500 total points
Comment Utility
Do you have a USB device connected ? If yes, have a look at this one:

http://support.microsoft.com/kb/888825

Hope this helps...
0
 

Author Comment

by:nmxsupport
Comment Utility
Hi All,

Iinteresting regarding USB and followed the link, we were also getting "Communication Lost on Agent" from the UPS but at again different times to the reboot, I was hoping to send a serial cable across but they use the only comm port for the fax.

They have 2 APC UPS's one for the 2000 SBS (Rebooting) and the other 2000 SQL which is fine, nothing else is attached and no other USB devices are in use,

Again following the link above I'll try and read the mem dump if it's there.
0
 

Author Comment

by:nmxsupport
Comment Utility
Hi All,

This is the results that link to abissa's link,

*** ERROR: Module load completed but symbols could not be loaded for adpu160m.sys
*******************************************************************************
*                                                                             *
*                        Bugcheck Analysis                                    *
*                                                                             *
*******************************************************************************
 
Use !analyze -v to get detailed debugging information.
 
BugCheck D1, {deadfb06, 2, 0, f234b51e}
 
***** Kernel symbols are WRONG. Please fix symbols to do analysis.
 
***** Kernel symbols are WRONG. Please fix symbols to do analysis.
 
*** ERROR: Module load completed but symbols could not be loaded for openhci.sys
Probably caused by : openhci.sys ( openhci+351e )
 
Followup: MachineOwner
---------

Alot on the net regarding APC and this "openhci.sys " file hopefully getting there,
0
 
LVL 21

Expert Comment

by:dan_blagut
Comment Utility
0
 
LVL 11

Expert Comment

by:Zenith63
Comment Utility
I have a 2003 SBS server on a HP DL380G5 plugged into a 3 year old APC 750 UPS via USB cable doing a very similar thing.  It is just restarting randomly.  Going out tomorrow to pull the UPS out to see if it's the problem, pretty much ruled everything else out by replacing the entire server!

You can turn off ASR - if you go into the HP Management site, then into Tasks the option is there to disable it.  You should get more info from the BSoD then.
0
What Should I Do With This Threat Intelligence?

Are you wondering if you actually need threat intelligence? The answer is yes. We explain the basics for creating useful threat intelligence.

 

Author Comment

by:nmxsupport
Comment Utility
Hi All, just an update after removing the APC software and unplugging the usb - its been up for 8 days, would like to leave it for another 7 days to ensure this was the cause.
0
 

Author Comment

by:nmxsupport
Comment Utility
It now been running for 16 days with no blue screen, so I'm happy the APC software is the cause or the MS USB driver - I'll try and contact APC is see if they have a fix, unless anyone else has a cure? I can't use a serial cable because the only com port is being used by the fax.
0
 
LVL 11

Expert Comment

by:Zenith63
Comment Utility
We took the server out of the UPS and disconnected the USB as well about 3 weeks back, like you we haven't had a problem since!  Let me know what you get out of APC, I don't think it's too impressive that their "monitoring" software isn't capable of realising it's dropping power to connected devices!  Or maybe the USB connection is the issue...
0
 

Author Comment

by:nmxsupport
Comment Utility
From what I can gather its only the USB connection thats effected, if you use serial you're fine, we can't because of the modem - someone said that it's a special serial cable, worth a shot

I'll keep the thread updated,
0
 
LVL 11

Expert Comment

by:Zenith63
Comment Utility
I might put the UPS back in and leave the signal cable out (I have a modem on the server as well) and see what happens.
0
 
LVL 11

Expert Comment

by:Zenith63
Comment Utility
Just for future readers, the problems with the server I was having issues with were firstly a Microsoft storport driver not being properly compatible with one of HP's drivers.  You should download the latest version from the MS site, even servers with Windows 2003 Service Pack 2 require the update.  We have had 2 completely different model HP G5 servers restarting randomly like this, in both cases the Storport driver sorted it out straight away.  The other problem with my server in question was a dodgy CPU which has now gone back to HP.
0
 

Author Comment

by:nmxsupport
Comment Utility
Hi All, sorry for not updating this, I'll look at Zenith63's solution, as speaking at length to APC who were really good, it links to this driver and we decided to swap our modem to usb and use a serial cable just because it was the easy option in the end.
0

Featured Post

Top 6 Sources for Identifying Threat Actor TTPs

Understanding your enemy is essential. These six sources will help you identify the most popular threat actor tactics, techniques, and procedures (TTPs).

Join & Write a Comment

You may have discovered the 'Compatibility View Settings' workaround for making your SBS 2008 Remote Web Workplace 'connect to a computer' section stops 'working around' after a Windows 10 client upgrade.  That can be fixed so it 'works around' agai…
Today, still in the boom of Apple, PC's and products, nearly 50% of the computer users use Windows as graphical operating systems. If you are among those users who love windows, but are grappling to keep the system's hard drive optimized, then you s…
In this seventh video of the Xpdf series, we discuss and demonstrate the PDFfonts utility, which lists all the fonts used in a PDF file. It does this via a command line interface, making it suitable for use in programs, scripts, batch files — any pl…
This tutorial demonstrates a quick way of adding group price to multiple Magento products.

763 members asked questions and received personalized solutions in the past 7 days.

Join the community of 500,000 technology professionals and ask your questions.

Join & Ask a Question

Need Help in Real-Time?

Connect with top rated Experts

11 Experts available now in Live!

Get 1:1 Help Now