Solved

Red Hat server crash question

Posted on 2010-08-25
Last Modified: 2013-11-25
Our server crashed, and here is the log message from the messages file.

kernel: pciehp: acpi_pciehprm:\_SB_.PCI0.EXB0 _HPP fail=0x5


Do you know what is causing it?
Question by:mokkan
6 Comments
 

Author Comment

by:mokkan
Also, the server hung and we had to reboot it.
 
LVL 9

Accepted Solution

by:
jeremycrussell earned 167 total points
That message refers to PCI hot plugging. Was some type of device possibly being inserted or removed?

I guess it's also possible that it's related to ACPI functions of some sort.

If this is happening often, you may be able to disable or tweak ACPI or PCIe settings in your BIOS as a workaround.

 
LVL 4

Assisted Solution

by:abodette
abodette earned 167 total points
Pretty sure you have a bad PCI Express card.

pciehp is the PCI Express hotplug driver,

and it's giving you a failure code. I'm not sure for which device, but you can't have that many PCI Express cards, and there are likely related symptoms.

Did you happen to patch the OS recently? There are a few known issues with hotplug that may be remedied by patching to a newer kernel version.

Also look around that error in the messages file to see if anything else is coming up along with it.
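To make "look around that error" concrete, here is a minimal Python sketch that prints the lines surrounding every pciehp entry. The helper names `lines_around` and `print_context` are made up for illustration, and the default path `/var/log/messages` is an assumption about where the messages file lives on this server.

```python
import re


def lines_around(lines, pattern, context=3):
    """Return (line_number, text) pairs within `context` lines of any match."""
    regex = re.compile(pattern)
    keep = set()
    for i, line in enumerate(lines):
        if regex.search(line):
            first = max(0, i - context)
            last = min(len(lines), i + context + 1)
            keep.update(range(first, last))
    # Line numbers are reported 1-based, as an editor would show them.
    return [(i + 1, lines[i]) for i in sorted(keep)]


def print_context(path="/var/log/messages", pattern=r"pciehp", context=3):
    """Print every line near a match; the path is an assumed default."""
    with open(path, errors="replace") as handle:
        log_lines = handle.read().splitlines()
    for number, text in lines_around(log_lines, pattern, context):
        print(f"{number}: {text}")
```

Calling `print_context()` as root should show whatever was logged just before and after each pciehp failure, which is where related symptoms would probably appear.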

 

Author Comment

by:mokkan
Thank you for the help. Here is the message I got from the messages file.

Aug 25 10:16:45 nepeon kernel: pciehp: acpi_pciehprm:\_SB_.PCI0.EXB4 _HPP fail=0x5
Aug 25 10:16:45 nepeon kernel: pciehp: acpi_pciehprm:\_SB_.PCI0.EXB4 OSHP fails=0x5
Aug 25 10:16:45 nepeon kernel: pciehp: acpi_pciehprm:\_SB_.PCI0.EXB4 _HPP fail=0x5
Aug 25 10:16:45 nepeon kernel: pciehp: acpi_pciehprm:\_SB_.PCI0.EXB4 OSHP fails=0x5
Aug 25 10:16:45 nepeon kernel: pciehp: acpi_pciehprm:\_SB_.PCI0.EXB4 _HPP fail=0x5
Aug 25 10:16:45 nepeon kernel: pciehp: acpi_pciehprm:\_SB_.PCI0.EXB4 OSHP fails=0x5
Aug 25 10:16:45 nepeon kernel: pciehp: acpi_pciehprm:\_SB_.PCI0.EXB4 _HPP fail=0x5
Aug 25 10:16:45 nepeon kernel: pciehp: acpi_pciehprm:\_SB_.PCI0.EXB4 OSHP fails=0x5
Aug 25 10:16:45 nepeon kernel: pciehp: acpi_pciehprm:\_SB_.PCI0.EXB4 _HPP fail=0x5
Aug 25 10:16:45 nepeon kernel: pciehp: acpi_pciehprm:\_SB_.PCI0.EXB4 OSHP fails=0x5
Aug 25 10:16:45 nepeon kernel: pciehp: acpi_pciehprm:\_SB_.PCI0.EXB4 _HPP fail=0x5


lspci | grep -i express
00:13.0 PCI bridge: Broadcom HT2100 PCI-Express Bridge (rev a2)
00:14.0 PCI bridge: Broadcom HT2100 PCI-Express Bridge (rev a2)
00:15.0 PCI bridge: Broadcom HT2100 PCI-Express Bridge (rev a2)
00:16.0 PCI bridge: Broadcom HT2100 PCI-Express Bridge (rev a2)
00:17.0 PCI bridge: Broadcom HT2100 PCI-Express Bridge (rev a2)
40:13.0 PCI bridge: Broadcom HT2100 PCI-Express Bridge (rev a2)
40:14.0 PCI bridge: Broadcom HT2100 PCI-Express Bridge (rev a2)
40:15.0 PCI bridge: Broadcom HT2100 PCI-Express Bridge (rev a2)
40:16.0 PCI bridge: Broadcom HT2100 PCI-Express Bridge (rev a2)
40:17.0 PCI bridge: Broadcom HT2100 PCI-Express Bridge (rev a2)
c0:13.0 PCI bridge: Broadcom HT2100 PCI-Express Bridge (rev a2)
c0:14.0 PCI bridge: Broadcom HT2100 PCI-Express Bridge (rev a2)
c0:15.0 PCI bridge: Broadcom HT2100 PCI-Express Bridge (rev a2)
c0:16.0 PCI bridge: Broadcom HT2100 PCI-Express Bridge (rev a2)
c0:17.0 PCI bridge: Broadcom HT2100 PCI-Express Bridge (rev a2)



Are there any other files that I can check? How do I find out which PCI Express card is causing the issue? Thanks in advance.
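One way to narrow this down is to tally the failures by ACPI path (for example `\_SB_.PCI0.EXB0` versus `\_SB_.PCI0.EXB4`), since both appear in this thread. The sketch below assumes the log lines keep the exact format quoted above; `count_failures` is a hypothetical helper name. It only reports which ACPI object the driver complained about and does not map that object to a physical slot or card.

```python
import re
from collections import Counter

# Matches lines such as:
#   kernel: pciehp: acpi_pciehprm:\_SB_.PCI0.EXB4 _HPP fail=0x5
#   kernel: pciehp: acpi_pciehprm:\_SB_.PCI0.EXB4 OSHP fails=0x5
FAIL_RE = re.compile(
    r"pciehp: acpi_pciehprm:(\S+)\s+(_HPP|OSHP)\s+fails?=(0x[0-9a-fA-F]+)"
)


def count_failures(lines):
    """Count pciehp failures keyed by (ACPI path, method, failure code)."""
    counts = Counter()
    for line in lines:
        match = FAIL_RE.search(line)
        if match:
            counts[(match.group(1), match.group(2), match.group(3))] += 1
    return counts
```

Feeding it the lines of the messages file and printing `count_failures(lines).most_common()` would show whether one bridge dominates or whether every bridge reports the same failure, which may point at firmware or the driver rather than a single card.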
 

Author Comment

by:mokkan
Any help?
 
LVL 1

Assisted Solution

by:onaas
onaas earned 166 total points
Hi mokkan,

It would help if you posted more details, such as:

What RHEL version do you have?
Is it a freshly installed server?
Did you add any new hardware to the server?
What are the specs of your server?

More info would help us help you!

-10x