Solved

rehat server crash question

Posted on 2010-08-25
6
615 Views
Last Modified: 2013-11-25
Our server has been crashed and here is the log message from the message file.

kernel: pciehp: acpi_pciehprm:\_SB_.PCI0.EXB0 _HPP fail=0x5


Do  you guys know what is causing it?
0
Comment
Question by:mokkan
[X]
Welcome to Experts Exchange

Add your voice to the tech community where 5M+ people just like you are talking about what matters.

  • Help others & share knowledge
  • Earn cash & points
  • Learn & ask questions
6 Comments
 

Author Comment

by:mokkan
ID: 33525355
Also, server hung and we have rebooted it.
0
 
LVL 9

Accepted Solution

by:
jeremycrussell earned 167 total points
ID: 33525487
That's referring to PCI hot plugging, was there possibly some type of device being inserted/removed?

I guess its possible that its also related to ACPI functions of some sort.

If this is happening often, you may be able to disable/tweak ACPI or PCIe in your bios as a workaround.

0
 
LVL 4

Assisted Solution

by:abodette
abodette earned 167 total points
ID: 33525490
Pretty sure you have a bad PCI Express card.

pcihp is the PCI Express HotPlug

and it's giving you a failure code, I'm not sure for what device, but you can't have too many PCI express cards and there are likely related symptoms.

Did you happen to patch the OS recently? there are a few known issues with hotplug that may be remedied by patching to a newer kernel version.

Also look around that error in the message file to see if anything is coming up along with it.
0
What is SQL Server and how does it work?

The purpose of this paper is to provide you background on SQL Server. It’s your self-study guide for learning fundamentals. It includes both the history of SQL and its technical basics. Concepts and definitions will form the solid foundation of your future DBA expertise.

 

Author Comment

by:mokkan
ID: 33525736
Thank you  for the help. Here is message I got it from the message file.

Aug 25 10:16:45 nepeon kernel: pciehp: acpi_pciehprm:\_SB_.PCI0.EXB4 _HPP fail=0x5
Aug 25 10:16:45 nepeon kernel: pciehp: acpi_pciehprm:\_SB_.PCI0.EXB4 OSHP fails=0x5
Aug 25 10:16:45 nepeon kernel: pciehp: acpi_pciehprm:\_SB_.PCI0.EXB4 _HPP fail=0x5
Aug 25 10:16:45 nepeon kernel: pciehp: acpi_pciehprm:\_SB_.PCI0.EXB4 OSHP fails=0x5
Aug 25 10:16:45 nepeon kernel: pciehp: acpi_pciehprm:\_SB_.PCI0.EXB4 _HPP fail=0x5
Aug 25 10:16:45 nepeon kernel: pciehp: acpi_pciehprm:\_SB_.PCI0.EXB4 OSHP fails=0x5
Aug 25 10:16:45 nepeon kernel: pciehp: acpi_pciehprm:\_SB_.PCI0.EXB4 _HPP fail=0x5
Aug 25 10:16:45 nepeon kernel: pciehp: acpi_pciehprm:\_SB_.PCI0.EXB4 OSHP fails=0x5
Aug 25 10:16:45 nepeon kernel: pciehp: acpi_pciehprm:\_SB_.PCI0.EXB4 _HPP fail=0x5
Aug 25 10:16:45 nepeon kernel: pciehp: acpi_pciehprm:\_SB_.PCI0.EXB4 OSHP fails=0x5
Aug 25 10:16:45 nepeon kernel: pciehp: acpi_pciehprm:\_SB_.PCI0.EXB4 _HPP fail=0x5


lspci   |  grep   -i   express
00:13.0 PCI bridge: Broadcom HT2100 PCI-Express Bridge (rev a2)
00:14.0 PCI bridge: Broadcom HT2100 PCI-Express Bridge (rev a2)
00:15.0 PCI bridge: Broadcom HT2100 PCI-Express Bridge (rev a2)
00:16.0 PCI bridge: Broadcom HT2100 PCI-Express Bridge (rev a2)
00:17.0 PCI bridge: Broadcom HT2100 PCI-Express Bridge (rev a2)
40:13.0 PCI bridge: Broadcom HT2100 PCI-Express Bridge (rev a2)
40:14.0 PCI bridge: Broadcom HT2100 PCI-Express Bridge (rev a2)
40:15.0 PCI bridge: Broadcom HT2100 PCI-Express Bridge (rev a2)
40:16.0 PCI bridge: Broadcom HT2100 PCI-Express Bridge (rev a2)
40:17.0 PCI bridge: Broadcom HT2100 PCI-Express Bridge (rev a2)
c0:13.0 PCI bridge: Broadcom HT2100 PCI-Express Bridge (rev a2)
c0:14.0 PCI bridge: Broadcom HT2100 PCI-Express Bridge (rev a2)
c0:15.0 PCI bridge: Broadcom HT2100 PCI-Express Bridge (rev a2)
c0:16.0 PCI bridge: Broadcom HT2100 PCI-Express Bridge (rev a2)
c0:17.0 PCI bridge: Broadcom HT2100 PCI-Express Bridge (rev a2)



Any other files that I can check? how do I find our which pci express card is causing the issue?   Thanks in advance.
0
 

Author Comment

by:mokkan
ID: 33526841
Any help?
0
 
LVL 1

Assisted Solution

by:onaas
onaas earned 166 total points
ID: 33529486
Hi mokkan,

It's nice to know if you post more details like:

what rhel version do u have?
is it fresh installed server?
did you add any new hardware to the server?
what's the specs of your server?

more info would help us helping you!

-10x
0

Featured Post

Free Tool: Subnet Calculator

The subnet calculator helps you design networks by taking an IP address and network mask and returning information such as network, broadcast address, and host range.

One of a set of tools we're offering as a way of saying thank you for being a part of the community.

Question has a verified solution.

If you are experiencing a similar issue, please ask a related question

I am a long time windows user and for me it is normal to have spaces in directory and file names. Changing to Linux I found myself frustrated when I moved my windows data over to my new Linux computer. The problem occurs when at the command line.…
Little introduction about CP: CP is a command on linux that use to copy files and folder from one location to another location. Example usage of CP as follow: cp /myfoder /pathto/destination/folder/ cp abc.tar.gz /pathto/destination/folder/ab…
Learn how to find files with the shell using the find and locate commands. Use locate to find a needle in a haystack.: With locate, check if the file still exists.: Use find to get the actual location of the file.:
Get a first impression of how PRTG looks and learn how it works.   This video is a short introduction to PRTG, as an initial overview or as a quick start for new PRTG users.

740 members asked questions and received personalized solutions in the past 7 days.

Join the community of 500,000 technology professionals and ask your questions.

Join & Ask a Question