Link to home
Start Free TrialLog in
Avatar of snowdog_2112
snowdog_2112Flag for United States of America

asked on

URGENT! ibm x3650 m3 - vmware hardware alarm: Bus Uncorrectable error

Hardware: x3650 m3 (7945ac1)
ESXi: 5.1.0 799733 (ibm-specific build)

The host just rebooted and now shows hardware alarms (after the reboot):
Group 2 PCIs: Bus Uncorrectable error

(I don't have host logs prior to reboot because the tech who installed esxi 5.1 had not yet set the syslog to persistent storage...)

I can't find much info on the alert - is it a concern?  I've tried "Reset Sensors" and Refresh, and the alerts are still present.
Avatar of snowdog_2112
snowdog_2112
Flag of United States of America image

ASKER

more info - the IMM event log shows the following at the time of the reboot:

02/18/2013; 15:05:01	0x816f03131701ffff	System "SN# xxxx" has recovered from an NMI
02/18/2013; 15:03:46	0x806f002125820900	Fault in slot "All PCI Error" on system "SN# xxxx"
02/18/2013; 15:03:46	0x806f002130010901	Fault in slot "PCI 1" on system "SN# xxxx"
02/18/2013; 15:03:40	0x806f08132582ffff	A Uncorrectable Bus Error has occurred on system "SN# xxxx"
02/18/2013; 15:03:40	0x806f03131701ffff	A software NMI has occurred on system "SN# xxxx"

Open in new window

ASKER CERTIFIED SOLUTION
Avatar of Andrew Hancock (VMware vExpert PRO / EE Fellow/British Beekeeper)
Andrew Hancock (VMware vExpert PRO / EE Fellow/British Beekeeper)
Flag of United Kingdom of Great Britain and Northern Ireland image

Link to home
membership
This solution is only available to members.
To access this solution, you must be a member of Experts Exchange.
Start Free Trial
lightpath also indicates a PCI fault.

VLP from IMM -
Fault: orange
PCI: orange
PCI1: orange  <-- this must be the slot?
(pci2 - 4): off
Also consider the following: http://www-947.ibm.com/support/entry/portal/docdisplay?lndocid=MIGR-5084146

Basically a NIC reseat, as described in accepted solution, however parts are available from IBM to avoid a repeat.