ESX 5.0 host disconnected from vCenter 5.0, troubleshooting and RCA
Posted on 2013-10-24
The other night a host disconnected from our vCenter server. I was able to RDP to the host and the 4 VMs below. It was rebooted via ILO and then became completely inaccessible. After pulling out and reseating the blade BL460c, I was able to reconnect the host from vCenter and the VMs again became accessible. Is there good CLI commands to do a root cause analysis? And/or anywhere other than ILO and the Tasks&Events tab to get troubleshooting information?
Rebooting the CIM service and enabling SSH on the host is being blamed and I think disconnection issue was hardware related. So far, I looked at right click and "report performance..." and can only see when the host was disconnected. Also, same thing in the tasks&events, I see when it lost connection "host is not responding" but that's it. The other spot I looked was the management log in the ILO (HP ILO2), and only found: POST Error: 1794-Drive Array - Array Accelerator Battery Charge Low. Date was after issue happened.