Solved

ESXi Virtual Machine hangs

Posted on 2014-01-21
18
2,434 Views
Last Modified: 2016-10-27
hi,

i am running ESXi 5.1.0 with several virtual machines. windows, Linux, etc....

now since few weeks i have Always one virtual machine running on Linux that is not accessible anymore.

also when i give the command to reset the virtual machine in esx i receive an error the the virtual machine is not responding.

then i need to restart the whole esx host so that the virtual machine is turned off.

then i can start again the virtual machine.

sometimes a few hours, sometimes a couple of days later the same problem.

any ideas?
0
Comment
Question by:Rik Van Lier
[X]
Welcome to Experts Exchange

Add your voice to the tech community where 5M+ people just like you are talking about what matters.

  • Help others & share knowledge
  • Earn cash & points
  • Learn & ask questions
  • 8
  • 5
  • 2
  • +2
18 Comments
 
LVL 17

Expert Comment

by:James H
ID: 39798219
What version of Linux?
Did you try uninstalling and reinstalling VMWare tools?
0
 
LVL 121
ID: 39798317
is your host  server certified for running ESXi 5.x? see HCL

i think we need to look at the logs for the VM , these can be found in the Virtual Folder called vmware.log

also at time of crash we need /var/logs/vmkernel.log as well

what is the VM role?
0
 
LVL 48

Expert Comment

by:Tintin
ID: 39798384
I've had instances of CentOS hosts not being accessible due to kernel audit log issues.

The box was still pingable, but effectively frozen.   It wouldn't respond to a restart as that relies on being able to communicate with VMware Tools, so doing a power off/on worked.
0
Comprehensive Backup Solutions for Microsoft

Acronis protects the complete Microsoft technology stack: Windows Server, Windows PC, laptop and Surface data; Microsoft business applications; Microsoft Hyper-V; Azure VMs; Microsoft Windows Server 2016; Microsoft Exchange 2016 and SQL Server 2016.

 
LVL 1

Author Comment

by:Rik Van Lier
ID: 39799180
reinstalled VMWare tools yesterday. last night. server hangs... no respond anymore.

in ESX i see VMWare tools still running. I try to reset. Progress bar goes to 95% and hangs there...

after few minutes i got error: A general system error occurred: Unknow error
0
 
LVL 1

Author Comment

by:Rik Van Lier
ID: 39799182
client virtual machine is running Ubuntu Server 12.

don't think it is a client problem as i cannot reset the client. could be wrong but must this not be a esx problem?
0
 
LVL 48

Expert Comment

by:Tintin
ID: 39799188
Can you still ping the virtual host when it hangs?

Can you connect to the vhost console?
0
 
LVL 1

Author Comment

by:Rik Van Lier
ID: 39799260
No ping does not work and connecting to the vhost console also does not work.
0
 
LVL 1

Author Comment

by:Rik Van Lier
ID: 39802177
very strange. this morning another machine was hanging. same problem.

i am using Acronis vmProtect 9 to backup all my vhosts.

can this be a part of the problem?
0
 
LVL 121
ID: 39802367
Please see my questions.

Check for Snapshots?

What disk storage are you using?
0
 
LVL 1

Author Comment

by:Rik Van Lier
ID: 39802406
The server is a HP DL360 G6 and supported for ESX. In fact this server is running for several years without problems.

I have internal storage in that server and external storage on a D2500 from HP.

the failing vhost is running on internal storage.

the machine has snaphots.
0
 
LVL 121
ID: 39802494
I would investigate why your virtual machine has snapshots and attend to them.

If unsure how to deal with Snapshots on VMs, please see my EE Article

HOW TO: VMware Snapshots :- Be Patient
0
 
LVL 20

Expert Comment

by:compdigit44
ID: 39805223
You can do a test by omitting one of your VM's which has been hanging from your backup for one night and see if the server is still responsive in the mornng.
0
 
LVL 20

Expert Comment

by:compdigit44
ID: 39805227
On another note, you mention that you are using local storage have you upgraded the firmware on your server recently? If not, you may want to look into this.
0
 
LVL 1

Author Comment

by:Rik Van Lier
ID: 39823376
OK here is an update. When i started this question i had only 1 vhost with this problem.

few days ago i had 2 vhosts with that problem. Machines hangs sometimes 2x each day.

i phoned with HP and the also adviced me to update the firmware of the whole server and do a bios update. I did all that. But with no solution.

Still the same problem. One VM is running Linux and the other is running Windows.

i created a backup of the windows machine inside windows. then created a new vm and restored that backup. since then no problems anymore with the Windows machine.

i tought lets do the same for the Linux machine. as i have no software to backup the full Linux machine i created a clone using vsphere. This did go well and the new machine boots up fine.

but! the problem on that new Linux machine is still the same. Also here same problems.

i found out that i can stop the machine using the kill process with the cli interface. but this is no solution as the vm goes down 2 times a day...

any other bright ideas? would it help to boot the vm up with a acronis backup dvd and create a backup and then restore this to a new vm? so i wont use the clone option?

good luck!
0
 
LVL 121
ID: 39823477
HP always tell you to do a BIOS and Firmware update!

I would look at hardware, check memory is seated correctly, heatsinks and fans are correctly fixed.

If the VMs are crashing and hanging, I would check the vmware.logs and also the logs for the host in /var/logs
0
 
LVL 1

Author Comment

by:Rik Van Lier
ID: 39823566
Andrew, can this be a hardware problem if not all vm's are hanging? i have more then 30 vm's running and only 1 still has problems.
0
 
LVL 121

Accepted Solution

by:
Andrew Hancock (VMware vExpert / EE MVE^2) earned 500 total points
ID: 39823594
Yes, it can, we've seen issues where the fault lies in a high memory module, e.g. 32Gb+

and it was only when all the memory was starting to be used in the server a VM would crash.

Only way to prove this case was to shutdown ALL VMs, and just start this server individually, or move to a new host.
0
 
LVL 1

Author Comment

by:Rik Van Lier
ID: 39823602
Andrew, yes i understand this and i will believe that this is possible. but it is strange that it would only happen to that same machine. and no other.

but i have an other esx running. i would move it to a new host and see what happens.

keep you posted!
0

Featured Post

U.S. Department of Agriculture and Acronis Access

With the new era of mobile computing, smartphones and tablets, wireless communications and cloud services, the USDA sought to take advantage of a mobilized workforce and the blurring lines between personal and corporate computing resources.

Question has a verified solution.

If you are experiencing a similar issue, please ask a related question

In the first part of this tutorial we will cover the prerequisites for installing SQL Server vNext on Linux.
Here's a look at newsworthy articles and community happenings during the last month.
Teach the user how to install and configure the vCenter Orchestrator virtual appliance Open vSphere Web Client: Deploy vCenter Orchestrator virtual appliance OVA file: Verify vCenter Orchestrator virtual appliance boots successfully: Connect to the …
This video shows you how to use a vSphere client to connect to your ESX host as the root user. Demonstrates the basic connection of bypassing certification set up. Demonstrates how to access the traditional view to begin managing your virtual mac…

696 members asked questions and received personalized solutions in the past 7 days.

Join the community of 500,000 technology professionals and ask your questions.

Join & Ask a Question