vSphere HA virtual machine failover failed?

Hi,

I've upgraded my vCenter appliance to 5.5 and now the VMs of one of my cluster servers indicate this error message ... the VMs on the other two cluster servers are just fine. Any ideas what this is about and how to resolve it?

The actual virtual servers run fine though, they're accessible, etc. It's just that there is this error message in the vCenter client.

Thank you!
XeronimoAsked:
Who is Participating?
I wear a lot of hats...

"The solutions and answers provided on Experts Exchange have been extremely helpful to me over the last few years. I wear a lot of hats - Developer, Database Administrator, Help Desk, etc., so I know a lot of things but not a lot about one thing. Experts Exchange gives me answers from people who do know a lot about one thing, in a easy to use platform." -Todd S.

Andrew Hancock (VMware vExpert / EE MVE^2)VMware and Virtualization ConsultantCommented:
This is sometimes - normal. (bug!)

Can you check if all VMs are started on all your Hosts.
0
dipopoCommented:
Whats the HA status on the host housing the vm?
0
XeronimoAuthor Commented:
hanccocka: all the VMs are started!

dipopo: the HA status is 'running (master)'
0
10 Tips to Protect Your Business from Ransomware

Did you know that ransomware is the most widespread, destructive malware in the world today? It accounts for 39% of all security breaches, with ransomware gangsters projected to make $11.5B in profits from online extortion by 2019.

Andrew Hancock (VMware vExpert / EE MVE^2)VMware and Virtualization ConsultantCommented:
Yes, this error message occurs on all versions of VMware HA.

e.g. 4.1, 5.0, 5.1 and 5.5

Reconfigure each Host for HA - Reconfigure HA, or Disable Cluster and Re-enable. (eg. Edit Settings on Cluster, and remove Tick!)
0
XeronimoAuthor Commented:
I've already reconfigured the HA on all the hosts, that didn't remove the error messages. I'll try to disable and reenable the cluster then ...
0
XeronimoAuthor Commented:
Ok, so now I've disable and re-enabled the cluster.

The error messages on the VMs from that one host are still there though ... is that normal? Should I simply clear and acknowledge them then?
0
Andrew Hancock (VMware vExpert / EE MVE^2)VMware and Virtualization ConsultantCommented:
Yes, it's normal.

You have checked HA works for you?
0
XeronimoAuthor Commented:
Checked? You mean by shutting down one host to see if the others restart the VMs? No, not yet ... I just didn't get any error messages while disabling and re-enabling the cluster ... is that not enough to be sure? ;)
0
Andrew Hancock (VMware vExpert / EE MVE^2)VMware and Virtualization ConsultantCommented:
Okay, toggl-ing On and Off the cluster, does get rid of the error.

but you should really test VMware HA.

Also shutting down a Host does not test VMware HA.

Un-controlled Shutdown, e.g. Host failure, remove power is!
0

Experts Exchange Solution brought to you by

Your issues matter to us.

Facing a tech roadblock? Get the help and guidance you need from experienced professionals who care. Ask your question anytime, anywhere, with no hassle.

Start your 7-day free trial
XeronimoAuthor Commented:
Yes, remove power is what I meant ... but I can't test that right now ... those servers would be offline for a couple of minutes ...

If with toggling on/off you mean enabling/re-enabling the cluster option: I've done that and the servers still have a red flag next to them?
0
dipopoCommented:
Could you check to see if an attempt to vmotion a vm from the host throws up errors?
0
XeronimoAuthor Commented:
I've migrated one of the VMs to a host where no VMs have this HA failure message ... this VM keeps displaying that error though, even on the new machine! I guess the error message is not a 'live' one, it's simply still displayed because of an earlier error?

I think I'll move all my VMs, except for a test VM, from the problematic host to a different host and then power off the first one and restart it. And then let's see what it says ...
0
Andrew Hancock (VMware vExpert / EE MVE^2)VMware and Virtualization ConsultantCommented:
If testing VMware HA - just remove the power to host.
0
XeronimoAuthor Commented:
I'm, obviously, new to this cluster thing but just to be sure: since the VMs are stored on shared storage, will the other servers in the cluster immediately take over the virtual machines from the 'failed' host? Or do the VMs initially crash and the other servers will onlyx then immediately restart them? The former would be preferable though and make more sense ...

But what if the HA does not work on my problematic server then its VMs will surely crash, right? That's why I thought to migrate my VMs for to a different host and then test the HA with one test VM running on it.
0
dipopoCommented:
Yes the latter holods true

 VMs initially crash and the other servers will only then immediately restart them? - 1-5 minutes.
0
XeronimoAuthor Commented:
Ok, so I've tested it and the HA works! The test VM got restarted on a different host.

Also, all the VMs indicating that HA failure message have "vSphere HA Protection: Protected'.

So I guess I can move those VMs back to that server and simply clear the alarms?
0
Andrew Hancock (VMware vExpert / EE MVE^2)VMware and Virtualization ConsultantCommented:
Correct. Well Done, Most People do not test!
0
XeronimoAuthor Commented:
Ok, and thank you for your help and also your compliment! :)
0
Andrew Hancock (VMware vExpert / EE MVE^2)VMware and Virtualization ConsultantCommented:
No problems!
0
It's more than this solution.Get answers and train to solve all your tech problems - anytime, anywhere.Try it for free Edge Out The Competitionfor your dream job with proven skills and certifications.Get started today Stand Outas the employee with proven skills.Start learning today for free Move Your Career Forwardwith certification training in the latest technologies.Start your trial today
VMware

From novice to tech pro — start learning today.