Solved

HA clusters/how do they know when to kick in

Posted on 2013-11-25
1
454 Views
Last Modified: 2013-11-29
can I ask for vmware HA, how do other hosts joined to a cluster know when one of the other hosts has died? And what formula comes into play to determine which of the other hosts takes over the guest machines, if say you have 5 hosts joined to a cluster, how do you know which of the other 4 would kick in were 1 host to die/fail. Is there anything you need to "do"  or "configure" on the other hosts to ensure they are aware when a host fails/would be made aware if another host fais?
0
Comment
Question by:pma111
1 Comment
 
LVL 118

Accepted Solution

by:
Andrew Hancock (VMware vExpert / EE MVE) earned 500 total points
ID: 39674918
At least two ESXi Host Servers, added to a Cluster with VMware HA (High Availability) enabled. HA Agents are installed on both ESXi servers, and vCenter Server is used to configure VMware HA, but does not take part or control the HA Agents - vCenters role is only to configure VMware.

One server will be the Master, and the other will be the slave, this can be seen in the Host Summary.

VMs hosted on both ESXi Host Servers, become vSphere HA Protecti-ed - and there should be a green tick, which states Protected. This can be confirmed for the VM, under VM Summary.

The HA Agents on the hosts...

vSphere HA State - Master - A server which is elected as the master. This agent monitors the VMs on this server, and other operational Hosts, and it WILL attempt to restart VMs on failure.

vSphere HA State - Slave - This server is connected to the Master Agent, via the Management Network. The vSphere HA Protected VMs on this server are monitored by one or more vSphere HA Master Agents, and the agent will attempt to restart VMs after a failure.

vSphere HA Protected VM - vSphere will attempt to restart the VM after a supported failure of the VM.

VM is HA Protected on the following conditions:-

VM is in a vSphere HA enabled cluster.
VM is powered on successfully after a successful user power on.
vSphere HA has recorded that the power state is ON.

When an ESXi Host Server Fails (which is part of a VMware HA Cluster), all the Virtual Machines, which are hosted on that Host, also go down, e.g. fail.

A Host Failure could be:-

1. Pink/Purple Screen of Death - caused by memory fault.
2. Pink/Purple Screen of Death  - cause by cpu fault.
3. Power supply failure (if only a single power supply)

A Host Manual Shutdown, reboot, restart is not considered a host failure. Because it's a controlled shutdown.

So we have a Host which has failed, and ALL the VMs it was hosting are now DOWN!

Is there anything you need to "do"  or "configure" on the other hosts to ensure they are aware when a host fails/would be made aware if another host fais?

Nothing than ensure, it has been configured and tested correctly.       

The Master Host Decides where to restart virtual machines.
0

Featured Post

Is Your Active Directory as Secure as You Think?

More than 75% of all records are compromised because of the loss or theft of a privileged credential. Experts have been exploring Active Directory infrastructure to identify key threats and establish best practices for keeping data safe. Attend this month’s webinar to learn more.

Question has a verified solution.

If you are experiencing a similar issue, please ask a related question

HOW TO: Connect to the VMware vSphere Hypervisor 6.5 (ESXi 6.5) using the vSphere (HTML5 Web) Host Client 6.5, and perform a simple configuration task of adding a new VMFS 6 datastore.
Veeam Backup & Replication has added a new integration – Veeam Backup for Microsoft Office 365.  In this blog, we will discuss how you can benefit from Office 365 email backup with the Veeam’s new product and try to shed some light on the needs and …
In this Micro Tutorial viewers will learn how they can get their files copied out from their unbootable system without need to use recovery services. As an example non-bootable Windows 2012R2 installation is used which has boot problems.
This tutorial will walk an individual through setting the global and backup job media overwrite and protection periods in Backup Exec 2012. Log onto the Backup Exec Central Administration Server. Examine the services. If all or most of them are stop…

896 members asked questions and received personalized solutions in the past 7 days.

Join the community of 500,000 technology professionals and ask your questions.

Join & Ask a Question

Need Help in Real-Time?

Connect with top rated Experts

18 Experts available now in Live!

Get 1:1 Help Now