Solved

Expected downtime of VM in VMWare cluster after simulating H/W failure.

Posted on 2016-09-07
9
21 Views
Last Modified: 2016-10-14
Hi guys - what the topic says.
I'm preparing my VM env for production and that's one of the things I'm trying.

Without using Fault Tolerance, it takes 15-20 pings for the VM to automatically start on a different host.

I have a pretty beefy setup with 4 powerful hosts and Compellent so if the above is not right, I would suspect a wrong config rather than H/W bottleneck.

I'm still new to VM, please try to keep it simple.

Thanks
0
Comment
Question by:tp-it-team
9 Comments
 
LVL 16

Accepted Solution

by:
Dirk Mare earned 250 total points (awarded by participants)
ID: 41787518
Remember if the host "fails" and the VM fails-over to a different host the VM still has to power on, on that host..
Post, Boot Operating System, ext..

Having a delay is normal as their is no active state the server resumes from.

DirkMare
0
 

Author Comment

by:tp-it-team
ID: 41787527
Sure, but... I believe it was something like 4-5 pings when it was demo'ed to me... But I can be wrong, I don't remember exactly. Shutting down that VM and powering it back on the same host actually takes much shorter time than failover.
0
 
LVL 16

Expert Comment

by:Dirk Mare
ID: 41787533
open up the console of the VM server and simulate fail over it could (if its Windows) have startup selection that counts down from 30 seconds to boot into Recovery because of a dirty shutdown..

DirkMare
0
Industry Leaders: We Want Your Opinion!

We value your feedback.

Take our survey and automatically be enter to win anyone of the following:
Yeti Cooler, Amazon eGift Card, and Movie eGift Card!

 
LVL 120

Assisted Solution

by:Andrew Hancock (VMware vExpert / EE MVE^2)
Andrew Hancock (VMware vExpert / EE MVE^2) earned 125 total points (awarded by participants)
ID: 41787535
1 to 2 minutes waiting for restart is Good metric.
0
 
LVL 120
ID: 41787536
4-5 pings is vMotion and that's slow!

vMotion and VMware HA us different if you require faster fail over or HA look at FT or Fail over Cluster or other in VM HA and replication.
0
 
LVL 120
ID: 41787543
also remember it takes time for HA to notice the host is down.

so how long after you have killed your host does it take BEFORE it attempts to start VM protected by HA
0
 
LVL 1

Assisted Solution

by:specialist Mohamed
specialist Mohamed earned 125 total points (awarded by participants)
ID: 41788346
Agree with Andrew on this.
If we are talking about vMotion, we might lose hardly half a dozen pings and not more than that for a good working setup.

There are conditions that HA should check for before it reboots the VM's on a specific host.
By the time it checks heartbeats from datastores and checks for Network Isolation response and decides to reboot, it will be a few pings (as you calculate).
Then the VM has to be registered to a different host. If there are stale locks held by the host that went down, then it takes a bit longer for HA to trigger the re-register process successfully.
The VM's boot operation time taken should also be considered in case of HA.

Note: HA "reboots" the VM on a different host.
0
 
LVL 120
ID: 41788354
HA does not reboot VM on other hosts, it's a COLD START-UP.

e.g. Power-Up.
0
 
LVL 16

Expert Comment

by:Dirk Mare
ID: 41843326
More then enough information given to answer authers question.

DirkMare
0

Featured Post

Announcing the Most Valuable Experts of 2016

MVEs are more concerned with the satisfaction of those they help than with the considerable points they can earn. They are the types of people you feel privileged to call colleagues. Join us in honoring this amazing group of Experts.

Question has a verified solution.

If you are experiencing a similar issue, please ask a related question

Suggested Solutions

In this article, I will show you HOW TO: Suppress Configuration Issues and Warnings Alert displayed in Summary status for ESXi 6.5 after enabling SSH or ESXi Shell.
When rebooting a vCenters 6.0 and try to connect using vSphere Client we get this issue "Invalid URL: The hostname could not parsed." When we get this error we need to do some changes in the vCenter advanced settings to fix the issue.
Teach the user how to configure vSphere Replication and how to protect and recover VMs Open vSphere Web Client: Verify vsphere Replication is enabled: Enable vSphere Replication for a virtual machine: Verify replicated VM is created: Recover replica…
Advanced tutorial on how to run the esxtop command to capture a batch file in csv format in order to export the file and use it for performance analysis. He demonstrates how to download the file using a vSphere web client (or vSphere client) and exp…

730 members asked questions and received personalized solutions in the past 7 days.

Join the community of 500,000 technology professionals and ask your questions.

Join & Ask a Question