Solved

I keep getting this error every morning from my 2012 clustered VMs: Windows Cluster Service has become unavailable (temporary or lost quorum). Any ideas?

Posted on 2015-02-19
9
91 Views
Last Modified: 2015-03-15
[External] SQL Server Alert System: 'Windows Failover Cluster Service unavailable/AG failover occurred' occurred on XXXXXX

DATE/TIME:      2/19/2015 7:57:42 AM

DESCRIPTION:    (None)

COMMENT:        One or both of the following has occurred:

1. Windows - Windows Cluster Service has become unavailable (temporary or lost quorum);
2. SQL Server - Availability Group lost connection or AG failover has occurred.

Please check and take appropriate actions.

JOB RUN:        (None)

There are 4 VMware VMs in a SQL 2012 cluster, and the OS is 2012 as well. I keep getting this error every morning. Any ideas?
Question by:Harper McDonald
9 Comments
 
LVL 33

Expert Comment

by:ste5an
ID: 40618924
Have you checked whether option 1 or option 2 is true?
0
 
LVL 4

Author Comment

by:Harper McDonald
ID: 40618947
This is happening on multiple clusters in our environment. Get-ClusterLog generates cluster.log, but nothing in it shows why or what happened. The event logs don't say much either, just that the quorum has been lost. Our network team has looked into it and found nothing in three logs. We are running in a FlexPod environment with UCS, NetApp storage, and Nexus switches, so the gear is up to date. I didn't know if someone might be having the same issue or have a solution. These are VMware clusters, and all the NICs use the VMXNET3 driver.
0
 
LVL 33

Expert Comment

by:ste5an
ID: 40618997
Three logs doesn't sound like much. Or do you have log consolidation?

You need to correlate all logs using the timestamp from the above error message +/- a meaningful grace period...
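
A minimal sketch of that correlation in Python, assuming the log entries have already been parsed into `(timestamp, source, message)` tuples. The sample entries, the source names, and the 10-minute grace period are all hypothetical; only the alert timestamp comes from the message above. As far as I know, cluster.log from Get-ClusterLog is written in UTC unless it is generated with -UseLocalTime, so normalize the timestamps before comparing.

```python
from datetime import datetime, timedelta

# Format of the DATE/TIME field in the alert mail, e.g. "2/19/2015 7:57:42 AM"
ALERT_FORMAT = "%m/%d/%Y %I:%M:%S %p"


def parse_alert_time(text):
    """Parse the alert's DATE/TIME value into a datetime."""
    return datetime.strptime(text.strip(), ALERT_FORMAT)


def entries_near(entries, alert_time, grace=timedelta(minutes=10)):
    """Return the (timestamp, source, message) tuples that fall within
    +/- grace of alert_time, sorted by timestamp."""
    lo, hi = alert_time - grace, alert_time + grace
    return sorted(e for e in entries if lo <= e[0] <= hi)


alert = parse_alert_time("2/19/2015 7:57:42 AM")

# Hypothetical entries merged from several logs (all in the same time zone).
sample = [
    (datetime(2015, 2, 19, 7, 30, 0), "backup", "hypothetical: job started"),
    (datetime(2015, 2, 19, 7, 55, 10), "system", "hypothetical: NIC link state change"),
    (datetime(2015, 2, 19, 7, 57, 40), "cluster", "hypothetical: node lost communication"),
    (datetime(2015, 2, 19, 9, 0, 0), "sql", "hypothetical: unrelated entry"),
]

nearby = entries_near(sample, alert)
for ts, source, message in nearby:
    print(ts.isoformat(), source, message)
```

Widening or narrowing `grace` is the judgment call here: too small and the real trigger is missed, too large and the result is as noisy as the raw logs.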
0
 
LVL 4

Author Comment

by:Harper McDonald
ID: 40619235
We have tried that, including the SQL logs in verbose mode. It's very strange. We have even increased the VM resources.
0
 
LVL 33

Expert Comment

by:ste5an
ID: 40619246
Does it happen at the same time or is there any other pattern?
0
 
LVL 4

Author Comment

by:Harper McDonald
ID: 40619255
It usually happens very early in the morning, but there is not really much of a pattern. I need to get with the backup admin and see if jobs run on specific clusters at that time. That might shed some light.
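
One rough way to test the backup-job theory, sketched in Python: count the alerts per hour of day and check which of them fall inside a job window. Only the first alert time is real (from the message above); the other alert times, the job names, and the windows are hypothetical placeholders to be replaced with what the backup admin reports.

```python
from collections import Counter
from datetime import datetime, time


def hour_histogram(alert_times):
    """Count alerts per hour of day to expose a time-of-day pattern."""
    return Counter(t.hour for t in alert_times)


def in_window(t, start, end):
    """True if time-of-day t is inside [start, end]; handles windows
    that cross midnight (start later than end)."""
    if start <= end:
        return start <= t <= end
    return t >= start or t <= end


def alerts_during_jobs(alert_times, job_windows):
    """Map each job name to the alerts that fired during its window."""
    return {
        name: [a for a in alert_times if in_window(a.time(), start, end)]
        for name, (start, end) in job_windows.items()
    }


alerts = [
    datetime(2015, 2, 19, 7, 57, 42),  # from the alert mail
    datetime(2015, 2, 20, 7, 41, 5),   # hypothetical
    datetime(2015, 2, 21, 2, 15, 30),  # hypothetical
]

# Hypothetical job schedule: name -> (start, end) as times of day.
windows = {
    "nightly_backup": (time(23, 0), time(3, 0)),
    "vm_snapshot": (time(7, 30), time(8, 15)),
}

by_hour = hour_histogram(alerts)
overlaps = alerts_during_jobs(alerts, windows)
for name, hits in overlaps.items():
    print(name, len(hits))
```

If most alerts cluster inside one job's window across several clusters, that job is the first thing to look at.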
0
 
LVL 49

Expert Comment

by:Vitor Montalvão
ID: 40627993
The AG depends on the Windows cluster, so the second error should derive from the first one.
The quorum is the only shared resource, so I would check with the storage team what is happening with that disk.
0
 
LVL 4

Accepted Solution

by:Harper McDonald (earned 0 total points)
ID: 40656836
Removed the vNIC and reinstalled it on the cluster nodes.
0
 
LVL 4

Author Closing Comment

by:Harper McDonald
ID: 40666002
It fixed the problem.
0


Question has a verified solution.
