Solved

I keep getting this error every morning from my 2012 clustered VMs: "Windows Cluster Service has become unavailable (temporary or lost quorum)". Any ideas?

Posted on 2015-02-19
Last Modified: 2015-03-15
[External] SQL Server Alert System: 'Windows Failover Cluster Service unavailable/AG failover occurred' occurred on XXXXXX

DATE/TIME:      2/19/2015 7:57:42 AM

DESCRIPTION:    (None)

COMMENT:        One or both of the following has occurred:

1. Windows - Windows Cluster Service has become unavailable (temporary or lost quorum);
2. SQL Server - Availability Group lost connection or AG failover has occurred.

Please check and take appropriate actions.

JOB RUN:        (None)

There are 4 VMware VMs in a SQL 2012 cluster, and the OS is 2012 as well. I keep getting these errors every morning. Any ideas?
Question by:Harper McDonald
9 Comments
 
LVL 33

Expert Comment

by:ste5an
ID: 40618924
Have you checked whether option 1 or option 2 is the case?
 
LVL 4

Author Comment

by:Harper McDonald
ID: 40618947
This is happening on multiple clusters in our environment. Get-ClusterLog generates cluster.log, but nothing in it shows why or what is failing. The event logs don't say much either, just that the quorum has been lost. Our network team has looked into it and found nothing in three logs. We are running in a FlexPod environment with UCS, NetApp storage, and Nexus switches, so it's up-to-date gear. I didn't know if someone might be having the same issue or have a solution. These are VMware clusters, and the NICs all use the VMXNET3 driver.
 
LVL 33

Expert Comment

by:ste5an
ID: 40618997
Three logs doesn't sound like much. Or do you have log consolidation?

You need to correlate all logs using the timestamp from the above error message +/- a meaningful grace period...
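
A minimal sketch of that kind of correlation, assuming each log has already been exported to lines of the form `YYYY-MM-DD HH:MM:SS message` in one common time zone. That format, the 10-minute grace period, and the names `parse_line` and `events_near` are assumptions for illustration only; real cluster, SQL, and event logs each use their own layout and would need their own parsing. Also keep in mind that cluster.log timestamps are normally written in UTC unless the log was generated with local time, so they may need shifting before comparison.

```python
from datetime import datetime, timedelta

# Alert time copied from the e-mail in the question.
ALERT_TIME = datetime(2015, 2, 19, 7, 57, 42)
GRACE = timedelta(minutes=10)  # assumed window; widen or narrow as needed


def parse_line(line):
    """Split 'YYYY-MM-DD HH:MM:SS message' into (timestamp, message).

    Returns None for lines that do not start with a timestamp.
    """
    try:
        stamp = datetime.strptime(line[:19], "%Y-%m-%d %H:%M:%S")
    except ValueError:
        return None
    return stamp, line[19:].strip()


def events_near(sources, center=ALERT_TIME, grace=GRACE):
    """Merge entries from all sources that fall inside center +/- grace.

    `sources` maps a log name to an iterable of text lines.
    The result is sorted by timestamp so the sequence of events is visible.
    """
    hits = []
    for name, lines in sources.items():
        for line in lines:
            parsed = parse_line(line)
            if parsed is None:
                continue
            stamp, message = parsed
            if abs(stamp - center) <= grace:
                hits.append((stamp, name, message))
    return sorted(hits)
```

Feeding it the cluster log, the SQL error log, and the hypervisor and switch logs under separate names gives one merged timeline around the alert, which makes it easier to see what happened first.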

 
LVL 4

Author Comment

by:Harper McDonald
ID: 40619235
We have tried that, even with the SQL logs and in verbose mode. It's very strange. We have even increased the VM resources.
 
LVL 33

Expert Comment

by:ste5an
ID: 40619246
Does it happen at the same time or is there any other pattern?
 
LVL 4

Author Comment

by:Harper McDonald
ID: 40619255
It usually happens very early in the morning, but there isn't much of a pattern beyond that. I need to get with the backup admin and see if jobs run on specific clusters at that time. That might shed some light.
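
One rough way to look for a pattern before talking to the backup admin is to count the alerts per hour of day and compare the busiest hours with the backup schedule. This is only a sketch: the function name `alerts_by_hour` is made up, and it assumes the timestamps are in the same `M/D/YYYY H:MM:SS AM/PM` form as the DATE/TIME field of the alert e-mail.

```python
from collections import Counter
from datetime import datetime


def alerts_by_hour(stamps, fmt="%m/%d/%Y %I:%M:%S %p"):
    """Count alerts per hour of day (0-23) from alert e-mail timestamps."""
    return Counter(datetime.strptime(s, fmt).hour for s in stamps)


# Only the first timestamp comes from the alert above;
# the other two are made up to show the call.
counts = alerts_by_hour([
    "2/19/2015 7:57:42 AM",
    "2/20/2015 7:10:00 AM",
    "2/21/2015 2:05:00 AM",
])
for hour, n in sorted(counts.items()):
    print(f"{hour:02d}:00  {n} alert(s)")
```

If most alerts land in the same hour or two across several clusters, that window is the one to compare against backup or snapshot jobs.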
 
LVL 47

Expert Comment

by:Vitor Montalvão
ID: 40627993
The AG depends on the Windows cluster, so the 2nd error should be derived from the first one.
The quorum is the only resource that is shared, so I would check with the storage guys what's happening with that disk.
 
LVL 4

Accepted Solution

by:
Harper McDonald earned 0 total points
ID: 40656836
Removed the vNIC and reinstalled it on the cluster nodes.
 
LVL 4

Author Closing Comment

by:Harper McDonald
ID: 40666002
It fixed the problem.


Question has a verified solution.
