Solved

Repeated Critical errors from Microsoft-Windows-FailoverClustering, and the failover cluster cannot fail over automatically

Posted on 2014-12-01
Last Modified: 2015-01-06
I'm having a mysterious problem with my failover cluster:

Cluster name: SQLCluster01.domain.com
 Members: SQLServer01.domain.com and SQLServer02.domain.com

In Failover Cluster Management, under Cluster Events, I received Critical errors with event IDs 1135 and 1177:

Log Name: System
 Source: Microsoft-Windows-FailoverClustering
 Date: 15/06/2014 9:07:49 PM
 Event ID: 1177
 Task Category: None
 Level: Critical
 Keywords:
 User: SYSTEM
 Computer: SQLServer01.domain.com
 Description:
 The Cluster service is shutting down because quorum was lost. This could be due to the loss of network connectivity between some or all nodes in the cluster, or a failover of the witness disk.
 Run the Validate a Configuration wizard to check your network configuration. If the condition persists, check for hardware or software errors related to the network adapter. Also check for failures in any other network components to which the node is connected such as hubs, switches, or bridges.


 Log Name: System
 Source: Microsoft-Windows-FailoverClustering
 Date: 15/06/2014 9:07:28 PM
 Event ID: 1135
 Task Category: None
 Level: Critical
 Keywords:
 User: SYSTEM
 Computer: SQLServer01.domain.com
 Description:
 Cluster node 'SQLServer02' was removed from the active failover cluster membership. The Cluster service on this node may have stopped. This could also be due to the node having lost communication with other active nodes in the failover cluster. Run the Validate a Configuration wizard to check your network configuration. If the condition persists, check for hardware or software errors related to the network adapters on this node. Also check for failures in any other network components to which the node is connected such as hubs, switches, or bridges.

After further investigation, I found an interesting error among the first error messages logged in Event Viewer on SQLServer02:


 Log Name: System
 Source: Tcpip
 Date: 15/06/2014 9:07:29 PM
 Event ID: 4199
 Task Category: None
 Level: Error
 Keywords: Classic
 User: N/A
 Computer: SQLServer02-VM.domain.com
 Description:
 The system detected an address conflict for IP address 192.168.127.142 with the system having network hardware address 00-50-56-AE-29-23. Network operations on this system may be disrupted as a result.

Any help would be greatly appreciated.
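
To see how the three events relate, it can help to put them on one timeline. The following is a minimal sketch that only reuses the timestamps, computer names, and event IDs quoted above; the day/month/year reading of the dates and the helper name `build_timeline` are assumptions, and the clocks of the two nodes may not be perfectly synchronized.

```python
from datetime import datetime

# (timestamp, computer, source, event ID) copied from the three log entries above
EVENTS = [
    ("15/06/2014 9:07:49 PM", "SQLServer01", "FailoverClustering", 1177),
    ("15/06/2014 9:07:28 PM", "SQLServer01", "FailoverClustering", 1135),
    ("15/06/2014 9:07:29 PM", "SQLServer02-VM", "Tcpip", 4199),
]


def build_timeline(events):
    """Return the events sorted by time, assuming day/month/year timestamps."""
    parsed = [
        (datetime.strptime(ts, "%d/%m/%Y %I:%M:%S %p"), host, src, eid)
        for ts, host, src, eid in events
    ]
    return sorted(parsed, key=lambda e: e[0])


if __name__ == "__main__":
    timeline = build_timeline(EVENTS)
    start = timeline[0][0]
    for when, host, src, eid in timeline:
        # Offset in seconds from the earliest event
        offset = int((when - start).total_seconds())
        print(f"+{offset:>2}s  {host:<15} {src:<20} {eid}")
```

Read this way, node removal (1135) is logged first, the address conflict (4199) one second later, and the loss of quorum (1177) about twenty seconds after that. The ordering alone does not prove which event caused which.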
Question by:Maricel Lepana
10 Comments
 
LVL 50

Expert Comment

by:Vitor Montalvão
ID: 40473538
It looks like you're having network problems. Also check whether another machine has the same IP address as one of your servers:
"The system detected an address conflict for IP address 192.168.127.142 with the system having network hardware address 00-50-56-AE-29-23. Network operations on this system may be disrupted as a result."
 

Author Comment

by:Maricel Lepana
ID: 40473553
Thank you for your reply,

That IP address belongs to the failover cluster, and I have double-checked on all cluster members that every IP address is now unique.
 
LVL 15

Expert Comment

by:JBond2010
ID: 40473562
Run the network validation tests in the Failover Cluster Manager console and troubleshoot from there. Also make sure that the network bindings are in the correct order.
 

Author Comment

by:Maricel Lepana
ID: 40473568
I have already run the network validation tests in the Failover Cluster Manager console.
Every test succeeded; no errors or warnings were found.
 
LVL 15

Expert Comment

by:JBond2010
ID: 40473580
Can you please run the Best Practices Analyzer (BPA) on the servers and check the results? Also, please see the link below.

https://social.technet.microsoft.com/Forums/windowsserver/en-US/5bd91581-7d07-4a01-a68e-b926237787f5/failover-cluster-failed-due-to-mysterious-ip-conflict-


Regards,

JBond2010
 
LVL 50

Expert Comment

by:Vitor Montalvão
ID: 40473582
"That IP address belongs to the failover cluster, and I have double-checked on all cluster members that every IP address is now unique."
I don't think the conflicting machine is in the same cluster. It must be another server on your network. Try to find it by the hardware address 00-50-56-AE-29-23.
 

Author Comment

by:Maricel Lepana
ID: 40473583
That hardware address belongs to cluster node 1 (SQLServer01), whose IP address is 192.168.127.155.
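
The lookup by hardware address discussed above can be sketched in a few lines of Python. This is only an illustration, not a tool anyone in this thread used: the description text is copied from the Tcpip 4199 event, the single inventory entry is the mapping reported in this comment, and the names `INVENTORY`, `parse_conflict`, and `identify_owner` are hypothetical.

```python
import re

# Description text copied from the Tcpip event 4199 in this thread.
DESCRIPTION = (
    "The system detected an address conflict for IP address 192.168.127.142 "
    "with the system having network hardware address 00-50-56-AE-29-23. "
    "Network operations on this system may be disrupted as a result."
)

# Hypothetical inventory keyed by MAC address. The one entry is the mapping
# reported in this thread; a real inventory would list every host on the subnet.
INVENTORY = {
    "00-50-56-AE-29-23": ("SQLServer01", "192.168.127.155"),
}

PATTERN = re.compile(
    r"IP address (?P<ip>\d{1,3}(?:\.\d{1,3}){3}).*?"
    r"hardware address (?P<mac>[0-9A-Fa-f]{2}(?:[-:][0-9A-Fa-f]{2}){5})"
)


def parse_conflict(description):
    """Extract (conflicting IP, normalized MAC) from a 4199 description, or None."""
    match = PATTERN.search(description)
    if match is None:
        return None
    mac = match.group("mac").upper().replace(":", "-")
    return match.group("ip"), mac


def identify_owner(description, inventory):
    """Look up which known host owns the MAC named in the conflict event."""
    parsed = parse_conflict(description)
    if parsed is None:
        return None
    ip, mac = parsed
    host, host_ip = inventory.get(mac, ("unknown", None))
    return {"conflict_ip": ip, "mac": mac, "host": host, "host_ip": host_ip}


if __name__ == "__main__":
    print(identify_owner(DESCRIPTION, INVENTORY))
```

With the values from this thread, the conflicting address 192.168.127.142 (the cluster IP) is reported against the network adapter of SQLServer01, whose own address is 192.168.127.155. That matches what was reported here, but by itself it does not explain why both nodes claimed the cluster address.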
 
LVL 15

Accepted Solution

by:
JBond2010 earned 500 total points
ID: 40473585
Can you please run the Best Practices Analyzer (BPA) on the servers and check the results? Also, please see the link below.

https://social.technet.microsoft.com/Forums/windowsserver/en-US/5bd91581-7d07-4a01-a68e-b926237787f5/failover-cluster-failed-due-to-mysterious-ip-conflict-


 Regards,

 JBond2010
 

Author Comment

by:Maricel Lepana
ID: 40473589
Hi JBond,

I am working on that. I will let you know the result.

Thank you,
 

Author Closing Comment

by:Maricel Lepana
ID: 40535022
A network glitch was the root cause of this issue.

Thank you very much.
