Solved

cluster failover

Posted on 2015-01-28
Last Modified: 2015-02-12
I have an Exchange cluster that fails over every other day, regardless of the time of day or day of the week. Each failover starts with Event ID 1135, followed by 1562, 1069, 1564, and 1177.

We have servers in two different states.

The event description says the Cluster service may have stopped or lost communication with other active nodes. I've added to the NIC team used for communication.

The log on the near server says the far server has lost communication and the log on the far server says the near server has lost communication at the same time.
 
The switch is not reporting any problems, and the NICs are not reporting any either.

We've recently increased SameSubnetDelay and CrossSubnetDelay.

There was one new program installed but that has since been uninstalled.

The only other change between when everything was working and now is Windows updates.

Any help would be greatly appreciated.  Thank you
Question by:mfony
 
LVL 19

Expert Comment

by:Miguel Angel Perez Muñoz
ID: 40576965
Could you check the Cluster.log file and paste the entries from when the cluster fails over?
 

Author Comment

by:mfony
ID: 40577324
At the time of the first critical 1135 message:

[RES] Network Name: Agent: Sending request Netname/RecheckConfig to NN:463e84f0-5b46-4e78-ac45-e20054b0a8a4:Netbios
[RES] Network Name <Cluster Name>: Netbios: Slow Operation, FinishWithReply: 0
[RES] Network Name:  [NN] got sync reply: 0
[RES] Network Name <Cluster Name>: Netbios: End of Slow Operation, state: Initialized/Idle, prevWorkState: Idle

I looked through the file and most, if not all, of the lines contained:

FinishWithReply:0
Netbios:Slow Operation
Dns:Slow Operation
got sync reply:0
prevWorkState:Idle

The other types of lines included:

[GEM] Sending 1 messages as a batched GEM message
[GUM] Node 2: Processing RequestLock 2:7932
[GUM] Node 2: Processing GrantLock to 2 (sent by 1 gumid: 2835855)
[GEM] Sending 1 messages as a batched GEM message

I don't see anything happening in this file. What am I looking for?
 
LVL 19

Accepted Solution

by:
Miguel Angel Perez Muñoz earned 500 total points
ID: 40577411
Look for messages about loss of connectivity between the nodes. A link going down or becoming overloaded causes traffic loss and a cluster switchover.
Could you verify that the link between the nodes is working properly when the cluster switches over?
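
Since most of Cluster.log is routine chatter, one way to narrow it down is to drop the lines already identified in this thread as noise and read what is left around the time of the 1135 event. Below is a minimal Python sketch of that idea. The marker strings are taken from the excerpts quoted above; the function name, the script layout, and the file path passed on the command line are only assumptions for illustration, and the list of markers will likely need adjusting for a real log.

```python
import sys

# Substrings of routine lines, copied from the excerpts quoted in this thread.
# This list is an assumption about what counts as noise; extend it as needed.
ROUTINE_MARKERS = (
    "Slow Operation",
    "got sync reply",
    "Sending request Netname/RecheckConfig",
    "batched GEM message",
    "Processing RequestLock",
    "Processing GrantLock",
)


def interesting_lines(lines):
    """Yield non-empty lines that contain none of the routine markers."""
    for line in lines:
        text = line.strip()
        if not text:
            continue
        if any(marker in text for marker in ROUTINE_MARKERS):
            continue
        yield text


if __name__ == "__main__":
    # Hypothetical usage: python filter_cluster_log.py <path to Cluster.log>
    log_path = sys.argv[1]
    # errors="replace" avoids a crash if the file is not in the default encoding.
    with open(log_path, errors="replace") as handle:
        for text in interesting_lines(handle):
            print(text)
```

Whatever remains after filtering is where to look for messages about lost connectivity between the nodes.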