Still celebrating National IT Professionals Day with 3 months of free Premium Membership. Use Code ITDAY17

x
?
Solved

Windows 2008 SP2 & Exchange 2007 CCR failover did not work properly

Posted on 2010-08-20
14
Medium Priority
?
1,751 Views
Last Modified: 2012-05-10
Hi Experts,

We are seeing a few strange issues in one of our clusters, (failover did not work properly)

Our environment is 2 Physical servers running Windows 2008 SP2(exchange 2007 SP2 CCR), multiple VMs running Windows 2008 SP2(share witness), and multiple CAS/HUB(vms)

today there was an issue that my cluster was unable to contact share witness(HUB), and the active node of CCR became unresponsive



There is also the following error in the system event log which might cause not being able to bring all the cluster resources online.

 

The Fibre Channel Platform Registration Service could not register the platform with fabric 10:00:00:05:1e:ba:48:00.


In addition to that,we are seeing errors as Event ID 1230/ 1146 task category resource control manager, and source FailoverClustering

Can anyone point me on the right direction?
0
Comment
Question by:Jerry Seinfield
[X]
Welcome to Experts Exchange

Add your voice to the tech community where 5M+ people just like you are talking about what matters.

  • Help others & share knowledge
  • Earn cash & points
  • Learn & ask questions
  • 7
  • 6
14 Comments
 
LVL 32

Expert Comment

by:endital1097
ID: 33482806
that fibre channel error sounds like the system lost connection to the disk

i would run the following
Get-ClusteredMailboxServerStatus
0
 

Expert Comment

by:ykiran_kumar
ID: 33483513
Could register error sounds like your FC control lost communication with the Storage/SAN Switch. Try checking FC HBA drivers in the device manager. Try running FC-HBA application and check whether it is binded properly.
0
 

Author Comment

by:Jerry Seinfield
ID: 33489411
Any other suggestions?
0
Ransomware-A Revenue Bonanza for Service Providers

Ransomware – malware that gets on your customers’ computers, encrypts their data, and extorts a hefty ransom for the decryption keys – is a surging new threat.  The purpose of this eBook is to educate the reader about ransomware attacks.

 
LVL 32

Expert Comment

by:endital1097
ID: 33489471
can you post the results of the Get-ClusteredMailboxServerStatus?

Was the FSW available during this time?
0
 

Author Comment

by:Jerry Seinfield
ID: 33489613
By the time this issue happened, FSW was not available, and also, the active node was frozen(hung state)
0
 
LVL 32

Expert Comment

by:endital1097
ID: 33489626
then two nodes were down and you no longer had quorum so the exchange services went offline
you need to determine what happened to your fibre connection
check switch logs
0
 

Author Comment

by:Jerry Seinfield
ID: 33489645
I found the following issue on cluster

The Fibre Channel Platform Registration Service could not register the platform with fabric 10:00:00:05:1e:ba:48:00

However, the SAN admin guys states that Is not a cause for concern, so at this point anything to check at windows 2008 cluster side of things?
0
 
LVL 32

Expert Comment

by:endital1097
ID: 33489723
run the following to generate a cluster log file for both nodes that will be saved it the subdirectory clusterlogs under the current directory
cluster log /g /copy:clusterlogs /level:5

if you don't specify /copy: a log will be generated on each node under the following directory:
%windir%\Cluster\Reports

you could then analyze these logs to try to determine what happened and when
0
 

Author Comment

by:Jerry Seinfield
ID: 33489782
Thanks Endital for the quick answer

Lets say that the SAN is ok, what else can be wrong that my active node in the cluster becomes unresponsive?

What are most common issues in Windows 2008 clustering, and Exchange 2007 CCR?

Cheers
0
 
LVL 32

Expert Comment

by:endital1097
ID: 33489938
since there was a fibre alert, i would start by looking at the hba drivers and ensure that they are up-to-date
are you using mpio?
0
 

Author Comment

by:Jerry Seinfield
ID: 33489951
yes, we are using mpio
Any known issues with Windows 2008 MPIO, and Exchange 2007 SP2 CCR?
0
 
LVL 32

Expert Comment

by:endital1097
ID: 33490038
none that i am aware of currently. i use mpio with scc.

i asked because with mpio in place you should be able to stand a single hba failure. do you have each hba going into separate switches? any other server have issues at this time?
0
 

Author Comment

by:Jerry Seinfield
ID: 33490058
yes we have each HBA going to separate switches, and by the time this issue happen, another cluster failed, we have a total of 6 clusters (all hardware, except HUB/CAS)share witness in on HUBs
0
 
LVL 32

Accepted Solution

by:
endital1097 earned 2000 total points
ID: 33491128
Sounds like the san admin isn't telling you something. Multiple clusters having an issue together.
0

Featured Post

Prepare for your VMware VCP6-DCV exam.

Josh Coen and Jason Langer have prepared the latest edition of VCP study guide. Both authors have been working in the IT field for more than a decade, and both hold VMware certifications. This 163-page guide covers all 10 of the exam blueprint sections.

Question has a verified solution.

If you are experiencing a similar issue, please ask a related question

This article lists the top 5 free OST to PST Converter Tools. These tools save a lot of time for users when they want to convert OST to PST after their exchange server is no longer available or some other critical issue with exchange server or impor…
This article will show how Aten was able to supply easy management and control for Artear's video walls and wide range display configurations of their newsroom.
how to add IIS SMTP to handle application/Scanner relays into office 365.
There are cases when e.g. an IT administrator wants to have full access and view into selected mailboxes on Exchange server, directly from his own email account in Outlook or Outlook Web Access. This proves useful when for example administrator want…
Suggested Courses

721 members asked questions and received personalized solutions in the past 7 days.

Join the community of 500,000 technology professionals and ask your questions.

Join & Ask a Question