Windows 2003 Cluster problem
I have a Windows 2003 cluster with two nodes (active/active). Occasionally, one of the nodes hangs and I have to power it off and on manually. Failover doesn't occur until I reboot.
I've reviewed the Windows event logs and found the following:
Cluster File Share resource 'Share Name' has failed a status check. The error code is 64.
What can cause this?
Also, I've found in the Application event log that MSDTC is not starting. After some searching, I read that the MSDTC service needs to be clustered. Does it have to be clustered?
The node that fails has two File Share resources, an IP Address, a Network Name, a Generic Service (nothing fancy, just a service), a Physical Disk, and two Generic Application resources (ABE resources).
Thanks.
ASKER
The only entry near error 64 in the log says that this share resource failed to fail over.
I don't know if I need MSDTC. That's what I am trying to find out.
Do I need it for a File Share resource?
MSDTC required for a file share? Not that I'm aware of.
Does the log show no loss of the IP address or network name?
Have the dependencies been reviewed for the file share resources?
ASKER
I don't see any errors about loss of the IP address. If it had lost an IP address or a Network Name, it should have failed over. For the file share, the only dependency is the physical drive.
Also, after I restarted the server I got these errors:
"Cluster service is requesting a bus reset for device \Device\ClusDisk0."
"The driver for device \Device\RaidPort0 performed a bus reset upon request."
Also, after the restart, the Cluster service didn't start. I had to start it manually.
Is this a new cluster?
Are the quorum and data drives physical and separate?
ASKER
This cluster is not new. It has been configured for 5 months or so, and it crashes like this about once a month.
The quorum and data drives are located on a SAN.
Ensure the Cluster service account has full-access NTFS permissions on the cluster disks and the shared folders.
ASKER
The Cluster service account has full-access NTFS permissions. It is also a member of the local admins group.
But are the quorum and data disks separate physical disks?
ASKER
They are on a SAN that has one HUGE RAID array of drives for storage.
They are two different logical drives, but physically everything is on the same RAID array.
ASKER CERTIFIED SOLUTION
Do the server and/or its applications need MSDTC? Can it be disabled?
Otherwise, if it runs on a cluster, it has to be set up as a cluster resource.