Solved

Cluster service won't start on one node of cluster, error 1067

Posted on 2010-09-14
Last Modified: 2012-05-10
My apologies for re-posting something but nothing I've tried has helped so far.

I have two Windows Server 2003 nodes in my cluster. The one that won't start logs a bunch of 1000 and 7031 errors. When I try to start the service through the CLI or 'Services', I get the following messages: "A system error has occurred" and "The process terminated unexpectedly".

I get the same thing when adding the /fixquorum switch as suggested in this KB: http://support.microsoft.com/kb/923838. I also checked that the driver was set to System and was started, per the KB.

I also verified that the user account for the cluster service has the required rights per this KB: http://support.microsoft.com/kb/269229/en-us

I'm not sure what else to do. The Array Configuration Utility on the server can see the shared storage.

Question by:c2media
10 Comments
 
LVL 3

Expert Comment

by:novaspoonman
ID: 33674138
How is the shared storage configured? FC/iSCSI? Which array configuration utility are you referring to?
 

Author Comment

by:c2media
ID: 33674622
Array Configuration Utility is HP software used to manage storage. We have a simple setup with 2 servers connected via SCSI to a shared storage box (HP MSA500).

 
LVL 3

Expert Comment

by:novaspoonman
ID: 33674840
Can each node see and access the quorum disk with the other node shut down?

Author Comment

by:c2media
ID: 33675558
I won't be able to try that until after hours tonight. I'll post the result in the morning.
 
LVL 22

Expert Comment

by:65td
ID: 33677681
Does the cluster service account (CSA) have full control on the quorum and shared disks?
We had issues even when Administrators had full control; once we added the CSA, the issue went away (for us).
 

Author Comment

by:c2media
ID: 33702800
To answer the questions above:
1. Can each node see and access the quorum disk with the other node shut down?
Even with the 2nd node shut down, the problem node can't access either of the 2 disks. They are visible through the HP Array Config Utility, but Windows only sees drives with no information.

Maybe it's less a cluster issue and more of a hardware issue. I'm more confused than before.

2. Does the cluster service account have full control on the quorum and shared disks? The CSA is a member of Domain Admins, but I can't access the storage to check.
 
LVL 22

Expert Comment

by:65td
ID: 33704375
The CSA does not need to be a part of Domain Admins, but it does need to be a local admin on each node...
With the good node down, if you reboot the problem node, does it pick up the disks then?

Any events in the system log?
Any hardware changes?
 

Author Comment

by:c2media
ID: 33705654
A few different errors in the system log.

7031 and 1000 on one node.

On the other node I've got:
1069, Cluster resource 'Disk Q:' in Resource Group 'Cluster Group' failed.
 
1066, Cluster disk resource "Disk Q:" is corrupt. Run 'ChkDsk /F' to repair problems. The volume name for this resource is "\\?\Volume{7a15e43a-a19f-11df-b7e4-001438bde4f4}\".
 If available, ChkDsk output will be in the file "C:\WINDOWS\Cluster\ChkDsk_Disk4_Sig8B2E1C25.log".
 ChkDsk may write information to the Application Event Log with Event ID 26180.


I tried running chkdsk Q: /F but it fails with the message "Cannot open volume for direct access".

Would I be better off getting rid of the cluster, getting it working on 1 node, and then recreating the Q volume?
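
For anyone triaging the same events, here is a minimal sketch of pulling the useful fields (resource name, volume name, ChkDsk log path) out of the 1066 message text quoted above. It assumes the event text has been copied into a string by hand; the helper name `parse_event_1066` is hypothetical and not part of any Microsoft tool.

```python
import re

# Event 1066 text as quoted in this thread; in practice it would be
# copied out of the System event log by hand (assumption).
EVENT_1066 = (
    "Cluster disk resource \"Disk Q:\" is corrupt. "
    "Run 'ChkDsk /F' to repair problems. "
    "The volume name for this resource is "
    r'"\\?\Volume{7a15e43a-a19f-11df-b7e4-001438bde4f4}\". '
    "If available, ChkDsk output will be in the file "
    r'"C:\WINDOWS\Cluster\ChkDsk_Disk4_Sig8B2E1C25.log".'
)


def parse_event_1066(message):
    """Hypothetical helper: extract the quoted fields from a 1066 message."""
    resource = re.search(r'Cluster disk resource "([^"]+)"', message)
    volume = re.search(r'volume name for this resource is "([^"]+)"', message)
    log_path = re.search(r'output will be in the file "([^"]+)"', message)
    return {
        "resource": resource.group(1) if resource else None,
        "volume": volume.group(1) if volume else None,
        "chkdsk_log": log_path.group(1) if log_path else None,
    }


info = parse_event_1066(EVENT_1066)
print(info["resource"])
print(info["volume"])
print(info["chkdsk_log"])
```

The ChkDsk log path is the first thing worth checking on the node that logged the event, since the message itself says the output may have been written there.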
 
LVL 22

Expert Comment

by:65td
ID: 33716316
See MS's note re the 1069:
http://support.microsoft.com/kb/259237

 

Accepted Solution

by:
c2media earned 0 total points
ID: 33738739
The cluster was at a remote location, but we brought in a local tech. The problem was due to a corrupt database.