Still celebrating National IT Professionals Day with 3 months of free Premium Membership. Use Code ITDAY17

x
?
Solved

Cluster service won't start on one node of cluster, error 1067

Posted on 2010-09-14
10
Medium Priority
?
4,968 Views
Last Modified: 2012-05-10
My apologies for re-posting something but nothing I've tried has helped so far.

I have 2, win 2k3 nodes in my cluster.   The one that won't start has a bunch of 1000 and 7031 errors.  when i try to start the service through the CLI or 'Services' I get the following message. "A system error has occurred" "the process terminated unexpectedly".

I get the same thing when adding the /fixquorum switch as suggested in this KB; http://support.microsoft.com/kb/923838.  I also checked to make sure the driver was set to system and started per the KB.

I also verified the user account for the cluster service had the required rights per this KB; http://support.microsoft.com/kb/269229/en-us

I'm not sure what else to do.  Array Configuration Utility on the server can see the shared storage.

0
Comment
Question by:c2media
[X]
Welcome to Experts Exchange

Add your voice to the tech community where 5M+ people just like you are talking about what matters.

  • Help others & share knowledge
  • Earn cash & points
  • Learn & ask questions
  • 5
  • 3
  • 2
10 Comments
 
LVL 3

Expert Comment

by:novaspoonman
ID: 33674138
How is the shared storage configured? FC/iSCSI? To what arrray configuration utility are you referring?
0
 

Author Comment

by:c2media
ID: 33674622
Array Configuration Utility is HP software that is used to manage storage.  We have a simple setup with 2 servers connected via SCSI to shared storage box.. (HP MSA500).  

0
 
LVL 3

Expert Comment

by:novaspoonman
ID: 33674840
Can each node see and access the quorum disk with the other node shut down?
0
Get your Disaster Recovery as a Service basics

Disaster Recovery as a Service is one go-to solution that revolutionizes DR planning. Implementing DRaaS could be an efficient process, easily accessible to non-DR experts. Learn about monitoring, testing, executing failovers and failbacks to ensure a "healthy" DR environment.

 

Author Comment

by:c2media
ID: 33675558
i won't be able to try that until after hours tonight.  i'll post the result in the morning.
0
 
LVL 22

Expert Comment

by:65td
ID: 33677681
Does the cluster service account have full control on the quorum and shared disks?
Had issues even when administrators have FC, added CSA issue went away (for us).
0
 

Author Comment

by:c2media
ID: 33702800
To answer above questions:
1. can each node see  and access the quorum disk with the other node shut down?
Even with the 2nd node shut down the problem node can't access either of the 2 disks.  They are visible through the HP Array Config Utility but windows only sees drives with no information.

maybe it's less a cluster issue and more of a hardware issue.  I'm more confused than before.

2. Does the cluster service have full control on the quorum and shared disks?  the CSA is a member of domain admins but I can't access the storage to see.
0
 
LVL 22

Expert Comment

by:65td
ID: 33704375
The CSA does not need to be apart of domain admins, does need to be a local admin on each node...
With good node down and rebooting the problem node, does it pickup the disks then?

Any events in the system log?
Hardware changes?
0
 

Author Comment

by:c2media
ID: 33705654
a few different errors in the system log.

7031 and 1000 on 1 node


on the other node I've got:
1069, Cluster resource 'Disk Q:' in Resource Group 'Cluster Group' failed.
 
1066, Cluster disk resource "Disk Q:" is corrupt. Run 'ChkDsk /F' to repair problems. The volume name for this resource is "\\?\Volume{7a15e43a-a19f-11df-b7e4-001438bde4f4}\".
 If available, ChkDsk output will be in the file "C:\WINDOWS\Cluster\ChkDsk_Disk4_Sig8B2E1C25.log".
 ChkDsk may write information to the Application Event Log with Event ID 26180.


I tried running chkdsk Q: /F but it fails with message "Can not open volume for direct access"

would I be better off getting rid of the cluster, getting it working on 1 node and then recreate the Q volume?
0
 
LVL 22

Expert Comment

by:65td
ID: 33716316
See MS's note re the 1069:
http://support.microsoft.com/kb/259237

0
 

Accepted Solution

by:
c2media earned 0 total points
ID: 33738739
the cluster was at a remote location but we brought in a local tech.  the problem was due to a corrupt database.
0

Featured Post

What does it mean to be "Always On"?

Is your cloud always on? With an Always On cloud you won't have to worry about downtime for maintenance or software application code updates, ensuring that your bottom line isn't affected.

Question has a verified solution.

If you are experiencing a similar issue, please ask a related question

The HP utility "HP Lights-Out Online Configuration Utility for Windows Server 2003/2008" could be of great use when it comes to remotely configure a HP servers ILO WITHOUT rebooting the server. We would only need to create and run scripts using thi…
While rebooting windows server 2003 server , it's showing "active directory rebuilding indices please wait" at startup. It took a little while for this process to complete and once we logged on not all the services were started so another reboot is …
Add bar graphs to Access queries using Unicode block characters. Graphs appear on every record in the color you want. Give life to numbers. Hopes this gives you ideas on visualizing your data in new ways ~ Create a calculated field in a query: …
Visualize your data even better in Access queries. Given a date and a value, this lesson shows how to compare that value with the previous value, calculate the difference, and display a circle if the value is the same, an up triangle if it increased…

688 members asked questions and received personalized solutions in the past 7 days.

Join the community of 500,000 technology professionals and ask your questions.

Join & Ask a Question