Improve company productivity with a Business Account.Sign Up

x
  • Status: Solved
  • Priority: Medium
  • Security: Public
  • Views: 787
  • Last Modified:

How to know which EVENT failed on powerha?

Hi,

I know hacmp.out is the principal way to determine errors on powerHA but sometimes it's tricky to take a look at it... so, I'd like to know in which powerHA's EVENT was the error.. let's see it with a example:
After moving the RG to a new node, cluster went to  ERROR state, So I want to know why.

So, if I take a look at the log file /var/hacmp/adm/cluster.log I see this:

Apr  8 17:54:03 node2 user:notice HACMP for AIX: EVENT START: node_up node2
Apr  8 17:54:04 node2 user:notice HACMP for AIX: EVENT COMPLETED: node_up node2 0
Apr  8 17:54:06 node2 user:notice HACMP for AIX: EVENT START: node_up_complete node2
Apr  8 17:54:06 node2 user:notice HACMP for AIX: EVENT COMPLETED: node_up_complete node2 0
Apr  8 17:58:31 node2 user:notice HACMP for AIX: EVENT START: site_up siter37
Apr  8 17:58:31 node2 user:notice HACMP for AIX: EVENT START: site_up_remote siter37
Apr  8 17:58:31 node2 user:notice HACMP for AIX: EVENT COMPLETED: site_up_remote siter37 0
Apr  8 17:58:31 node2 user:notice HACMP for AIX: EVENT COMPLETED: site_up siter37 0
Apr  8 17:58:33 node2 user:notice HACMP for AIX: EVENT START: node_up node3 manual
Apr  8 17:58:33 node2 user:notice HACMP for AIX: EVENT COMPLETED: node_up node3 manual 0
Apr  8 17:58:36 node2 user:notice HACMP for AIX: EVENT START: node_up_complete node3
Apr  8 17:58:36 node2 user:notice HACMP for AIX: EVENT COMPLETED: node_up_complete node3 0
Apr  8 17:58:36 node2 user:notice HACMP for AIX: EVENT START: site_up_complete siter37
Apr  8 17:58:36 node2 user:notice HACMP for AIX: EVENT START: site_up_remote_complete siter37
Apr  8 17:58:36 node2 user:notice HACMP for AIX: EVENT COMPLETED: site_up_remote_complete siter37 0
Apr  8 17:58:37 node2 user:notice HACMP for AIX: EVENT COMPLETED: site_up_complete siter37 0
Apr  8 17:59:08 node2 user:notice HACMP for AIX: EVENT START: external_resource_state_change node3
Apr  8 17:59:08 node2 user:notice HACMP for AIX: EVENT COMPLETED: external_resource_state_change node3 0
Apr  8 17:59:09 node2 user:notice HACMP for AIX: EVENT START: rg_move_fence node3 1
Apr  8 17:59:09 node2 user:notice HACMP for AIX: EVENT COMPLETED: rg_move_fence node3 1 0
Apr  8 17:59:09 node2 user:notice HACMP for AIX: EVENT START: rg_move_acquire node3 1
Apr  8 17:59:09 node2 user:notice HACMP for AIX: EVENT START: rg_move node3 1 ACQUIRE
Apr  8 17:59:09 node2 user:notice HACMP for AIX: EVENT COMPLETED: rg_move node3 1 ACQUIRE 0
Apr  8 17:59:09 node2 user:notice HACMP for AIX: EVENT COMPLETED: rg_move_acquire node3 1 0

Open in new window


As these are EVENTs from powerHA, I wanted to know if the last number of every line is the STATUS ERROR from those EVENTs, because knowing this status I will be able to know which EVENT failed...

What's your opinion on this?

Thanks.
0
sminfo
Asked:
sminfo
1 Solution
 
woolmilkporcCommented:
Hi again,

cluster.log will show "EVENT FAILED" for ... failed events, so there should be no need to rely on status codes.

The last number in an "EVENT COMPLETED" or "EVENT FAILED"  line actually indicates the status, by the way.

hacmp.out, on the other hand, should contain the string "!!!!!!!!!! ERROR !!!!!!!!!!" near the place of a failure.

wmp
0
 
sminfoAuthor Commented:
Hi WMP.... very interesting

So with a grep to "!!!!!!!!!! ERROR !!!!!!!!!!"  I can see the error...

Thanks!!
0
Question has a verified solution.

Are you are experiencing a similar issue? Get a personalized answer when you ask a related question.

Have a better answer? Share it in a comment.

Join & Write a Comment

Featured Post

Upgrade your Question Security!

Your question, your audience. Choose who sees your identity—and your question—with question security.

Tackle projects and never again get stuck behind a technical roadblock.
Join Now