Solved

How to know which EVENT failed on powerha?

Posted on 2014-04-09
2
718 Views
Last Modified: 2014-04-09
Hi,

I know hacmp.out is the principal way to determine errors on powerHA but sometimes it's tricky to take a look at it... so, I'd like to know in which powerHA's EVENT was the error.. let's see it with a example:
After moving the RG to a new node, cluster went to  ERROR state, So I want to know why.

So, if I take a look at the log file /var/hacmp/adm/cluster.log I see this:

Apr  8 17:54:03 node2 user:notice HACMP for AIX: EVENT START: node_up node2
Apr  8 17:54:04 node2 user:notice HACMP for AIX: EVENT COMPLETED: node_up node2 0
Apr  8 17:54:06 node2 user:notice HACMP for AIX: EVENT START: node_up_complete node2
Apr  8 17:54:06 node2 user:notice HACMP for AIX: EVENT COMPLETED: node_up_complete node2 0
Apr  8 17:58:31 node2 user:notice HACMP for AIX: EVENT START: site_up siter37
Apr  8 17:58:31 node2 user:notice HACMP for AIX: EVENT START: site_up_remote siter37
Apr  8 17:58:31 node2 user:notice HACMP for AIX: EVENT COMPLETED: site_up_remote siter37 0
Apr  8 17:58:31 node2 user:notice HACMP for AIX: EVENT COMPLETED: site_up siter37 0
Apr  8 17:58:33 node2 user:notice HACMP for AIX: EVENT START: node_up node3 manual
Apr  8 17:58:33 node2 user:notice HACMP for AIX: EVENT COMPLETED: node_up node3 manual 0
Apr  8 17:58:36 node2 user:notice HACMP for AIX: EVENT START: node_up_complete node3
Apr  8 17:58:36 node2 user:notice HACMP for AIX: EVENT COMPLETED: node_up_complete node3 0
Apr  8 17:58:36 node2 user:notice HACMP for AIX: EVENT START: site_up_complete siter37
Apr  8 17:58:36 node2 user:notice HACMP for AIX: EVENT START: site_up_remote_complete siter37
Apr  8 17:58:36 node2 user:notice HACMP for AIX: EVENT COMPLETED: site_up_remote_complete siter37 0
Apr  8 17:58:37 node2 user:notice HACMP for AIX: EVENT COMPLETED: site_up_complete siter37 0
Apr  8 17:59:08 node2 user:notice HACMP for AIX: EVENT START: external_resource_state_change node3
Apr  8 17:59:08 node2 user:notice HACMP for AIX: EVENT COMPLETED: external_resource_state_change node3 0
Apr  8 17:59:09 node2 user:notice HACMP for AIX: EVENT START: rg_move_fence node3 1
Apr  8 17:59:09 node2 user:notice HACMP for AIX: EVENT COMPLETED: rg_move_fence node3 1 0
Apr  8 17:59:09 node2 user:notice HACMP for AIX: EVENT START: rg_move_acquire node3 1
Apr  8 17:59:09 node2 user:notice HACMP for AIX: EVENT START: rg_move node3 1 ACQUIRE
Apr  8 17:59:09 node2 user:notice HACMP for AIX: EVENT COMPLETED: rg_move node3 1 ACQUIRE 0
Apr  8 17:59:09 node2 user:notice HACMP for AIX: EVENT COMPLETED: rg_move_acquire node3 1 0

Open in new window


As these are EVENTs from powerHA, I wanted to know if the last number of every line is the STATUS ERROR from those EVENTs, because knowing this status I will be able to know which EVENT failed...

What's your opinion on this?

Thanks.
0
Comment
Question by:sminfo
[X]
Welcome to Experts Exchange

Add your voice to the tech community where 5M+ people just like you are talking about what matters.

  • Help others & share knowledge
  • Earn cash & points
  • Learn & ask questions
2 Comments
 
LVL 68

Accepted Solution

by:
woolmilkporc earned 500 total points
ID: 39988288
Hi again,

cluster.log will show "EVENT FAILED" for ... failed events, so there should be no need to rely on status codes.

The last number in an "EVENT COMPLETED" or "EVENT FAILED"  line actually indicates the status, by the way.

hacmp.out, on the other hand, should contain the string "!!!!!!!!!! ERROR !!!!!!!!!!" near the place of a failure.

wmp
0
 

Author Closing Comment

by:sminfo
ID: 39988320
Hi WMP.... very interesting

So with a grep to "!!!!!!!!!! ERROR !!!!!!!!!!"  I can see the error...

Thanks!!
0

Featured Post

Independent Software Vendors: We Want Your Opinion

We value your feedback.

Take our survey and automatically be enter to win anyone of the following:
Yeti Cooler, Amazon eGift Card, and Movie eGift Card!

Question has a verified solution.

If you are experiencing a similar issue, please ask a related question

In tuning file systems on the Solaris Operating System, changing some parameters of a file system usually destroys the data on it. For instance, changing the cache segment block size in the volume of a T3 requires that you delete the existing volu…
FreeBSD on EC2 FreeBSD (https://www.freebsd.org) is a robust Unix-like operating system that has been around for many years. FreeBSD is available on Amazon EC2 through Amazon Machine Images (AMIs) provided by FreeBSD developer and security office…
Learn several ways to interact with files and get file information from the bash shell. ls lists the contents of a directory: Using the -a flag displays hidden files: Using the -l flag formats the output in a long list: The file command gives us mor…
This video shows how to set up a shell script to accept a positional parameter when called, pass that to a SQL script, accept the output from the statement back and then manipulate it in the Shell.

730 members asked questions and received personalized solutions in the past 7 days.

Join the community of 500,000 technology professionals and ask your questions.

Join & Ask a Question