How to do Health check for  AIX servers ? what information need to be collected ?

Posted on 2009-06-28
Last Modified: 2013-11-17
We have 400 hundred servers in our Implementation Project. We need to do health check For this servers ? What information I need to check and collect for AIX servers ?
Do we have any standard tools available to do that ? How to prepare report for it ?
Question by:rammaghenthar
Welcome to Experts Exchange

Add your voice to the tech community where 5M+ people just like you are talking about what matters.

  • Help others & share knowledge
  • Earn cash & points
  • Learn & ask questions
  • 2
LVL 68

Accepted Solution

woolmilkporc earned 500 total points
ID: 24732467
Hi again,

there is no standard healthcheck script in AIX.

In one of our other cases we're talking about 'cfg2html' which is a fine tool to get
an overview of the vital data of your machines.

Furthermore, health checking cannot be done by sort of a 'snapshot', but should be a continuous process,
using a monitoring tool like e.g. nagios:

Anyway, to check the most important things you could run a little script
regularly against all machines contained in a server list for:

prtconf -> overview
errpt -> hardware error log
df -> filesystems
diag -cs -> hardware diagnostics
lppchk -v -> software packages' consistency

It could look like this (see attachment):

Note that you should have ssh access using publickey, in order to not get prompted for passwords.

And since you're talking about 400 servers, it seems nearly impossible to read all the output from
any check script, so I'd really suggest using a monitoring tool (see nagios above)!


for host in $(cat $serverlist)
   /usr/bin/ssh $host '
   echo RUNNING errpt
   echo RUNNING df -g
   /usr/bin/df -g
   echo RUNNING diag -cs
   /usr/sbin/diag -cs
   echo RUNNING lppchk -v
   /usr/bin/lppchk -v ' > [/path/to/]$host.$(date +"%Y.%m.%d").custom.check

Open in new window

LVL 62

Expert Comment

ID: 24735041
400000 AIX servers? Are you IBM?
diag has automated diagnostics facility whose config is stored in ODM
LVL 30

Expert Comment

by:Kerem ERSOY
ID: 24749166
Depends on what you understand from health chek. If you're after monitoring CPU load, Disk capacity etc. You need a systematic approach.  In this case you need central periodical monitoring and alerting. This could be done with with monitoring tools such as IBM's Tivoli or Nagios.
LVL 62

Expert Comment

ID: 24763543
diag contains part about chcheduled RAM/CPU/DISK/RAID/sysplanar0 diagnostics.
it serves practical and formal policy porposes quite well.

Featured Post

Secure Your Active Directory - April 20, 2017

Active Directory plays a critical role in your company’s IT infrastructure and keeping it secure in today’s hacker-infested world is a must.
Microsoft published 300+ pages of guidance, but who has the time, money, and resources to implement? Register now to find an easier way.

Question has a verified solution.

If you are experiencing a similar issue, please ask a related question

Suggested Solutions

My previous tech tip, Installing the Solaris OS From the Flash Archive On a Tape (, discussed installing the Solaris Operating S…
Java performance on Solaris - Managing CPUs There are various resource controls in operating system which directly/indirectly influence the performance of application. one of the most important resource controls is "CPU".   In a multithreaded…
Learn several ways to interact with files and get file information from the bash shell. ls lists the contents of a directory: Using the -a flag displays hidden files: Using the -l flag formats the output in a long list: The file command gives us mor…
Learn how to find files with the shell using the find and locate commands. Use locate to find a needle in a haystack.: With locate, check if the file still exists.: Use find to get the actual location of the file.:

733 members asked questions and received personalized solutions in the past 7 days.

Join the community of 500,000 technology professionals and ask your questions.

Join & Ask a Question