Health check on AIX 6.1 and Oracle databases

We have had 4 outages in the last 10 days on our new GIS environment running on the P750. What steps should we take to perform a health check? The cause is not necessarily server related. Our configuration includes:

 

AIX 6.1 TL5 SP1

Veritas Storage Foundation HA 5.1 MP1

Oracle 10 and 11g

P750 w/ 3 LPARS created
Rhiaanon44 Asked:

balasundaram_s Commented:
A general health check includes filesystem usage monitoring, process monitoring, etc. For a specific application, a customized monitoring solution is highly recommended.
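As an illustration of the filesystem usage part, here is a minimal sketch, not a finished monitoring solution. It assumes POSIX-format `df -P` output (capacity in column 5, mount point in column 6) and a hypothetical helper name, `fs_over_threshold`; the 90 percent default is an arbitrary choice, and mount points containing spaces are not handled.

```shell
#!/bin/sh
# Print every filesystem whose usage is at or above a threshold (percent).
# Reads "df -P" style output on stdin, so it can be fed live or canned data.
# fs_over_threshold is a hypothetical helper name, not a standard tool.
fs_over_threshold() {
    limit="${1:-90}"                  # default threshold is an assumption
    awk -v limit="$limit" '
        NR > 1 {                      # skip the header line
            pct = $5
            sub(/%/, "", pct)         # strip the trailing percent sign
            if (pct + 0 >= limit + 0) # entries showing "-" count as 0
                print $6, pct "%"
        }'
}
```

It could be run from cron as `df -P | fs_over_threshold 90`, mailing the output whenever it is non-empty.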
woolmilkporc Commented:
Hi,

as for the AIX server side -

- Make sure the newest firmware is installed, which should be AL710_086. Get the latest firmware via Fix Central - http://www-933.ibm.com/support/fixcentral/

- Check errpt regularly for hardware/system software issues.

- Issue "smitty alog_show" and check all available alog files (console, boot, dumpsymp etc.)

- Issue "diag -c -s" regularly to run system diagnostics.

- Make sure there is a valid dump device of sufficient size to hold a possible crash dump for later analysis by IBM support.
Issue "sysdumpdev -l" to see the dump device(s) and "/usr/lib/dumpcheck -p" to check the size (no messages mean "device is big enough").

- Extend syslog to capture system debugging information - add the line

*.debug /var/adm/debug.log

to /etc/syslog.conf, issue "touch /var/adm/debug.log" and "refresh -s syslogd"

Check the resulting file for hints.
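Several of the checks above could be bundled into one report script. This is a rough sketch under stated assumptions: the function names (`run_check`, `has_debug_rule`, `aix_health_report`) are made up for illustration, only commands already named in this post are called, and "diag -c -s" is left out because it is better run on its own. Commands that are not installed are skipped rather than treated as failures.

```shell
#!/bin/sh
# Run one labelled check; skip it cleanly if the command is not installed.
# All function names here are hypothetical helpers, not AIX tools.
run_check() {
    label="$1"; shift
    echo "== $label"
    if type "$1" >/dev/null 2>&1; then
        "$@" 2>&1
    else
        echo "skipped: $1 not found"
    fi
}

# Return 0 if the given syslog.conf already routes *.debug somewhere.
has_debug_rule() {
    grep -Eq '^[[:space:]]*\*\.debug[[:space:]]' "${1:-/etc/syslog.conf}" 2>/dev/null
}

# Collect the individual checks into one report.
aix_health_report() {
    run_check "error log summary" errpt
    run_check "dump devices"      sysdumpdev -l
    run_check "dump device size"  /usr/lib/dumpcheck -p
    if has_debug_rule; then
        echo "== syslog: *.debug rule present"
    else
        echo "== syslog: no *.debug rule in /etc/syslog.conf"
    fi
}
```

Calling `aix_health_report` as root, for example daily from cron, and comparing the output between runs may help spot what changed before an outage.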

wmp

madunix (Fadi SODAH), Chief Information Security Officer, Commented:
In my case I use Nagios to monitor the AIX OS. Look at:
http://nagios.org/
http://exchange.nagios.org/directory/Distributions/Pre%252DCompiled-Binaries/AIX
I use AIX 5.3.
Rhiaanon44 (Author) Commented:
Thanks to all of you!