Welcome to Experts Exchange

Add your voice to the tech community where 5M+ people, just like you, are talking about what matters.

  • Help others & share knowledge
  • Earn cash & points
  • Learn & ask questions
Solved

AIX 5.1 -- What is the load average listed by "uptime"?

Posted on 2010-09-01
6
1,241 Views
Last Modified: 2013-11-17
The load average listed by uptime, does not seem to be directly related to the idle time listed in topas.  

I have idle lingering between 0-20 in topas, and uptime is still only giving a 4 load average.

Basically -- I am trying to figure out if the system can handle 70 more telnet sessions.... we do interviewing, and have almost 300 interviewer seats now.  and I am trying to figure out if I think it could handle another 70.



0
Comment
Question by:Xetroximyn
  • 3
  • 3
6 Comments
 
LVL 68

Accepted Solution

by:
woolmilkporc earned 500 total points
ID: 33576999
The load average is the number of runnable processes over the preceding 1-, 5-, 15-minute intervals.
It's not really related to idle time.
If you have few heavy threads the load might be low, but CPU utilization might be high (giving low idle time).
On the other hand, if you have an application firing up many small, short living and inexpensive threads, load might be elevated, with low CPU utilization (and high idle time) nonetheless.
With a high I/O wait you will have a high load and much idle time as well. The same is true for a situation with high paging I/O.
As for the telnet sessions - the sessions themselves will not be a problem. The question is - what applications are started from those sessions, how much CPU will they need, how many child processes/threads will those applications launch?
And for topas - if you're running an  LPAR in shared processor mode, this value is misleading. In such a scenario topas will always show low idle time.  because the partition will cede its unused CPU share to other partitions.
If it's not an LPAR - idle% 0-20 is not very much. How high is your I/O wait? If this value is also low, I fear your machine will not be able to support lots of additional applications.
wmp
 
 
0
 

Author Comment

by:Xetroximyn
ID: 33577039
How do I find out if it is LPAR?

 
I watched topas for a few minutes last night when about half the ports in use.
Below are the ranges I was seeing.
   
Kernal = 8-25  
User = 25-80  
Wait 0-81  
Idle 0-65  
   
For a good while Idle was mostly lingering below 10-20  
0
 
LVL 68

Assisted Solution

by:woolmilkporc
woolmilkporc earned 500 total points
ID: 33582012
Issue

uname -L

If you get "-1 NULL" or "1 NULL" this is not an LPAR.

If no LPAR (which I assume with AIX 5.1),  idle% below 10 means that there is not much reserve regarding CPU capacity.

I'd suggest not to launch all 70 additional sessions at once.

Rather increase the load step by step while carefully watching your system. And ask your interviewers about the performance and response times of their application. Ask more than a few people, and ask at various times of day.

But (to repeat what I already wrote) - all depends on what will happen during those telnet sessions. How heavy is your "interview" application? Is a database involved? Does it generate much I/O traffic, maybe due to queries? Are there many calculations to do?

wmp
0
Free Tool: Port Scanner

Check which ports are open to the outside world. Helps make sure that your firewall rules are working as intended.

One of a set of tools we are providing to everyone as a way of saying thank you for being a part of the community.

 

Author Comment

by:Xetroximyn
ID: 33588688
I think a lot of the stuff programmers/project managers do on the system spikes up the CPU a lot.  (running reports etc)

I looked at the system while we were nearly full, and late enough that there probably was not much else being done on the system beside interviewing.   It still spiked up, but it was generally 20-40 idle a lot -- spiking up to 60 or so idle sometimes.  There were a couple periods of 10 or so seconds sustained at 50-60 idle.

I feel like maybe the interviewing is using 40-50% -- and other stuff (reports being run, etc) causes all the other less constant CPU usage. So that other stuff might just take longer to run if interviewing starts taking 60-70%.  

Do you have any thoughts?


If you are curious -- here is 5 minutes of topas.
http://screencast.com/t/ODdmNzM4MW


0
 
LVL 68

Assisted Solution

by:woolmilkporc
woolmilkporc earned 500 total points
ID: 33596772
Yep,
interesting stuff.
It seems that the idle% going down was in most cases due to wait% going up, which would indicate that your system is somewhat I/O constrained.
Your disk paths seem rather well balanced, so I don't think there are some real hotspots which could be spread across more disks.
Your users/jobs do produce peaks in CPU load, but the average percentage seems to stay below 50%.
I think in case you're actually running into performance problems with the new sessions moving to faster disks, maybe a SAN box with some (more) cache available will have better effect than changing CPUs.
Should you consider changing server hardware nonetheless, better think of faster than of more processors.
Maybe moving you report jobs (which might be I/O intensive) off-shift could also help a lot!
wmp
0
 

Author Closing Comment

by:Xetroximyn
ID: 33601445
Thanks!
0

Featured Post

Free Tool: ZipGrep

ZipGrep is a utility that can list and search zip (.war, .ear, .jar, etc) archives for text patterns, without the need to extract the archive's contents.

One of a set of tools we're offering as a way to say thank you for being a part of the community.

Question has a verified solution.

If you are experiencing a similar issue, please ask a related question

Attention: This article will no longer be maintained. If you have any questions, please feel free to mail me. jgh@FreeBSD.org Please see http://www.freebsd.org/doc/en_US.ISO8859-1/articles/freebsd-update-server/ for the updated article. It is avail…
Java performance on Solaris - Managing CPUs There are various resource controls in operating system which directly/indirectly influence the performance of application. one of the most important resource controls is "CPU".   In a multithreaded…
Learn how to find files with the shell using the find and locate commands. Use locate to find a needle in a haystack.: With locate, check if the file still exists.: Use find to get the actual location of the file.:
Learn how to navigate the file tree with the shell. Use pwd to print the current working directory: Use ls to list a directory's contents: Use cd to change to a new directory: Use wildcards instead of typing out long directory names: Use ../ to move…

840 members asked questions and received personalized solutions in the past 7 days.

Join the community of 500,000 technology professionals and ask your questions.

Join & Ask a Question