Solved

AIX 5.1 -- What is the load average listed by "uptime"?

Posted on 2010-09-01
6
1,251 Views
Last Modified: 2013-11-17
The load average listed by uptime, does not seem to be directly related to the idle time listed in topas.  

I have idle lingering between 0-20 in topas, and uptime is still only giving a 4 load average.

Basically -- I am trying to figure out if the system can handle 70 more telnet sessions.... we do interviewing, and have almost 300 interviewer seats now.  and I am trying to figure out if I think it could handle another 70.



0
Comment
Question by:Xetroximyn
[X]
Welcome to Experts Exchange

Add your voice to the tech community where 5M+ people just like you are talking about what matters.

  • Help others & share knowledge
  • Earn cash & points
  • Learn & ask questions
  • 3
  • 3
6 Comments
 
LVL 68

Accepted Solution

by:
woolmilkporc earned 500 total points
ID: 33576999
The load average is the number of runnable processes over the preceding 1-, 5-, 15-minute intervals.
It's not really related to idle time.
If you have few heavy threads the load might be low, but CPU utilization might be high (giving low idle time).
On the other hand, if you have an application firing up many small, short living and inexpensive threads, load might be elevated, with low CPU utilization (and high idle time) nonetheless.
With a high I/O wait you will have a high load and much idle time as well. The same is true for a situation with high paging I/O.
As for the telnet sessions - the sessions themselves will not be a problem. The question is - what applications are started from those sessions, how much CPU will they need, how many child processes/threads will those applications launch?
And for topas - if you're running an  LPAR in shared processor mode, this value is misleading. In such a scenario topas will always show low idle time.  because the partition will cede its unused CPU share to other partitions.
If it's not an LPAR - idle% 0-20 is not very much. How high is your I/O wait? If this value is also low, I fear your machine will not be able to support lots of additional applications.
wmp
 
 
0
 

Author Comment

by:Xetroximyn
ID: 33577039
How do I find out if it is LPAR?

 
I watched topas for a few minutes last night when about half the ports in use.
Below are the ranges I was seeing.
   
Kernal = 8-25  
User = 25-80  
Wait 0-81  
Idle 0-65  
   
For a good while Idle was mostly lingering below 10-20  
0
 
LVL 68

Assisted Solution

by:woolmilkporc
woolmilkporc earned 500 total points
ID: 33582012
Issue

uname -L

If you get "-1 NULL" or "1 NULL" this is not an LPAR.

If no LPAR (which I assume with AIX 5.1),  idle% below 10 means that there is not much reserve regarding CPU capacity.

I'd suggest not to launch all 70 additional sessions at once.

Rather increase the load step by step while carefully watching your system. And ask your interviewers about the performance and response times of their application. Ask more than a few people, and ask at various times of day.

But (to repeat what I already wrote) - all depends on what will happen during those telnet sessions. How heavy is your "interview" application? Is a database involved? Does it generate much I/O traffic, maybe due to queries? Are there many calculations to do?

wmp
0
Technology Partners: We Want Your Opinion!

We value your feedback.

Take our survey and automatically be enter to win anyone of the following:
Yeti Cooler, Amazon eGift Card, and Movie eGift Card!

 

Author Comment

by:Xetroximyn
ID: 33588688
I think a lot of the stuff programmers/project managers do on the system spikes up the CPU a lot.  (running reports etc)

I looked at the system while we were nearly full, and late enough that there probably was not much else being done on the system beside interviewing.   It still spiked up, but it was generally 20-40 idle a lot -- spiking up to 60 or so idle sometimes.  There were a couple periods of 10 or so seconds sustained at 50-60 idle.

I feel like maybe the interviewing is using 40-50% -- and other stuff (reports being run, etc) causes all the other less constant CPU usage. So that other stuff might just take longer to run if interviewing starts taking 60-70%.  

Do you have any thoughts?


If you are curious -- here is 5 minutes of topas.
http://screencast.com/t/ODdmNzM4MW


0
 
LVL 68

Assisted Solution

by:woolmilkporc
woolmilkporc earned 500 total points
ID: 33596772
Yep,
interesting stuff.
It seems that the idle% going down was in most cases due to wait% going up, which would indicate that your system is somewhat I/O constrained.
Your disk paths seem rather well balanced, so I don't think there are some real hotspots which could be spread across more disks.
Your users/jobs do produce peaks in CPU load, but the average percentage seems to stay below 50%.
I think in case you're actually running into performance problems with the new sessions moving to faster disks, maybe a SAN box with some (more) cache available will have better effect than changing CPUs.
Should you consider changing server hardware nonetheless, better think of faster than of more processors.
Maybe moving you report jobs (which might be I/O intensive) off-shift could also help a lot!
wmp
0
 

Author Closing Comment

by:Xetroximyn
ID: 33601445
Thanks!
0

Featured Post

Secure Your Active Directory - April 20, 2017

Active Directory plays a critical role in your company’s IT infrastructure and keeping it secure in today’s hacker-infested world is a must.
Microsoft published 300+ pages of guidance, but who has the time, money, and resources to implement? Register now to find an easier way.

Question has a verified solution.

If you are experiencing a similar issue, please ask a related question

Suggested Solutions

Title # Comments Views Activity
Skill Development 15 195
OpenLDAP Proxy to Active Directy 6 465
Write an app 10 74
looking for unix program to search some keywords in side EAR WAR and JAR files 10 111
When you do backups in the Solaris Operating System, the file system must be inactive. Otherwise, the output may be inconsistent. A file system is inactive when it's unmounted or it's write-locked by the operating system. Although the fssnap utility…
Introduction Regular patching is part of a system administrator's tasks. However, many patches require that the system be in single-user mode before they can be installed. A cluster patch in particular can take quite a while to apply if the machine…
Learn how to find files with the shell using the find and locate commands. Use locate to find a needle in a haystack.: With locate, check if the file still exists.: Use find to get the actual location of the file.:
Learn how to navigate the file tree with the shell. Use pwd to print the current working directory: Use ls to list a directory's contents: Use cd to change to a new directory: Use wildcards instead of typing out long directory names: Use ../ to move…

730 members asked questions and received personalized solutions in the past 7 days.

Join the community of 500,000 technology professionals and ask your questions.

Join & Ask a Question