Question

SCO Unix Network Hang

Asked by: US-IT

Hi All,
We have a SCO OpenServer 5.0.7 server in our office. Lately (past week) it has been hanging very roughly every 12 hours, and I need to reboot. The reboot seems to clear it up, and it runs fine, for a while.  We have 2 remote NFS shares mounted to the local file system, and the box also serves several samba shares. When the system begins to hang, these become unreachable (NFS shares from SCO, Samba shares from remote systems). We have about 50 users logging in at a time to our foxbase applications. Sometimes (I think if not rebooted quick enough) it stops responding to any network protocol (ssh, telnet, etc.).

Here is the netstat -m from before the last reboot (this was fairly deep into the hang)
streams allocation:
                                             config    alloc     free       total      max     fail
stream                                8448      202     8246        2464      202        0
queues                                  908      413      495        4939      413        0
mblks                                 8634     8164      470    26890848     8599        0
buffer headers                  9018     8568      450      308777     8969        0
class  1,     64 bytes         396       74      322    10407791      393        0
class  2,    128 bytes         50       20       30     2273073       76        0
class  3,    256 bytes         44       28       16     5951744      585        0
class  4,    512 bytes         14       10        4       34800       58        1
class  5,   1024 bytes       31        0       31       38453       54        3
class  6,   2048 bytes      7310     7309        1     2284286     7310      804
class  7,   4096 bytes       532      532        0       15352      585 43361300
class  8,   8192 bytes        0        0        0      114823        9       47
class  9,  16384 bytes        0        0        0      224169        3       43
class 10,  32768 bytes        0        0        0      154546        3      893
class 11,  65536 bytes        0        0        0        1830        3       54
class 12, 131072 bytes        0        0        0           0        0        0
class 13, 262144 bytes        0        0        0           0        0        0
class 14, 524288 bytes        0        0        0           0        0        0
total configured streams memory: 17024.00KB
streams memory in use: 17129.64KB
maximum streams memory used: 18012.29KB


I get this error message repeated quite a bit:
WARNING: allocb failed - NSTRPAGES exceeded


I am out of ideas where to look next, any help is greatly appreciated. Thank you.

This Question has been solved and asker verified All Experts Exchange premium technology solutions are available to subscription members.

Subscribe now for full access to Experts Exchange and get

Instant Access to this Solution

  • Plus...
  • 30 Day FREE access, no risk, no obligation
  • Collaborate with the world's top tech experts
  • Unlimited access to our exclusive solution database
  • Never be left without tech help again

Subscribe Now

Asked On
2008-01-24 at 12:08:39ID23109001
Tags

UNIX

,

SCO OpenServer

,

5.0.7

Topics

SCO Unix

,

Unix Operating Systems

Participating Experts
4
Points
500
Comments
11

Trusted by hundreds of thousands everyday for fast, accurate and reliable tech support.

  • "The time we save is the biggest benefit of Experts Exchange to Warner Bros. What could take multiple guys 2 hours or more each to find is accessed in around 15 minutes on Experts Exchange." Mike Kapnisakis, Warner Bros.
  • "Our team likes having a resource that is more secure than just using Google and most experts using this service really know their stuff. It's nice to look here first versus using Google." Dayna Sellner, Lockheed Martin
  • "Anytime that I've been stumped with a problem, 9 out of 10 times Experts Exchange has either the accepted solution or an open discussion of the potential solution to the problem." Kenny Red, eBay Inc.

See what Experts Exchange can do for you.

Got a question?

We've got the answer.

Experts Exchange has been collecting answers to technology questions since 1996…3 million and counting! If you have a question, chances are we already have your answer.

Screenshot of Experts Exchange Knowledgebase

Need individual assistance?

Our experts are ready to help.

If you can't find the exact answer you're looking for, ask our exclusive community of 50,000 experts. You’ll get a personalized answer from a trusted professional.

Screenshot of Experts Exchange Knowledgebase

Want to learn from the best?

Read articles from industry experts.

Thousands of free tech tips, tricks, how-to’s and tutorials are available in our peer reviewed articles section. See for yourself how smart our experts are, no login required.

Screenshot of an Article

Working on a long term project?

Store your work and research.

Save solutions to your questions, answers you’ve discovered through searching plus helpful articles in your personal knowledgebase for easy future access.

Screenshot of Experts Exchange Knowledgebase

Access the answers to your technology questions today.

Subscribe Now

30-day free trial. Register in 60 seconds.

What Makes Experts Exchange Unique?

Members of the expert community talk about why the experience at Experts Exchange is different than what you will find anywhere else.

Trusted by the world's most respected brands.

image of each brand's logo

Faithfully serving IT professionals since 1996.

Experts Exchange Logo

Try it out and discover for yourself.

Subscribe Now

30-day free trial. Register in 60 seconds.

Related Solutions

  1. SCO
    What is SCO UnixWare and SCO OpenServer ? Is there any relationship between them ? Andrew
  2. SCO Unix
    Do I need anything additional(that is not come with SCO Unix) if I would allow Dial-in to my server ?
  3. SCO OpenServer 5.0.5  -- 80 Second Login Problem
    Hello, We have 2 SCO OpenServer 5.0.5 UNIX Servers. We are having a problem when remotely logging in (via telnet or ssh) that it waits exactly 80 seconds before giving up a login prompt after connection. Example: ---top--- Attempting connection to node 10.20.20.10... Conne...
  4. SCO Openserver - low on space
    is there a simple command to find out how much space i have left on a particular drive? my root was running low and I have freed up some space, not sure how much % i have freed up. i am running SCO openserver 5
  5. Samba, Sco Unix & Windows Server
    My goal is to move data from Sco Unix to Windows Server. I have loaded samba and is able to see the Unix Server Directory to be backup to the Windows Server. However, I'm not sure what's the best route to take to move the data. my initial thought was to mount the windows d...

Free Tech Articles

  1. WARNING: 5 Reasons why you should NEVER fix a computer for free.
    It is in our nature to love the puzzle. We are obsessed. The lot of us. We love puzzles. We love the challenge. We thrive on finding the answer. We hate disarray. It bothers us deep in our soul. W...
  2. SCCM OSD Basic troubleshooting
    SCCM 2007 OSD is a fantastic way to deploy operating systems, however, like most things SCCM issues can sometimes be difficult to resolve due to the sheer volume of logs to sift through and the dispe...
  3. Migrate Small Business Server 2003 to Exchange 2010 and Windows 2008 R2
    This guide is intended to provide step by step instructions on how to migrate from Small Business Server 2003 to Windows 2008 R2 with Exchange 2010. For this migration to work you will need the fo...
  4. Create a Win7 Gadget
    This article shows you how to create a simple "Gadget" -- a sort of mini-application supported by Windows 7 and Vista. Gadgets can be dropped anywhere on the desktop to provide instant information, ...
  5. Outlook continually prompting for username and password
    There have been a lot of questions recently regarding Outlook prompting for a username and password whilst using Exchange 2007. There are a few reasons why this would happen and I will try to cover t...
  6. Backup Exchange 2010 Information Store using Windows Backup
    There seems to be quite a lot of confusion around the ability to backup Exchange 2010 using the built in Windows Backup feature. This stems from the omission of this feature prior to Exchange 2007 s...

Cloud Class Webinars

  1. Avoiding Bugs in Microsoft Access
    Alison Balter takes and in-depth look at avoiding bugs in Access. In this webinar you will learn about using the immediate window to debug your applications, invoking the debugger, using breakpoints to troubleshoot, stepping through code, setting the next statement to execute, ...
  2. Top 10 Best New Features in Visio 2010
    Scott Helmers gives live demonstrations of the top 10 new features in Visio 2010. This webinar will teach you how to create compelling diagrams by adding shapes to the page with a single click, linking the shapes in a diagram to data in Excel (or SQL Server, or SharePoint), ...
  3. IT Consultant Business Secrets Revealed
    Michael Munger, Experts Exchange tech pro and IT consultant, pulls back the curtain on his very successful businesses and answers question on every IT consultant and business owner should know about. He shares secrets on what he did to solve the 5 most common problems in IT, ...
  4. Disaster Recovery and Business Continuity
    Quest CTO, Mike Billon, gives an overview of the steps involved in building a dunamic disaster recovery plan. Through case studies and an examination of software/hardware tooles for monitoring and testing, you'll gain a better understandin of where you are, where you want ...
  5. Organize Your Visio Diagrams with Containers and Lists
    Scott Helmers uses cross functional flowcharts, wireframe diagrams, data graphic legends and seating charts to teach you: how to ustilize all three new structured diagram components in Visio 2010, the best practices for organizeing shapes in previous version of Visio, how to organize ...
  6. How to Us Objects, Properties, Events and Methods in Microsoft Access
    Alison Dalter gives an in-depbth look at objects, properties, events and methods in Microsoft Access. In this webinar you will learn about using the object browser, referring to objects, working with properties and methods, working with object variables, understanding the ...

Join the Community

Give a Little. Get a Lot.

Join the community of experts here and help other tech pros by answering question in your area of expertise. You can earn FREE access to all Experts Exchange's premium features and resources.

Join the Community

Answers

 

by: dfkePosted on 2008-01-25 at 01:09:01ID: 20741089

Looks like a kernel issue as the fail colunm should be all zero's.
I see that the total configured streams memory almost matches the streams memory in use. You should increase the  number of NSTRPAGES.  NSTRPAGES controls the number of 4K pages  of memory that can be dynamically allocated for STREAMS use.

Furthermore NSTREAM should be set to at least 256 on systems that mount NFS-filesystems or invoke remote X clients.

 

by: Smart_ManPosted on 2008-01-25 at 01:42:56ID: 20741243

right , looks liek a bottle neck issue . needs to reduce users or increase allocated resources. or check for teh hardware real phisycal limit.

 

by: US-ITPosted on 2008-01-25 at 04:17:48ID: 20741792

I'm thinking it is a kernel issue as well. Last night, I disconnected a mapped drive we had set up from a 2003 sever, and I ran /etc/conf/cf.d/configure to change the streams values. The server has been up for almost 19 hours, with only only 2 fails in class 8, and one in class 7. Are any fails acceptable? Or do I need to get these down to 0? Also, I say I ran the configure command for the streams, but with only reading about it a little I wasn't too comfortable with the changes. They seemed to have helped, but there were a lot of parameters to change. Which ones, or all of them, should I be focused on?

 

by: US-ITPosted on 2008-01-25 at 05:55:35ID: 20742432

Spoke too early. After little over 20 hours:

streams allocation:
                         config    alloc     free       total      max     fail
stream                    15000      332    14668        7527      333        0
queues                     1362      674      688       15066      676        0
mblks                     16996    16785      211    59642597    16939        0
buffer headers            17082    16998       84     3695278    17058   237255
class  1,     64 bytes      342      256       86    25284596      383        0
class  2,    128 bytes      213      192       21     4416542      212        0
class  3,    256 bytes      322      253       69    10898034     1052       26
class  4,    512 bytes       13       11        2       63549       44        4
class  5,   1024 bytes       33        0       33       49609       70        8
class  6,   2048 bytes    14742    14740        2     4360914    14741        1
class  7,   4096 bytes     1000     1000        0       22585     1050      640
class  8,   8192 bytes        0        0        0      191912        9      147
class  9,  16384 bytes        0        0        0      414915        4        2
class 10,  32768 bytes        0        0        0      290699        3        2
class 11,  65536 bytes        0        0        0        1993        3        0
class 12, 131072 bytes        0        0        0           0        0        0
class 13, 262144 bytes        0        0        0           0        0        0
class 14, 524288 bytes        0        0        0           0        0        0
total configured streams memory: 32000.00KB
streams memory in use: 34311.99KB
maximum streams memory used: 35239.42KB

Note: Users began logging in after the 19th hour of uptime. Also note, we have a web application that accesses a shared drive, most likely traffic beginning around the same time.

 

by: US-ITPosted on 2008-01-25 at 06:12:23ID: 20742584

Don't know if this information helps at all.

Client nfs:
calls      badcalls   nclget     nclsleep
34186      0          34223      0          
null       getattr    setattr    root       lookup     readlink   read      
0  0%      2279  6%   10  0%     0  0%      4097 11%   0  0%      17128 50%  
wrcache    write      create     remove     rename     link       symlink    
0  0%      9519 27%   429  1%    39  0%     79  0%     0  0%      0  0%      
mkdir      rmdir      readdir    fsstat    
0  0%      0  0%      452  1%    154  0%    



$ ls
[Lists all files in nfs mount]
$ l
[hangs]

 

by: dfkePosted on 2008-01-25 at 07:30:21ID: 20743229

Ok just maybe there is a problem with the network card or driver. Check to see if there is an updated driver of try to switch cards.

If that doesn't help you can try to sniff the packets in some way and compare the timings with the netstat -m output:

Try a shell script that records `netstat -m` output:

while :; do
date
netstat -m
sleep 1      #
done > netstat-m.log

Meanwhile, put a packet sniffer on the LAN, tell it to capture
everything being sent to servers IP address.  Try to make sure the
sniffer and server agree closely about the time (within a second or
better).  Then run the sniff for long enough to observe the buffers
rising significantly, according to the `netstat -m` log.

You should be able to identify specific times when buffers were
consumed.  Look at the corresponding times in the sniffer log: is there
a particular kind of incoming packet that seems to be causing this?

 

by: US-ITPosted on 2008-01-25 at 13:06:29ID: 20746340

I believe the NIC is onboard. This may be  a dumb question, but, what would be the best brand/model NIC to use to try out (DELL Power Edge 2500)? My knowledge is much more suited to Linux, so while I know some things, I'm almost a newcomer to SCO/Unix.

I will work on getting the packet sniffer going.

Thanks for your help.

 

by: Smart_ManPosted on 2008-01-25 at 13:31:46ID: 20746539

what about having 2 nics ? plus teh built-in . all on teh same networking prviding the same service ?

packet sniffer is a good idea. but you may try a bandwidth manager and it is a better idea , so you can both monitor and control overshots

 

by: mikelfritzPosted on 2008-01-28 at 21:43:31ID: 20765674

You can get a free trial of SarCheck that will ID all of the kernel tunables to adjust - you may need to get to near crash status to have it give the desired result.

Sarcheck:

http://www.sarcheck.com/scosr5.htm

go back to the home page and you can find the free trial.

 

by: mikelfritzPosted on 2008-01-28 at 22:01:41ID: 20765725

Also - make sure you are not running out of space on the /, /var (if it's there), /usr (if it's there) filesystems.

Look at:
http://docsrv.sco.com:507/en/PERFORM/kernel_configure.html

In particular:  
STRMSGSZ
   

Although, if the problem is new, and no configuration changes were made before the problem cropped up, I'd suspect either a network issue or a Chatty Cathy client inundating the server with packets.  

 

by: yotechPosted on 2008-02-06 at 17:41:14ID: 20837395

Check the way your connections to samba shares are being made, It's better to map the share on client
computers than have them reconnect every time then need access. The
netstat -an | grep 139
should show you current connections to samba shares. If you run the command every few seconds and the remote ports keeps changing you will  eventually run out of stream resources the way it's happening now. Some network card drivers may also be causing a memory leak, but mapping shares should take care of your problem or aliviate it to a more managable level.

20120131-EE-VQP-002

3 Ways to Join

30-Day Free Trial

The Experts

98% positive feedback on 31,087 answers since March 2000. angeliii is a Microsoft Most Valuable Professional for his work with MS SQL Server & Develoment.

He has also proven his knowledge of Visual Basic Programming, PHP Scripting and Oracle Databases.

The Experts

97% positive feedback on 10,752 answers since July 2000. lrmoore has more than 18 years experience in the networking industry.

The six-time Mircosoft MVPs specialties include firewalls, virtual private networking, and network management.

Testimonials

"...and excellent source for support... Kind of like having your very own IT dept." Electriciansnet

Testimonials

"I was apprehensive at signing up at first. However... it has already made my life as an IT administrator much easier." JaCrews

Testimonials

"WOW! You guys have great, active, and knowledgeable people on here." moore50

Business Clients

Business Clients

In the Press

"If you’ve got a question... Experts Exchange can supply an answer.”

In the Press

"...an invaluable aid for both IT professionals and those who require tech support."

In the Press

"where IT professionals provide quick answers on just about any topic"

Business Account Plans

Loading Advertisement...