Avatar of tomwe
tomwe
 asked on

SCO UNIX too many queued interupts

I have a SCO UNIX Open Server 5.0.7 system that had been running fine for years
however it started panicking with the error too many queued CPU interrupts.
after the 1st time it waited a couple months until it did it again, as time went on it got more frequent. now it is doing it a couple times a week.
I found a knowledge base article on the SCO web site that addressed it and said to adjust MAXACPUS, I have cranked that up to their max and still no good.
I am not an expert on SCO, just know enough to be dangerous. to the best of my knowledge the system is fully patched.
the system is an HP ML570 g4, 4 dual core CPU,s, 4gig memory, p600 controller card.
don't know now if I am looking at a software or hardware issue. if hardware what piece?

any ideas anyone?

thanks in advance
Tom
Unix OS

Avatar of undefined
Last Comment
tomwe

8/22/2022 - Mon
David Favor

Wow... SCO... I feel like I'm in the Wayback Machine...

This situation rarely occurs anymore, because hardware runs so fast.

For SCO, likely you're running very old hardware too.

My guess is you have some sort of failing device, like a disk or disk controller.

With failing disks, interrupts can get thrown in excess.

You might try simply replacing your disk.

Tip: You can test this theory by installing SCO on a fresh disk + installing the fresh installed disk. If problem persists, then likely the disk controller board will have to be replaced.

Tip: Rather than guessing, refer to your logs + look for hardware failures or odd messages.

SCO systems are rare these days, so you'll have to dig around to find the correct log file locations.

Likely /var/log/* will be your starting point.
tomwe

ASKER
David

thanks for the feedback.
yes I know the hardware is a dozen years old but I need to keep it going for another year or so.
trying a new hard drive is not that simple as I am running a disk array that has 18 drives, and I don't have the drives, also if the system isn't running 24/7 I am notified as soon as it goes down to get it back ASAP.
anyway thanks for the advice and I will try to see what I can do about swapping some things

Tom
dfke

Hi,

Without knowing the application that you're running it is a bit tricky to give advise as it might cripple the system even more.

To get a better understanding about how SCO handles interrupts, read up here.

Cheers
Experts Exchange is like having an extremely knowledgeable team sitting and waiting for your call. Couldn't do my job half as well as I do without it!
James Murphy
David Favor

Likely /var/log/* will provide you with only clues, as SCO... is, well... SCO...

So old + odd. Challenging to guess at what might be occurring.

Tip: You mentioned your running a disk array. This likely means you have some sort of custom i/o card in this machine, so best to setup a parallel RAID system today with same or greater space, then start nightly clones of your data...

Because if the problem is an i/o card about to die... you may be hard pressed to find a replacement card...

14TB drives can be had for <$500 now sometimes, so easy to replace an 18x disk array with smaller number of disks.
ASKER CERTIFIED SOLUTION
tomwe

THIS SOLUTION ONLY AVAILABLE TO MEMBERS.
View this solution by signing up for a free trial.
Members can start a 7-Day free trial and enjoy unlimited access to the platform.
See Pricing Options
Start Free Trial
GET A PERSONALIZED SOLUTION
Ask your own question & get feedback from real experts
Find out why thousands trust the EE community with their toughest problems.