We recently built a dual AMD Opteron Windows 2003 Standard server for a customer. Since we brought it into their network environment and loaded their applications, it has developed a bad habit. After some hours of operation (anywhere from 4 to 100), we see the second CPU at 100% utilization and the first CPU running at 10-25%. Everything is very sluggish: it takes a long time to log in, to refresh screens, to serve the applications, and to serve the database. But it does muddle through. We can shut it down gracefully and reboot, and then it comes back as quick and strong as it should be.
So I am running Task Manager to watch these CPU utilization conditions. If I look at the Processes tab and sort by CPU, it shows the System Idle Process at 95-99%, and no process appears to be soaking up CPU horsepower. Yet on the Performance tab, the graph for the second CPU is pegged at the top, with occasional drops to 98%.
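To put rough numbers on that mismatch, here is a quick back-of-the-envelope sketch in Python. It assumes the CPU column on the Processes tab is a percentage of the total capacity of both CPUs, which is my understanding but not something I have verified; the variable and function names are just for illustration, and the figures are the ones quoted above.

```python
# Figures reported above. Assumption: the Processes-tab CPU column is a
# percentage of total capacity across both CPUs.
cpu0_busy = (10.0, 25.0)      # first CPU, percent busy (low, high)
cpu1_busy = (98.0, 100.0)     # second CPU, percent busy (low, high)
idle_reported = (95.0, 99.0)  # System Idle Process, percent (low, high)


def expected_idle(busy_a, busy_b):
    """Idle percentage across two CPUs, given each CPU's busy percentage."""
    return 100.0 - (busy_a + busy_b) / 2.0


# Most idle case: both CPUs at their lowest busy figure.
idle_high = expected_idle(cpu0_busy[0], cpu1_busy[0])
# Least idle case: both CPUs at their highest busy figure.
idle_low = expected_idle(cpu0_busy[1], cpu1_busy[1])

# Gap between what the Processes tab reports and what the graphs imply.
gap_min = idle_reported[0] - idle_high
gap_max = idle_reported[1] - idle_low

print(f"expected idle: {idle_low:.1f}% to {idle_high:.1f}%")
print(f"unexplained gap: {gap_min:.1f} to {gap_max:.1f} points")
```

If that assumption holds, the Performance tab graphs imply the Idle process should be somewhere around 40%, not 95-99%, so roughly half of the machine's total CPU time is going somewhere that the Processes tab does not attribute to any process.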
The server is Windows 2003 Standard, SP1, fully patched, including security updates. It serves 8 workstations and is the Active Directory domain controller, the CA eTrust Anti-Virus server, and the 2Point FaxServe (formerly AccPac FaxServe) server. It is also running an instance of MSDE (the Microsoft SQL Server Desktop Engine) for an application called DocStar (a document imaging/storage/retrieval application). It has a Novell client loaded, and GroupWise. The event logs look clean. There are no obvious signs of hardware failure.
Hardware is a Gigabyte 7A8DRH motherboard with two Opteron 244 CPUs, 2 GB RAM, an Adaptec 2020ZCR RAID card, and a SCSI RAID-5 array.
What can I do to figure out what is saturating the second CPU?