Learn how to a build a cloud-first strategyRegister Now

x
  • Status: Solved
  • Priority: Medium
  • Security: Public
  • Views: 1177
  • Last Modified:

The device, \Device\Scsi\aac1, did not respond within the timeout period.

Hi Everyone,

One of my servers has crashed twice for pas two days, same time. In the logs it shows "The device, \Device\Scsi\aac1, did not respond within the timeout period"
The server is a HP Proliant Server and has Server 2003 with SP2 installed.
I ran the Proliant Integrated Management Log viewer and showed a Blue Screen Trap. The actual descriptions are:
Blue Screen Trap(BugCheck, STOP:0X00000077 (0x00000185, 0X00000185, 0X00000001, 0X0138E000))
Blue Screen Trap(BugCheck, STOP:0X0000007A ( (0xC02E686C, 0X00000185, 0XB9A1B8D7, 0X46F95860))

These are 2 separate errors, 1 each for the past 2 days.

Any help would be appreciated.

Kind Regards,
tiki1017
0
tiki1017
Asked:
tiki1017
  • 3
  • 2
1 Solution
 
DavidCommented:
This indicates timeouts due to deep recovery of disk drive. By any chance are these your basic desktop class drives ... or server / enterprise disks.

If desktop drives then you have a bunch of unrecovered or partially recoverable read errors, and that means that disks are probably due to be retired.

Look at RAID diagnostic logs for errors.
0
 
tiki1017Author Commented:
Hi dlethe,

Thanks for the prompt response. This is a server disk (SCSI) with 2 partitions (System and Data). Ill have alook at the Raid Logs and check what I can find.

Do you reckon it could also be a driver issue.
0
 
tiki1017Author Commented:
Hi dlethe,

On second thought the one I posted was the report given to me by the HP utility?
Is that the RAID diagnostic log? If not how to I run and generate the logs.
0
 
DavidCommented:
The HP utility knows what the physical disks are doing ... but be prepared for a drive failure and bad blocks, bad blocks on several drives, .. that sort of thing.  Unreadable data or lots of retries causing timeouts.  

Are these disks the HP drives, or some cheap desktop drives?  If they are the desktop drives the firmware is not correct (due to TLER timing), so now you are losing data.  

So official answer, is if these are desktop drives, replace all of them if you value your data and want to cure this.  They must have the proper firmware and be the correct type of disk for that controller.

If they are enterprise class disks, then this indicates (logs would confirm) at least one drive should be replaced.

A temporary treatment (AFTER YOU TAKE A FULL BACKUP) is to do the windows scan disk with the option to repair bad blocks.  This will force full read and make the raid controller repair any unreadable blocks. if it can't in time, then windows will probably let you know.  But this does have risk of causing a marginal disk to crash ... hence need to backup first.
0
 
tiki1017Author Commented:
Thanks dlethe! Information was very helpful
0

Featured Post

Upgrade your Question Security!

Add Premium security features to your question to ensure its privacy or anonymity. Learn more about your ability to control Question Security today.

  • 3
  • 2
Tackle projects and never again get stuck behind a technical roadblock.
Join Now