Link to home
Start Free TrialLog in
Avatar of apunkabollywood
apunkabollywoodFlag for United States of America

asked on

qla2xxx scsi errors in the logs

Dec  8 11:56:36  kernel: st: Version 20070203, fixed bufsize 32768, s/g segs 256
Dec  8 11:56:36  kernel: st 0:0:10:0: Attached scsi tape st7
Dec  8 11:56:36  kernel: st7: try direct i/o: yes (alignment 512 B)
Dec  8 15:46:30  kernel: qla2xxx 0000:06:00.0: scsi(0:10:0): Abort command issued -- 1 10fb 2002.
Dec  8 15:46:41  kernel: qla2xxx 0000:06:00.0: scsi(0:10:0): Abort command issued -- 1 10fb 2002.
Dec  8 15:46:41  kernel: qla2xxx 0000:06:00.0: scsi(0:10:0): DEVICE RESET ISSUED.
Dec  8 15:46:41  kernel: qla2xxx 0000:06:00.0: scsi(0:10:0): DEVICE RESET SUCCEEDED.
Dec  8 15:46:52 rek-oradb-p04 kernel: qla2xxx 0000:06:00.0: scsi(0:10:0): Abort command issued -- 1 10fb 2002.
Dec  8 15:46:52  kernel: qla2xxx 0000:06:00.0: scsi(0:10:0): LOOP RESET ISSUED.
Dec  8 15:46:52  kernel: qla2xxx 0000:06:00.0: LOOP DOWN detected (0 0 0).
Dec  8 15:46:57  kernel: qla2xxx 0000:06:00.0: LOOP UP detected (8 Gbps).
Dec  8 15:47:18  kernel: qla2xxx 0000:06:00.0: qla2xxx_eh_bus_reset: reset succeeded
Dec  8 15:47:39  kernel: qla2xxx 0000:06:00.0: scsi(0:10:0): Abort command issued -- 1 10fb 2002.
Dec  8 15:47:39  kernel: qla2xxx 0000:06:00.0: scsi(0:10:0): ADAPTER RESET ISSUED.
Dec  8 15:47:39  kernel: qla2xxx 0000:06:00.0: Performing ISP error recovery - ha= ffff810139dd44f8.
Dec  8 15:47:45  kernel: qla2xxx 0000:06:00.0: LOOP UP detected (8 Gbps).
Dec  8 15:47:47  kernel: qla2xxx 0000:06:00.0: qla2xxx_eh_host_reset: reset succeeded
Dec  8 15:48:08  kernel: qla2xxx 0000:06:00.0: scsi(0:10:0): Abort command issued -- 1 10fb 2002.
Dec  8 15:48:08  kernel: st 0:0:10:0: scsi: Device offlined - not ready after error recovery
Dec  8 15:48:08  kernel: st 0:0:10:0: timing out command, waited 14000s
Dec  8 15:48:08  kernel: st7: Error 6080000 (sugg. bt 0x0, driver bt 0x6, host bt 0x8).
Avatar of apunkabollywood
apunkabollywood
Flag of United States of America image

ASKER

please help with the path forward thanks
Avatar of David
The HDD at scsi(0:10:0) is spinning down and generating LIPs.   Replace it.
If you reboot the machine then you'll be able to see the cross referencing of 0:10:0 to phys drive make/model and likely serial number.
Check with SAN End something is wrong either at SVC level or at switch level.
You'll note the errors start with the SCSI target device. the switch is not initiating the fault.
Please also consider the scsi_timout configured in system
The scsi_timeout threshold are a function of the device driver.  It is so overwhelmingly high that the only way one can get a timeout is if the SCSI target itself failed to respond to I/O requests.  This all indicates SCSI target device problems.

Besides, if it was a generic timeout issue, then all targets would exceed the threshold, not just one specific target, as shown by the logs.
ASKER CERTIFIED SOLUTION
Avatar of Member_2_231077
Member_2_231077

Link to home
membership
This solution is only available to members.
To access this solution, you must be a member of Experts Exchange.
Start Free Trial