?
Solved

SCSI Bus Reset - Disk or SCSI adapter issue

Posted on 2013-12-20
7
Medium Priority
?
663 Views
Last Modified: 2014-02-11
One of our Solaris servers stopped responding this morning.  The application that runs on the server is vendor supported.  The vendor found no errors reported by iostat or prtdiag.  No amber light was showing on the server either.  However, two disks were disconnected.  They shutdown and turned the server backup on and were able to boot up.  

The only errors reported on the server are the one found in the /var/adm/messages which I have attached a copy of.

Although I understand that multiple hard disks may fail, I am not sure the issue are the HDs, SCSI controller or motherboard.  I was hoping someone could tell by perhaps looking at the attached log.  Please let me know if there are other commands that can be run that might give you a better idea of the problem.
messages.0.txt
0
Comment
Question by:cartereverett
[X]
Welcome to Experts Exchange

Add your voice to the tech community where 5M+ people just like you are talking about what matters.

  • Help others & share knowledge
  • Earn cash & points
  • Learn & ask questions
7 Comments
 
LVL 47

Expert Comment

by:David
ID: 39732518
Check cabling & termination. There are no other entries in the log that reveal other issues.
Now you can spend some money and buy some diagnostic software that will get to the bottom of things, but it probably isn't worth the money.
0
 
LVL 17

Expert Comment

by:Gerald Connolly
ID: 39733958
As David said check the cabling and termination.
NB. SCSI is a bus and requires termination at both ends of the bus.
No termination or multiple terminations per end will cause problems
0
 
LVL 62

Expert Comment

by:gheist
ID: 39734342
Would be nice if you provide reasonable system information e.g. at least if disks are builtin and if your server is a pc or sparc....

ASC 02 -> no seek complete... i.e scsi device did not do anything on command...
Given failing command is "write" you most likely lose 4KB every couple of minutes...

Fos system info send in prtconf (-v)

What do you mean by "vendor" - was it oracle saying continuous disk errors involving data loss is ok for them to leave?
0
The Ideal Solution for Multi-Display Applications

Check out ATEN’s VS1912 12-Port DP Video Wall Media Player at InfoComm 2017. Kerri describes how easy it is to design creative video walls in asymmetric layouts and schedule detailed playlists ahead of time with its advanced scheduling feature.

 
LVL 47

Expert Comment

by:David
ID: 39734372
Gheist - You are misreading this.
it is ASC=29h, ASCQ=02h, not ASC=02h.  This is defined as a SCSI bus reset per the ANSI spec.

A no seek complete would be ASC=02h, ASCQ=06h  (Which can't happen on a WRITE10 CDB anyway).
P.S. I write SCSI diagnostic code professionally.
0
 
LVL 16

Accepted Solution

by:
Joseph Gan earned 1500 total points
ID: 39735192
The system had lots of errors:

Dec 18 02:38:42 VRCdata.braishfield.local scsi: [ID 107833 kern.notice]       Requested Block: 114103696                 Error Block: 114103696
Dec 18 02:38:42 VRCdata.braishfield.local scsi: [ID 107833 kern.notice]       Vendor: FUJITSU                            Serial Number: 0745B0PAJU  

I asume this was a Fujitsu internal disk or disks, which has OS installed on it.

If you could show output of "iostat -En" here?
0
 

Author Comment

by:cartereverett
ID: 39735986
The issue was one of the hard disks in the data mirror.  Replaced the drive, resynced and everything is back to normal.
0
 
LVL 16

Expert Comment

by:Joseph Gan
ID: 39736986
Yes, that's it!
0

Featured Post

Supports up to 4K resolution!

The VS192 2-Port 4K DisplayPort Splitter is perfect for anyone who needs to send one source of DisplayPort high definition video to two or four DisplayPort displays. The VS192 can split and also expand DisplayPort audio/video signal on two or four DisplayPort monitors.

Question has a verified solution.

If you are experiencing a similar issue, please ask a related question

Attention: This article will no longer be maintained. If you have any questions, please feel free to mail me. jgh@FreeBSD.org Please see http://www.freebsd.org/doc/en_US.ISO8859-1/articles/freebsd-update-server/ for the updated article. It is avail…
Java performance on Solaris - Managing CPUs There are various resource controls in operating system which directly/indirectly influence the performance of application. one of the most important resource controls is "CPU".   In a multithreaded…
In a previous video, we went over how to export a DynamoDB table into Amazon S3.  In this video, we show how to load the export from S3 into a DynamoDB table.
In this video, Percona Director of Solution Engineering Jon Tobin discusses the function and features of Percona Server for MongoDB. How Percona can help Percona can help you determine if Percona Server for MongoDB is the right solution for …
Suggested Courses
Course of the Month13 days, 1 hour left to enroll

777 members asked questions and received personalized solutions in the past 7 days.

Join the community of 500,000 technology professionals and ask your questions.

Join & Ask a Question