Solved

Hang when invoking the autochanger device

Posted on 2008-06-11
Last Modified: 2013-12-21
This follows on from the question "http://www.experts-exchange.com/OS/Unix/Solaris/Q_23413012.html#a21749881". I have a Sun StorEdge L8 connected to a Solaris 9 server with Solstice Backup 6.1 Build 186 installed.

NetWorker hangs whenever I try to use the tape changer (inventory, label, mount or unmount).

In the jukebox menu I can see that the jukebox is installed; the control port is "scsidev@2.0.0" and the device is "/dev/rmt/0cbn".

The device must have worked in the past, because there are some test backup scripts on the machine. What should I do now?

BTW, inquire shows:-
/etc/LGTOuscsi/inquire
scsidev@0.6.0:TOSHIBA DVD-ROM SD-M14011009|CD-ROM
scsidev@2.0.0:ADIC    FastStor 2      A12r|Autochanger (Jukebox)
scsidev@2.5.0:HP      Ultrium 1-SCSI  E3  |Tape
scsidev@2.21.0:HP      Ultrium 1-SCSI  E3  |Tape

where cfgadm -alv shows:-
Ap_Id                          Receptacle   Occupant     Condition  Information
When         Type         Busy     Phys_Id
c0                             connected    configured   unknown
unavailable  scsi-bus     n        /devices/pci@8,700000/scsi@6:scsi
c0::/dev/lus                   connected    configured   unknown    TOSHIBA DVD-ROM SD-M1401
unavailable  CD-ROM       n        /devices/pci@8,700000/scsi@6:scsi::/dev/lus
c0::dsk/c0t6d0                 connected    configured   unknown    TOSHIBA DVD-ROM SD-M1401
unavailable  CD-ROM       y        /devices/pci@8,700000/scsi@6:scsi::dsk/c0t6d0
c1                             connected    configured   unknown
unavailable  fc-private   n        /devices/pci@8,600000/SUNW,qlc@4/fp@0,0:fc
c1::21000004cfe34092           connected    configured   unknown    SEAGATE  ST373307FSUN72G
unavailable  disk         n        /devices/pci@8,600000/SUNW,qlc@4/fp@0,0:fc::21000004cfe34092
c1::21000004cfe34350           connected    configured   unknown    SEAGATE  ST373307FSUN72G
unavailable  disk         n        /devices/pci@8,600000/SUNW,qlc@4/fp@0,0:fc::21000004cfe34350
c2                             connected    configured   unknown
unavailable  scsi-bus     n        /devices/pci@8,700000/scsi@6,1:scsi
c2::lus1                       connected    configured   unknown
unavailable  unknown      n        /devices/pci@8,700000/scsi@6,1:scsi::lus1
c3                             connected    configured   unknown
unavailable  scsi-bus     n        /devices/pci@8,600000/pci@1/scsi@4:scsi
c3::lus2                       connected    configured   unknown    HP Ultrium 1-SCSI
unavailable  unknown      n        /devices/pci@8,600000/pci@1/scsi@4:scsi::lus2
c3::rmt/0                      connected    configured   unknown    HP Ultrium 1-SCSI
unavailable  tape         n        /devices/pci@8,600000/pci@1/scsi@4:scsi::rmt/0
c4                             connected    configured   unknown
unavailable  scsi-bus     n        /devices/pci@8,600000/pci@1/scsi@5:scsi
c4::lus3                       connected    configured   unknown
unavailable  unknown      n        /devices/pci@8,600000/pci@1/scsi@5:scsi::lus3
usb0/1                         connected    configured   ok         Mfg: <undef>  Product: <undef>  NConfigs: 1  Config: 0  <no cfg str descr>
unavailable  usb-kbd      n        /devices/pci@8,700000/usb@5,3:1
usb0/2                         empty        unconfigured ok
unavailable  unknown      n        /devices/pci@8,700000/usb@5,3:2
usb0/3                         connected    configured   ok         Mfg: <undef>  Product: <undef>  NConfigs: 1  Config: 0  <no cfg str descr>
unavailable  usb-mouse    n        /devices/pci@8,700000/usb@5,3:3
usb0/4                         empty        unconfigured ok
unavailable  unknown      n        /devices/pci@8,700000/usb@5,3:4

Question by:frankcheong
9 Comments
 
LVL 16

Expert Comment

by:Hanno Schröder
what does
  nsrjb
show? Any output?

Have you tried resetting your JB first, using
  nsrjb -HHE
 
LVL 4

Author Comment

by:frankcheong
nsrjb shows

Jukebox L8:
  slot  volume                       pool            volume id        recyclable
     1: dwbak.inc.day0.wk0.001*                      2481680129
     2: dwbak.inc.day0.wk0.002       DW Incremental  2481704705       no
     3: dwbak.inc.day0.wk0.003       DW Incremental  2481729793       no
     4: -*                                           -
     5: dwbak.inc.day0.wk0.005       DW Incremental  2481779201       no
     6: dwbak.inc.day0.wk0.006       DW Incremental  2481804289       no
     7: dwbak.inc.day0.wk0.007       DW Incremental  2481829889       no
     8: dwbak.inc.day0.wk0.008       DW Incremental  2481854209       no
        *not registered in the NetWorker media data base

drive 1 (/dev/rmt/0cbn) slot  5: dwbak.inc.day0.wk0.005

nsrjb -HE hangs indefinitely, and so does nsrjb -HHE.

devfsadm -c tape also hangs, while plain devfsadm runs without problems.

cfgadm -alv c3 gives the following:

Ap_Id                          Receptacle   Occupant     Condition  Information
When         Type         Busy     Phys_Id
c3                             connected    configured   unknown
unavailable  scsi-bus     n        /devices/pci@8,600000/pci@1/scsi@4:scsi
c3::lus2                       connected    configured   unknown    HP Ultrium 1-SCSI
unavailable  unknown      n        /devices/pci@8,600000/pci@1/scsi@4:scsi::lus2
c3::rmt/0                      connected    configured   unknown    HP Ultrium 1-SCSI
unavailable  tape         n        /devices/pci@8,600000/pci@1/scsi@4:scsi::rmt/0


/etc/LGTOuscsi/inquire gives the same output as before:
scsidev@0.6.0:TOSHIBA DVD-ROM SD-M14011009|CD-ROM
scsidev@2.0.0:ADIC    FastStor 2      A12r|Autochanger (Jukebox)
scsidev@2.5.0:HP      Ultrium 1-SCSI  E3  |Tape
scsidev@2.21.0:HP      Ultrium 1-SCSI  E3  |Tape

What can I do now?
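
To compare what the OS sees with what inquire sees, the tape attachment points can be picked out of a cfgadm -alv listing like the ones above. This is only a sketch: it assumes the two-lines-per-entry layout shown in this thread (two header lines, then an Ap_Id line followed by a When/Type/Busy/Phys_Id line), and the function name tape_attachment_points is made up for illustration.

```shell
# Print "Ap_Id Phys_Id" for every entry whose Type column is "tape".
# Assumes the layout shown in this thread: two header lines, then
# two lines per attachment point, with no blank lines in between.
tape_attachment_points() {
    awk 'NR <= 2 { next }                 # skip the two header lines
         NR % 2 == 1 { ap = $1; next }    # first line of an entry: remember Ap_Id
         $2 == "tape" { print ap, $4 }    # second line: When Type Busy Phys_Id
        '
}
```

Usage would be something like "cfgadm -alv | tape_attachment_points". With the listing above it should report only c3::rmt/0, whereas inquire lists two tape entries.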
 
LVL 16

Expert Comment

by:Hanno Schröder
a) Does the output from "nsrjb" match reality?
    Is this what you really have in your box?
b) Some config in your system got screwed up :-(
    As you only have one tape drive, inquire should
    only find one (!)
    Have the cables on controller c3 checked (disconnect and
    reconnect, check for bent pins etc.). After this has been
    done, power off the jukebox (and the tape drive, if it has a
    separate power supply), wait some time and power back on.
    Wait for the jukebox to finish its self-test.
    Then try another
        devfsadm -c tape -v -C
    Now, run "nsrjb" to see what it displays. Is this the same
    as your real (physical) situation?
c) You may want to run
         nsrjb -EH
    now.
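
The check in (b), that inquire should report exactly one tape drive, could be scripted roughly like this. It is a minimal sketch that assumes the "address:vendor product rev|type" line format shown in the inquire output in this thread; the function names count_tapes and tape_addresses are made up for illustration. On Solaris, nawk or /usr/xpg4/bin/awk may be needed in place of awk.

```shell
# Count the entries that inquire reports as tape drives.
# Input: inquire-style lines on stdin, for example
#   scsidev@2.5.0:HP      Ultrium 1-SCSI  E3  |Tape
count_tapes() {
    awk -F'|' '$2 ~ /^Tape/ { n++ } END { print n + 0 }'
}

# List the scsidev addresses of those tape entries, one per line.
tape_addresses() {
    awk -F'|' '$2 ~ /^Tape/ { split($1, a, ":"); print a[1] }'
}
```

Usage would be something like "/etc/LGTOuscsi/inquire | count_tapes". Anything other than 1 on a single-drive library suggests a stale or duplicated device entry.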
 
LVL 4

Author Comment

by:frankcheong
The nsrjb output is probably past history, because I have been using tar and changing slots manually for quite some time already.

Now I disconnected the cable and ran devfsadm; cfgadm -alv then shows

cfgadm -alv
Ap_Id                          Receptacle   Occupant     Condition  Information
When         Type         Busy     Phys_Id
c0                             connected    configured   unknown
unavailable  scsi-bus     n        /devices/pci@8,700000/scsi@6:scsi
c0::/dev/lus                   connected    configured   unknown    TOSHIBA DVD-ROM SD-M1401
unavailable  CD-ROM       n        /devices/pci@8,700000/scsi@6:scsi::/dev/lus
c0::dsk/c0t6d0                 connected    configured   unknown    TOSHIBA DVD-ROM SD-M1401
unavailable  CD-ROM       y        /devices/pci@8,700000/scsi@6:scsi::dsk/c0t6d0
c1                             connected    configured   unknown
unavailable  fc-private   n        /devices/pci@8,600000/SUNW,qlc@4/fp@0,0:fc
c1::21000004cfe34092           connected    configured   unknown    SEAGATE  ST373307FSUN72G
unavailable  disk         n        /devices/pci@8,600000/SUNW,qlc@4/fp@0,0:fc::21000004cfe34092
c1::21000004cfe34350           connected    configured   unknown    SEAGATE  ST373307FSUN72G
unavailable  disk         n        /devices/pci@8,600000/SUNW,qlc@4/fp@0,0:fc::21000004cfe34350
c2                             connected    configured   unknown
unavailable  scsi-bus     n        /devices/pci@8,700000/scsi@6,1:scsi
c2::lus1                       connected    configured   unknown
unavailable  unknown      n        /devices/pci@8,700000/scsi@6,1:scsi::lus1
c3                             connected    configured   unknown
unavailable  scsi-bus     n        /devices/pci@8,600000/pci@1/scsi@4:scsi
c3::lus2                       connected    configured   unknown    HP Ultrium 1-SCSI
unavailable  unknown      n        /devices/pci@8,600000/pci@1/scsi@4:scsi::lus2
c3::rmt/0                      connected    configured   unknown    HP Ultrium 1-SCSI
unavailable  tape         n        /devices/pci@8,600000/pci@1/scsi@4:scsi::rmt/0
c4                             connected    configured   unknown
unavailable  scsi-bus     n        /devices/pci@8,600000/pci@1/scsi@5:scsi
c4::lus3                       connected    configured   unknown
unavailable  unknown      n        /devices/pci@8,600000/pci@1/scsi@5:scsi::lus3
usb0/1                         connected    configured   ok         Mfg: <undef>  Product: <undef>  NConfigs: 1  Config: 0  <no cfg str descr>
unavailable  usb-kbd      n        /devices/pci@8,700000/usb@5,3:1
usb0/2                         empty        unconfigured ok
unavailable  unknown      n        /devices/pci@8,700000/usb@5,3:2
usb0/3                         connected    configured   ok         Mfg: <undef>  Product: <undef>  NConfigs: 1  Config: 0  <no cfg str descr>
unavailable  usb-mouse    n        /devices/pci@8,700000/usb@5,3:3
usb0/4                         empty        unconfigured ok
unavailable  unknown      n        /devices/pci@8,700000/usb@5,3:4

and inquire still shows
scsidev@0.6.0:TOSHIBA DVD-ROM SD-M14011009|CD-ROM
scsidev@2.0.0:ADIC    FastStor 2      A12r|Autochanger (Jukebox)
scsidev@2.5.0:HP      Ultrium 1-SCSI  E3  |Tape
scsidev@2.21.0:HP      Ultrium 1-SCSI  E3  |Tape

When I then reconnected the cable, I saw a lot of error messages in dmesg:-

Jun 19 13:47:37 server scsi: [ID 107833 kern.warning] WARNING: /pci@8,600000/pci@1/scsi@4 (qus0):
Jun 19 13:47:37 server        Received unexpected SCSI Reset
Jun 19 13:47:37 server scsi: [ID 107833 kern.warning] WARNING: /pci@8,600000/pci@1/scsi@4 (qus0):
Jun 19 13:47:37 server        Received unexpected SCSI Reset
Jun 19 13:51:30 server scsi: [ID 107833 kern.notice] /pci@8,600000/pci@1/scsi@4 (qus0):
Jun 19 13:51:30 server        Failed to Get Features
Jun 19 13:51:50 server scsi: [ID 107833 kern.warning] WARNING: /pci@8,600000/pci@1/scsi@4 (qus0):
Jun 19 13:51:50 server        ISP: Firmware cmd timeout
Jun 19 13:51:50 server scsi: [ID 107833 kern.warning] WARNING: /pci@8,600000/pci@1/scsi@4 (qus0):
Jun 19 13:51:50 server        Fatal error, resetting interface, flg 1

and devfsadm hung. So I double-checked the earlier dmesg output and found a lot of similar SCSI error messages, such as:-

Jun 19 13:43:12 server        probe request for LUN 0 denied: max LUN -1
Jun 19 13:43:12 server scsi: [ID 107833 kern.warning] WARNING: /pci@8,600000/pci@1/scsi@5 (qus1):

What should I do now?
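
To see at a glance which controller the kernel is complaining about, the warnings in messages like the ones above can be tallied per HBA instance. This is a rough sketch that assumes the syslog layout shown in this thread, where the instance name (qus0, qus1) appears in parentheses at the end of the header line; the function name warnings_per_hba is made up for illustration.

```shell
# Count kern.warning header lines per HBA instance, i.e. the name
# in parentheses such as qus0. Input: dmesg or syslog text on stdin.
warnings_per_hba() {
    awk '/kern\.warning/ && match($0, /\([a-z]+[0-9]+\):/) {
             # strip the leading "(" and the trailing "):"
             inst = substr($0, RSTART + 1, RLENGTH - 3)
             count[inst]++
         }
         END { for (i in count) print i, count[i] }' | sort
}
```

Usage would be something like "dmesg | warnings_per_hba". If one instance collects nearly all of the warnings, that points at the bus, cable or port behind it rather than at the backup software.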

 
LVL 4

Author Comment

by:frankcheong
What about the following?
1. disconnect the changer
2. turn off the changer
3. remove the jukebox in Solstice Backup 6.1
4. remove the device in Solstice Backup 6.1
5. reconnect the changer
6. turn on the changer
7. reboot the server
8. wait for all the self-tests to complete
9. create the device again in Solstice Backup 6.1
10. create the jukebox again in Solstice Backup 6.1

Would that help?
 
LVL 4

Author Comment

by:frankcheong
Or should I file a support request with Sun regarding the kernel messages below?

Jun 19 13:47:37 server scsi: [ID 107833 kern.warning] WARNING: /pci@8,600000/pci@1/scsi@4 (qus0):
Jun 19 13:47:37 server        Received unexpected SCSI Reset
Jun 19 13:47:37 server scsi: [ID 107833 kern.warning] WARNING: /pci@8,600000/pci@1/scsi@4 (qus0):
Jun 19 13:47:37 server        Received unexpected SCSI Reset
Jun 19 13:51:30 server scsi: [ID 107833 kern.notice] /pci@8,600000/pci@1/scsi@4 (qus0):
Jun 19 13:51:30 server        Failed to Get Features
Jun 19 13:51:50 server scsi: [ID 107833 kern.warning] WARNING: /pci@8,600000/pci@1/scsi@4 (qus0):
Jun 19 13:51:50 server        ISP: Firmware cmd timeout
Jun 19 13:51:50 server scsi: [ID 107833 kern.warning] WARNING: /pci@8,600000/pci@1/scsi@4 (qus0):
Jun 19 13:51:50 server        Fatal error, resetting interface, flg 1
 
LVL 4

Accepted Solution

by:frankcheong
I finally solved my problem. I am listing the steps below and will PAQ the question so that it can help other people.

1. Deleted the jukebox and device using the nwadmin interface.
2. Uninstalled the packages SUNWsbus2 (driver) and SUNWsbun (storage node).
3. Shut down the machine and switched to another SCSI port.
4. Reinstalled the packages SUNWsbus2 (driver) and SUNWsbun (storage node).
5. Ran jbconfig to create the changer.

and it worked.
However, the device file was now /dev/rmt/1* instead of /dev/rmt/0*, so I did the following:-
1. Deleted /dev/rmt/*.
2. Uninstalled the packages SUNWsbus2 (driver) and SUNWsbun (storage node).
3. Shut down the machine and switched back to the original SCSI port.
4. Reinstalled the driver.
5. Ran jbconfig to add the changer, but it failed, complaining "no driver found or problem with my driver installation".

Then I did the following:-
1. Uninstalled the packages SUNWsbus2 (driver) and SUNWsbun (storage node).
2. Shut down the machine and switched to the other SCSI port again.
3. Reinstalled the packages SUNWsbus2 (driver) and SUNWsbun (storage node).
I thought it would work because I was back on /dev/rmt/1*, but the result was the same: jbconfig again reported "no driver found or problem with my driver installation".

Then I had a sudden flash of inspiration and did the following:
1. Deleted /dev/rmt/*.
2. Ran "nsr_shutdown".
3. Uninstalled the packages SUNWsbus2 (driver) and SUNWsbun (storage node).
4. Ran "devfsadm -i st".
5. Reinstalled the packages SUNWsbus2 (driver) and SUNWsbun (storage node).
6. Ran jbconfig to create the changer.

And it works. I really have no idea what is going on or why it works, but it really does.
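
For anyone repeating the final sequence, it could be written down as a dry-run script along these lines. This is only a sketch under assumptions: the post does not say how the packages were removed or where they were reinstalled from, so the pkgrm/pkgadd calls and the PKG_SRC path are guesses, and the names run, rebuild_tape_config and DRY_RUN are made up for illustration. By default it only prints the commands.

```shell
# Dry-run sketch of the final sequence from the post.
# PKG_SRC is a hypothetical location of the Solstice Backup packages.
PKG_SRC=${PKG_SRC:-/path/to/packages}
# DRY_RUN=1 (the default) prints each command instead of running it.
DRY_RUN=${DRY_RUN:-1}

run() {
    if [ "$DRY_RUN" = "1" ]; then
        echo "would run: $*"
    else
        "$@"
    fi
}

rebuild_tape_config() {
    run sh -c 'rm -f /dev/rmt/*'                      # 1. delete the old tape device links
    run nsr_shutdown                                  # 2. stop the backup daemons
    run pkgrm SUNWsbus2 SUNWsbun                      # 3. remove driver and storage node (assumed command)
    run devfsadm -i st                                # 4. rebuild device links for the st driver
    run pkgadd -d "$PKG_SRC" SUNWsbus2 SUNWsbun       # 5. reinstall both packages (assumed command)
    run jbconfig                                      # 6. interactive: recreate the changer
}
```

Calling rebuild_tape_config with the defaults prints the six commands in order, which makes it easy to review them before running anything for real with DRY_RUN=0 as root.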
 
LVL 4

Author Comment

by:frankcheong
I suggest this be PAQed for others' reference.