Solved

Hang when invoking the autochanger device

Posted on 2008-06-11
9
1,131 Views
Last Modified: 2013-12-21
Refer to the question "http://www.experts-exchange.com/OS/Unix/Solaris/Q_23413012.html#a21749881". I have a SUN Storedge L8 connecting to a Solaris 9 with Solstice Backup 6.1 Build 186 installed.

The networker hang up whenever I try to invoke the tape changer (using inventory, label, mount or unmount).

In the judebox menu I see the jukebox installed and the control port is "scsidev@2.0.0" and the device is "/dev/rmt/0cbn".

The device should be working in the past coz I see some test backup script is there. What should I do now ?

BTW, inquire shows:-
/etc/LGTOuscsi/inquire
scsidev@0.6.0:TOSHIBA DVD-ROM SD-M14011009|CD-ROM
scsidev@2.0.0:ADIC    FastStor 2      A12r|Autochanger (Jukebox)
scsidev@2.5.0:HP      Ultrium 1-SCSI  E3  |Tape
scsidev@2.21.0:HP      Ultrium 1-SCSI  E3  |Tape

where cfgadm -alv shows:-
Ap_Id                          Receptacle   Occupant     Condition  Information
When         Type         Busy     Phys_Id
c0                             connected    configured   unknown
unavailable  scsi-bus     n        /devices/pci@8,700000/scsi@6:scsi
c0::/dev/lus                   connected    configured   unknown    TOSHIBA DVD-ROM SD-M1401
unavailable  CD-ROM       n        /devices/pci@8,700000/scsi@6:scsi::/dev/lus
c0::dsk/c0t6d0                 connected    configured   unknown    TOSHIBA DVD-ROM SD-M1401
unavailable  CD-ROM       y        /devices/pci@8,700000/scsi@6:scsi::dsk/c0t6d0
c1                             connected    configured   unknown
unavailable  fc-private   n        /devices/pci@8,600000/SUNW,qlc@4/fp@0,0:fc
c1::21000004cfe34092           connected    configured   unknown    SEAGATE  ST373307FSUN72G
unavailable  disk         n        /devices/pci@8,600000/SUNW,qlc@4/fp@0,0:fc::21000004cfe34092
c1::21000004cfe34350           connected    configured   unknown    SEAGATE  ST373307FSUN72G
unavailable  disk         n        /devices/pci@8,600000/SUNW,qlc@4/fp@0,0:fc::21000004cfe34350
c2                             connected    configured   unknown
unavailable  scsi-bus     n        /devices/pci@8,700000/scsi@6,1:scsi
c2::lus1                       connected    configured   unknown
unavailable  unknown      n        /devices/pci@8,700000/scsi@6,1:scsi::lus1
c3                             connected    configured   unknown
unavailable  scsi-bus     n        /devices/pci@8,600000/pci@1/scsi@4:scsi
c3::lus2                       connected    configured   unknown    HP Ultrium 1-SCSI
unavailable  unknown      n        /devices/pci@8,600000/pci@1/scsi@4:scsi::lus2
c3::rmt/0                      connected    configured   unknown    HP Ultrium 1-SCSI
unavailable  tape         n        /devices/pci@8,600000/pci@1/scsi@4:scsi::rmt/0
c4                             connected    configured   unknown
unavailable  scsi-bus     n        /devices/pci@8,600000/pci@1/scsi@5:scsi
c4::lus3                       connected    configured   unknown
unavailable  unknown      n        /devices/pci@8,600000/pci@1/scsi@5:scsi::lus3
usb0/1                         connected    configured   ok         Mfg: <undef>  Product: <undef>  NConfigs: 1  Config: 0  <no cfg str descr>
unavailable  usb-kbd      n        /devices/pci@8,700000/usb@5,3:1
usb0/2                         empty        unconfigured ok
unavailable  unknown      n        /devices/pci@8,700000/usb@5,3:2
usb0/3                         connected    configured   ok         Mfg: <undef>  Product: <undef>  NConfigs: 1  Config: 0  <no cfg str descr>
unavailable  usb-mouse    n        /devices/pci@8,700000/usb@5,3:3
usb0/4                         empty        unconfigured ok
unavailable  unknown      n        /devices/pci@8,700000/usb@5,3:4

0
Comment
Question by:frankcheong
  • 6
  • 2
9 Comments
 
LVL 16

Expert Comment

by:Hanno Schröder
ID: 21772039
what does
  nsrjb
show? Any output?

Have you tried resetting your JB first, using
  nsrjb -HHE
0
 
LVL 4

Author Comment

by:frankcheong
ID: 21775223
nsrjb shows

Jukebox L8:
  slot  volume                       pool            volume id        recyclable
     1: dwbak.inc.day0.wk0.001*                      2481680129
     2: dwbak.inc.day0.wk0.002       DW Incremental  2481704705       no
     3: dwbak.inc.day0.wk0.003       DW Incremental  2481729793       no
     4: -*                                           -
     5: dwbak.inc.day0.wk0.005       DW Incremental  2481779201       no
     6: dwbak.inc.day0.wk0.006       DW Incremental  2481804289       no
     7: dwbak.inc.day0.wk0.007       DW Incremental  2481829889       no
     8: dwbak.inc.day0.wk0.008       DW Incremental  2481854209       no
        *not registered in the NetWorker media data base

drive 1 (/dev/rmt/0cbn) slot  5: dwbak.inc.day0.wk0.005

nsrjb -HE hang  indefinitly, so does nsrjb -HHE

devfsadm -c tape also hang while simply devfsadm can run without problem.

cfgadm -alv c3 provide the following:-

Ap_Id                          Receptacle   Occupant     Condition  Information
When         Type         Busy     Phys_Id
c3                             connected    configured   unknown
unavailable  scsi-bus     n        /devices/pci@8,600000/pci@1/scsi@4:scsi
c3::lus2                       connected    configured   unknown    HP Ultrium 1-SCSI
unavailable  unknown      n        /devices/pci@8,600000/pci@1/scsi@4:scsi::lus2
c3::rmt/0                      connected    configured   unknown    HP Ultrium 1-SCSI
unavailable  tape         n        /devices/pci@8,600000/pci@1/scsi@4:scsi::rmt/0


/etc/LGTOuscsi/inquire did provide the same output
scsidev@0.6.0:TOSHIBA DVD-ROM SD-M14011009|CD-ROM
scsidev@2.0.0:ADIC    FastStor 2      A12r|Autochanger (Jukebox)
scsidev@2.5.0:HP      Ultrium 1-SCSI  E3  |Tape
scsidev@2.21.0:HP      Ultrium 1-SCSI  E3  |Tape

what can I do now ?
0
 
LVL 16

Expert Comment

by:Hanno Schröder
ID: 21777550
a) does the result from "nsrjb" match reality?
    is this what you really have in your box?
b) some config in your system got screwed up :-(
    As you only have one tape drive, inquire should
    only find one (!)
    Have the cables on controller c3 checked (unconnect and
    reconnect, check for bend pins etc.). After this had been
    done, power off the jukebox (and tape drive, if with
    seperate power) wait some time and power back on.
    Wait for the Jukebox to finish it's self-test.
    Try another
        devfsadm -c tape -v -C
    Now, run "nsrjb" to see what it displays. Is this the same
    as your real (physical) situation?
c) You may want to run
         nsrjb -EH
    now.
0
Best Practices: Disaster Recovery Testing

Besides backup, any IT division should have a disaster recovery plan. You will find a few tips below relating to the development of such a plan and to what issues one should pay special attention in the course of backup planning.

 
LVL 4

Author Comment

by:frankcheong
ID: 21819675
nsjrb should be past history, coz I am doing tar and manually changing the slot for quite some time already.

now I unconnect the cable and run devfsadm and then cfgadm -alv shows

cfgadm -alv
Ap_Id                          Receptacle   Occupant     Condition  Information
When         Type         Busy     Phys_Id
c0                             connected    configured   unknown
unavailable  scsi-bus     n        /devices/pci@8,700000/scsi@6:scsi
c0::/dev/lus                   connected    configured   unknown    TOSHIBA DVD-ROM SD-M1401
unavailable  CD-ROM       n        /devices/pci@8,700000/scsi@6:scsi::/dev/lus
c0::dsk/c0t6d0                 connected    configured   unknown    TOSHIBA DVD-ROM SD-M1401
unavailable  CD-ROM       y        /devices/pci@8,700000/scsi@6:scsi::dsk/c0t6d0
c1                             connected    configured   unknown
unavailable  fc-private   n        /devices/pci@8,600000/SUNW,qlc@4/fp@0,0:fc
c1::21000004cfe34092           connected    configured   unknown    SEAGATE  ST373307FSUN72G
unavailable  disk         n        /devices/pci@8,600000/SUNW,qlc@4/fp@0,0:fc::21000004cfe34092
c1::21000004cfe34350           connected    configured   unknown    SEAGATE  ST373307FSUN72G
unavailable  disk         n        /devices/pci@8,600000/SUNW,qlc@4/fp@0,0:fc::21000004cfe34350
c2                             connected    configured   unknown
unavailable  scsi-bus     n        /devices/pci@8,700000/scsi@6,1:scsi
c2::lus1                       connected    configured   unknown
unavailable  unknown      n        /devices/pci@8,700000/scsi@6,1:scsi::lus1
c3                             connected    configured   unknown
unavailable  scsi-bus     n        /devices/pci@8,600000/pci@1/scsi@4:scsi
c3::lus2                       connected    configured   unknown    HP Ultrium 1-SCSI
unavailable  unknown      n        /devices/pci@8,600000/pci@1/scsi@4:scsi::lus2
c3::rmt/0                      connected    configured   unknown    HP Ultrium 1-SCSI
unavailable  tape         n        /devices/pci@8,600000/pci@1/scsi@4:scsi::rmt/0
c4                             connected    configured   unknown
unavailable  scsi-bus     n        /devices/pci@8,600000/pci@1/scsi@5:scsi
c4::lus3                       connected    configured   unknown
unavailable  unknown      n        /devices/pci@8,600000/pci@1/scsi@5:scsi::lus3
usb0/1                         connected    configured   ok         Mfg: <undef>  Product: <undef>  NConfigs: 1  Config: 0  <no cfg str descr>
unavailable  usb-kbd      n        /devices/pci@8,700000/usb@5,3:1
usb0/2                         empty        unconfigured ok
unavailable  unknown      n        /devices/pci@8,700000/usb@5,3:2
usb0/3                         connected    configured   ok         Mfg: <undef>  Product: <undef>  NConfigs: 1  Config: 0  <no cfg str descr>
unavailable  usb-mouse    n        /devices/pci@8,700000/usb@5,3:3
usb0/4                         empty        unconfigured ok
unavailable  unknown      n        /devices/pci@8,700000/usb@5,3:4

and inquire still shows
scsidev@0.6.0:TOSHIBA DVD-ROM SD-M14011009|CD-ROM
scsidev@2.0.0:ADIC    FastStor 2      A12r|Autochanger (Jukebox)
scsidev@2.5.0:HP      Ultrium 1-SCSI  E3  |Tape
scsidev@2.21.0:HP      Ultrium 1-SCSI  E3  |Tape

and then I reconnect the cable I saw a lot of error message in dmesg:-

Jun 19 13:47:37 server scsi: [ID 107833 kern.warning] WARNING: /pci@8,600000/pci@1/scsi@4 (qus0):
Jun 19 13:47:37 server        Received unexpected SCSI Reset
Jun 19 13:47:37 server scsi: [ID 107833 kern.warning] WARNING: /pci@8,600000/pci@1/scsi@4 (qus0):
Jun 19 13:47:37 server        Received unexpected SCSI Reset
Jun 19 13:51:30 server scsi: [ID 107833 kern.notice] /pci@8,600000/pci@1/scsi@4 (qus0):
Jun 19 13:51:30 server        Failed to Get Features
Jun 19 13:51:50 server scsi: [ID 107833 kern.warning] WARNING: /pci@8,600000/pci@1/scsi@4 (qus0):
Jun 19 13:51:50 server        ISP: Firmware cmd timeout
Jun 19 13:51:50 server scsi: [ID 107833 kern.warning] WARNING: /pci@8,600000/pci@1/scsi@4 (qus0):
Jun 19 13:51:50 server        Fatal error, resetting interface, flg 1

and devfsadm hang. So I double check the past demsg and found a lot of similar SCSI error message as below:-

Jun 19 13:43:12 server        probe request for LUN 0 denied: max LUN -1
Jun 19 13:43:12 server scsi: [ID 107833 kern.warning] WARNING: /pci@8,600000/pci@1/scsi@5 (qus1):

What should I do now ?
0
 
LVL 4

Author Comment

by:frankcheong
ID: 22782916
What about ?
1. unconnect the changer
2. turn off the changer
3. remove the jukebox in the solstice backup 6.1
4. remove the device in the solstice backup 6.1
5. reconnect the changer
6. turn on the changer
7. reboo the server
8. wait for all the self test complete
9. create the device back in the solstice backup 6.1
10. create the jukebox back in the solstice backup 6.1

Would that help?
0
 
LVL 4

Author Comment

by:frankcheong
ID: 22782926
Or should I file a support request to sun regarding the below kernel message?

Jun 19 13:47:37 server scsi: [ID 107833 kern.warning] WARNING: /pci@8,600000/pci@1/scsi@4 (qus0):
Jun 19 13:47:37 server        Received unexpected SCSI Reset
Jun 19 13:47:37 server scsi: [ID 107833 kern.warning] WARNING: /pci@8,600000/pci@1/scsi@4 (qus0):
Jun 19 13:47:37 server        Received unexpected SCSI Reset
Jun 19 13:51:30 server scsi: [ID 107833 kern.notice] /pci@8,600000/pci@1/scsi@4 (qus0):
Jun 19 13:51:30 server        Failed to Get Features
Jun 19 13:51:50 server scsi: [ID 107833 kern.warning] WARNING: /pci@8,600000/pci@1/scsi@4 (qus0):
Jun 19 13:51:50 server        ISP: Firmware cmd timeout
Jun 19 13:51:50 server scsi: [ID 107833 kern.warning] WARNING: /pci@8,600000/pci@1/scsi@4 (qus0):
Jun 19 13:51:50 server        Fatal error, resetting interface, flg 1
0
 
LVL 4

Accepted Solution

by:
frankcheong earned 0 total points
ID: 23302687
Finally I did solved my problem and would list down the steps below and PAQ it so as to help other people.

1. I have deleted the jukebox and device under using nwadmin interface.
2. uninstall the package SUNWsbus2 (driver) and SUNWsbun (storage node)
3. shutdown the machine and switch using another SCSI Port.
4. reinstall the package SUNWsbus2 (driver) and SUNWsbun (storage node)
5. run jbconfig to create the changer

and it works.
while since the device file is /dev/rmt/1* instead of /dev/rmt/0* so I have done the following:-
1. I have deleted /dev/rmt/*
2. uninstall the package SUNWsbus2 (driver) and SUNWsbun (storage node)
3. shutdown the machine and switch using original SCSI Port.
4. reinstall the driver.
5. run jbconfig to add the changer but it failed and complaining "no driver found or problem with my driver installation".

Then I have done the following: -
1. uninstall the package SUNWsbus2 (driver) and SUNWsbun (storage node)
2. shutdown the machine and switch using another SCSI Port.
3. reinstall the package SUNWsbus2 (driver) and SUNWsbun (storage node)
I thought it should works coz I am using back the /dev/rmt/1* while the result is the same, the same error message "no driver found or problem with my driver installation" again during jbconfig.

While a sudden flash comes, I have then do the following.
1. delete /dev/rmt/*
2. run "nsr_shutdown"
3. uninstall the package SUNWsbus2 (driver) and SUNWsbun (storage node)
4. run "devfsadm -i st"
5. reinstall the package SUNWsbus2 (driver) and SUNWsbun (storage node)
6. run jbconfig to create the changer

And it works. I really have no idea on whats going on and why it is working. But it really works.
0
 
LVL 4

Author Comment

by:frankcheong
ID: 23302690
suggest to PAQ for other reference.
0

Featured Post

Comprehensive Backup Solutions for Microsoft

Acronis protects the complete Microsoft technology stack: Windows Server, Windows PC, laptop and Surface data; Microsoft business applications; Microsoft Hyper-V; Azure VMs; Microsoft Windows Server 2016; Microsoft Exchange 2016 and SQL Server 2016.

Question has a verified solution.

If you are experiencing a similar issue, please ask a related question

FreeBSD on EC2 FreeBSD (https://www.freebsd.org) is a robust Unix-like operating system that has been around for many years. FreeBSD is available on Amazon EC2 through Amazon Machine Images (AMIs) provided by FreeBSD developer and security office…
Why Shell Scripting? Shell scripting is a powerful method of accessing UNIX systems and it is very flexible. Shell scripts are required when we want to execute a sequence of commands in Unix flavored operating systems. “Shell” is the command line i…
Learn how to find files with the shell using the find and locate commands. Use locate to find a needle in a haystack.: With locate, check if the file still exists.: Use find to get the actual location of the file.:
This Micro Tutorial will teach you how to reformat your flash drive. Sometimes your flash drive may have issues carrying files so this will completely restore it to manufacturing settings. Make sure to backup all files before reformatting. This w…

770 members asked questions and received personalized solutions in the past 7 days.

Join the community of 500,000 technology professionals and ask your questions.

Join & Ask a Question