Link to home
Start Free TrialLog in
Avatar of virgo0880
virgo0880

asked on

Tape read errors

I am getting lot of read errors for a tape in the library due to which some tape drives throwing h/w errors. How can I move the data from this tape to other tape and remove this tape from library, so that it should not harm the tape drives, also whether the move data can be done when other backps sessions are running, let me know the commands for the same. Here is the output of that volume :


Volume Name: 005171
             Storage Pool Name: TBACK6
             Device Class Name: TDEV3592
            Estimated Capacity: 488,211.9
       Scaled Capacity Applied: 100
                      Pct Util: 100.0
                 Volume Status: Full
                        Access: Read-Only
        Pct. Reclaimable Space: 0.0
               Scratch Volume?: Yes
               In Error State?: No
      Number of Writable Sides: 1
       Number of Times Mounted: 23
             Write Pass Number: 1
     Approx. Date Last Written: 06/20/11 01:54:08
        Approx. Date Last Read: 06/27/11 09:26:29
           Date Became Pending:
        Number of Write Errors: 0
         Number of Read Errors: 7
               Volume Location:
Volume is MVS Lanfree Capable : No
Last Update by (administrator):
         Last Update Date/Time: 06/09/11 21:57:47
          Begin Reclaim Period:
            End Reclaim Period:
  Drive Encryption Key Manager: None

Thanks
Pankaj
Avatar of balasundaram_s
balasundaram_s

Looks like bad media !!

   Pct Util: 100.0
                 Volume Status: Full
                        Access: Read-Only
        Pct. Reclaimable Space: 0.0
               Scratch Volume?: Yes
Avatar of virgo0880

ASKER

so what has to be done next in this case ?

Thanks
virgo
SOLUTION
Avatar of balasundaram_s
balasundaram_s

Link to home
membership
This solution is only available to members.
To access this solution, you must be a member of Experts Exchange.
Start Free Trial
I did the audit volume previously I got the following error in the activity logs for the volume, but I have not done fix=yes option, just audit option and it was saying following. What I did is made the tape read-only and have requested offsite tapes to restore this volume. Is that ok or should I try with the commands you have given ?

06/27/11 13:43:18     ANR8944E Hardware or media error on drive TDRV7           
                       (/dev/rmt27) with volume 005171(OP=READ, Error Number=   
                       110, CC=0, KEY=03, ASC=11, ASCQ=00, SENSE=F0.00.03.00.04-
                       .00.00.58.00.00.00.00.11.00.86.0A.50.60.41.00.00.03.01.9-
                       1.00.7E.30.00.A0.A5.08.1F.48.50.60.1F.48.50.60.1F.48.50.-
                       60.1F.48.00.00.0C.A1.8C.04.F8.1F.23.00.00.00.01.34.BA.1B-
                       .01.28.56.14.24.FE.00.10.00.20.00.43.39.30.20.20.20.20.0-
                       0.A0.41.4A.F0.F0.F5.F1.F7.F1.0F.00.03.00.00.7E.41,       
                       Description=An undetermined error has occurred). Refer to
                       Appendix C in the 'Messages' manual for recommended      
                       action. (SESSION: 21232, PROCESS: 1318)                  
06/27/11 13:43:18     ANR8359E Media fault detected on 3592 volume 005171 in    
                       drive TDRV7 (/dev/rmt27) of library L3494B. (SESSION:    
                       21232, PROCESS: 1318)                                    
06/27/11 13:43:18     ANR2335W Audit Volume has encountered an I/O error for    
                       volume 005171 while attempting to read: Node EPMAILBOX1B-
                       AK, Type Backup (Inactive), Filespace EPMAILBOX1\SG3,    
                       fsId 20, File Name \data\0000\DoNotUse_SG3\ full.        
                       (SESSION: 21232, PROCESS: 1318)                          
06/27/11 13:43:18     ANR2317W Audit Volume found damaged file on volume 005171:
                       Node EPMAILBOX1BAK, Type Backup (Inactive), File space   
                       EPMAILBOX1\SG3, fsId 20, File name \data\0000\DoNotUse_S-
                       G3\ full is number 7 of 28 versions. (SESSION: 21232,    
                       PROCESS: 1318)               

Open in new window


Thanks
virgo
Avatar of woolmilkporc
If you restore a volume from an offsite copy the original volume will be "EMPTY" from a TSM perspective.
It is marked as "DESTROYED" and you can safely remove it from the library using
CHECKOUT LIBVOLUME L3494B 005171 REMOVE=YES

wmp
By the way, "Scratch = Yes" only means that TSM has added the volume in question to the storage pool by taking it from the scratch pool, as opposed to volumes which have been added manually to the pool by a storage administrator.
Consequently, this is not an error.
So, what do you think should I go ahead and restore the volume. Whether trying move data is worth, before on the volume as we are seeing already seeing read errors on the same.

virgo
If you indeed have a backup copy of the volume's data run a "restore volume" as soon as possible.
The damaged volume itself will not get physically involved in this process.

If it's not empty afterwards (because some of the data on it have not been copied yet) you can still try a "move data" first, and if this fails an "audit volume fix=yes" followed by another "move data" as a last resort.
"Audit volume fix=yes" will not recover any data, it will just clean up references to irrecoverable data in TSM's database.

wmp

Ok, I have started the restore volume process for the bad volume. Once the volume is restored I will check the commands given by you and revert. One more thing, after restore process completes, do I need to change the access of the bad volume to readonly or readwrite so that I can do audit volume on that or  move data on that ?

Also what is the syntax for move data command ?

Thanks
virgo
ASKER CERTIFIED SOLUTION
Link to home
membership
This solution is only available to members.
To access this solution, you must be a member of Experts Exchange.
Start Free Trial