Want to protect your cyber security and still get fast solutions? Ask a secure question today.Go Premium

x
  • Status: Solved
  • Priority: Medium
  • Security: Public
  • Views: 3479
  • Last Modified:

RDXmon.exe, RD1000 drive and SENS

I've got a client with a dell PowerEdge T310 which has an internal RD1000 drive.  MS Server 2008R2 standard x64.

This drive tends to disappear for no reason on a daily basis.  initiating a search for new hardware in device manager finds it and loads it again.

in the event log I see this for hours on end.

"The description for Event ID 0 from source RDXmon cannot be found. Either the component that raises this event is not installed on your local computer or the installation is corrupted. You can install or repair the component on the local computer.

If the event originated on another computer, the display information had to be saved with the event.

The following information was included with the event:

RDXmon:
Check Condition on SCSI command, status:0x2, CDB length:10 CDB:0x4A 0x1 0x0 0x0 0x10 0x0 0x0 0x0 0x8 0x0"


I was able to get around this by scheduling a script that initiates the check for new hardware every day just prior to the backups that write to this drive kicking off.

In the last week however a new and exciting twist has occurred.  The drive becomes inaccessible but still shows up in My Computer and the device manager.  Then when I log out of the remote session I use to manage the server I get this...
 
"please wait for system event notification service......"

This is where it locks up and nothing short of a hard reboot will break this....
0
AshcorTech
Asked:
AshcorTech
  • 5
  • 5
1 Solution
 
DavidCommented:
I don't know how much more obvious this could be .. REPLACE THE DRIVE!
0
 
AshcorTechAuthor Commented:
Had that argument with dell... it went nowhere.
0
 
DavidCommented:
The CDB (the hex string) is asking the RD1000 to send status information in immediate mode.   A check condition (0x02) means that the command aborted improperly, as if the command was ignored by the tape.  So this confirms that tape drive went offline.  This is irrefutable evidence of a hardware problem.  Ask for somebody in level 2 support.
0
What does it mean to be "Always On"?

Is your cloud always on? With an Always On cloud you won't have to worry about downtime for maintenance or software application code updates, ensuring that your bottom line isn't affected.

 
AshcorTechAuthor Commented:
dlethe,

MUCH better answer than your first. Thank you!  I'll get them on the horn this afternoon.
0
 
DavidCommented:
Level2 is the key.  The official ANSI command (4A) it is sending is called, "GET EVENT STATUS NOTIFICATION", and the 01 means IMMEDIATE, which is self-explanatory. The 02/Check Condition comes with additional data that says the specific nature of the problem, like drive is not ready, or power was lost, etc, but the error message doesn't reveal those extra bytes, so all you can run with is the fact that the command failed.

The other check condition possibilities could mean that the command isn't supported, but if that was the case you would see these messages all of the time.  If the drive was unplugged and not there at all, you wouldn't get a check condition.  So now you have enough supporting information.  Ask how you could get intermittent check conditions on the GET EVENT STATUS NOTIFICATION unless the tape wasn't going offline??  And since this establishes it went offline, you want authorization for a warranty RMA.

(Yes, I do this stuff for a living ;)
0
 
AshcorTechAuthor Commented:
"(Yes, I do this stuff for a living ;)"

Clearly :)

The additional info is much appreciated!
0
 
DavidCommented:
.. and don't let Dell revert to their standard diagnostic technique .. Telling you it is a registry issue so you should reinstall the O/S ;)
0
 
AshcorTechAuthor Commented:
lol, no, too many rodeo's under my belt for that....

thanks
0
 
AshcorTechAuthor Commented:
must have dialed extra careful this time because the tech just said,

"Well it's either the drive or cartridge but I can't tell which from that error so we'll send you both...."
0
 
DavidCommented:
Was it the "level 2"or the fact that you sounded like you did your homework, or did you just get lucky with an operator who actually cared about customer satisfaction,  just curious.
0

Featured Post

[Webinar] Database Backup and Recovery

Does your company store data on premises, off site, in the cloud, or a combination of these? If you answered “yes”, you need a data backup recovery plan that fits each and every platform. Watch now as as Percona teaches us how to build agile data backup recovery plan.

  • 5
  • 5
Tackle projects and never again get stuck behind a technical roadblock.
Join Now