RDXmon.exe, RD1000 drive and SENS

I've got a client with a Dell PowerEdge T310 that has an internal RD1000 drive. It runs Windows Server 2008 R2 Standard x64.

This drive tends to disappear for no apparent reason on a daily basis. Initiating a scan for new hardware in Device Manager finds it and loads it again.

In the event log I see this for hours on end:

"The description for Event ID 0 from source RDXmon cannot be found. Either the component that raises this event is not installed on your local computer or the installation is corrupted. You can install or repair the component on the local computer.

If the event originated on another computer, the display information had to be saved with the event.

The following information was included with the event:

Check Condition on SCSI command, status:0x2, CDB length:10 CDB:0x4A 0x1 0x0 0x0 0x10 0x0 0x0 0x0 0x8 0x0"

I was able to get around this by scheduling a script that initiates the check for new hardware every day just prior to the backups that write to this drive kicking off.
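
For anyone wanting to reproduce that workaround, here is a minimal Python sketch of such a rescan step. It is not the script from the original post. It assumes Microsoft's `devcon.exe` utility is available on the server and uses its `rescan` command; the function name `rescan_hardware` and the `runner` parameter are made up for illustration.

```python
import subprocess


def rescan_hardware(devcon_path="devcon.exe", runner=subprocess.run):
    """Ask Plug and Play to re-enumerate devices; return True on success.

    Assumption: devcon.exe is installed and on PATH (or devcon_path points
    at it). The runner argument exists only so the call can be swapped out
    when testing on a machine without devcon.
    """
    completed = runner([devcon_path, "rescan"], capture_output=True, text=True)
    return completed.returncode == 0
```

A scheduled task would call `rescan_hardware()` shortly before the backup window and could log or alert when it returns False.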

In the last week, however, a new and exciting twist has occurred. The drive becomes inaccessible but still shows up in My Computer and Device Manager. Then, when I log out of the remote session I use to manage the server, I get this:
"please wait for system event notification service......"

This is where it locks up and nothing short of a hard reboot will break this....
I don't know how much more obvious this could be .. REPLACE THE DRIVE!
AshcorTechAuthor Commented:
Had that argument with dell... it went nowhere.
The CDB (the hex string) is asking the RD1000 to send status information in immediate mode. A check condition (0x02) means that the command terminated abnormally, as if the drive had ignored it. So this confirms that the drive went offline. This is irrefutable evidence of a hardware problem. Ask for somebody in level 2 support.
AshcorTechAuthor Commented:

MUCH better answer than your first. Thank you!  I'll get them on the horn this afternoon.
Level 2 is the key. The official ANSI command (4A) it is sending is called "GET EVENT STATUS NOTIFICATION", and the 01 sets the Polled bit, which asks the drive to answer immediately instead of holding the command until an event occurs. The 02/Check Condition comes with additional sense data that describes the specific nature of the problem (drive not ready, power was lost, etc.), but the error message doesn't reveal those extra bytes, so all you can run with is the fact that the command failed.
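
To make the byte-by-byte reading above concrete, here is a minimal Python sketch that pulls the status and CDB out of the RDXmon event text and decodes the GET EVENT STATUS NOTIFICATION fields. The function names are made up for illustration, and the field layout (byte 1 bit 0 as the polled/immediate flag, byte 4 as the notification class mask, bytes 7-8 as the allocation length) is my reading of the SCSI MMC command set, so check it against the spec before relying on it.

```python
import re

# Notification class bits in byte 4 (assumed from the MMC command set).
NOTIFICATION_CLASSES = {
    1: "operational change",
    2: "power management",
    3: "external request",
    4: "media",
    5: "multi-host",
    6: "device busy",
}


def parse_rdxmon_message(message):
    """Return (status, cdb_bytes) extracted from the event log text."""
    status = int(re.search(r"status:(0x[0-9A-Fa-f]+)", message).group(1), 16)
    # "CDB length:10" has no colon right after "CDB", so the first "CDB:"
    # is the start of the byte list.
    cdb_text = message.split("CDB:", 1)[1]
    cdb = [int(tok, 16) for tok in re.findall(r"0x[0-9A-Fa-f]+", cdb_text)]
    return status, cdb


def decode_get_event_status(cdb):
    """Decode a 10-byte GET EVENT STATUS NOTIFICATION (opcode 0x4A) CDB."""
    if len(cdb) != 10 or cdb[0] != 0x4A:
        raise ValueError("not a 10-byte GET EVENT STATUS NOTIFICATION CDB")
    return {
        "polled": bool(cdb[1] & 0x01),
        "classes": [name for bit, name in NOTIFICATION_CLASSES.items()
                    if cdb[4] & (1 << bit)],
        "allocation_length": (cdb[7] << 8) | cdb[8],
    }
```

Fed the message from the event log above, this reports status 0x02 (check condition), the polled flag set, the media class requested, and an 8-byte allocation length. None of that tells you why the command failed; that detail lives in the sense data the log does not show.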

The other check condition possibilities could mean that the command isn't supported, but if that were the case you would see these messages all of the time. If the drive were unplugged and not there at all, you wouldn't get a check condition. So now you have enough supporting information. Ask how you could get intermittent check conditions on GET EVENT STATUS NOTIFICATION unless the drive was going offline. And since this establishes that it went offline, you want authorization for a warranty RMA.

(Yes, I do this stuff for a living ;)

AshcorTechAuthor Commented:
"(Yes, I do this stuff for a living ;)"

Clearly :)

The additional info is much appreciated!
.. and don't let Dell revert to their standard diagnostic technique .. Telling you it is a registry issue so you should reinstall the O/S ;)
AshcorTechAuthor Commented:
lol, no, too many rodeos under my belt for that....

AshcorTechAuthor Commented:
Must have dialed extra carefully this time, because the tech just said,

"Well it's either the drive or cartridge but I can't tell which from that error so we'll send you both...."
Was it the "level 2", or the fact that you sounded like you did your homework, or did you just get lucky with an operator who actually cared about customer satisfaction? Just curious.