Want to win a PS4? Go Premium and enter to win our High-Tech Treats giveaway. Enter to Win

x
?
Solved

RDXmon.exe, RD1000 drive and SENS

Posted on 2012-04-13
10
Medium Priority
?
3,440 Views
Last Modified: 2016-11-23
I've got a client with a dell PowerEdge T310 which has an internal RD1000 drive.  MS Server 2008R2 standard x64.

This drive tends to disappear for no reason on a daily basis.  initiating a search for new hardware in device manager finds it and loads it again.

in the event log I see this for hours on end.

"The description for Event ID 0 from source RDXmon cannot be found. Either the component that raises this event is not installed on your local computer or the installation is corrupted. You can install or repair the component on the local computer.

If the event originated on another computer, the display information had to be saved with the event.

The following information was included with the event:

RDXmon:
Check Condition on SCSI command, status:0x2, CDB length:10 CDB:0x4A 0x1 0x0 0x0 0x10 0x0 0x0 0x0 0x8 0x0"


I was able to get around this by scheduling a script that initiates the check for new hardware every day just prior to the backups that write to this drive kicking off.

In the last week however a new and exciting twist has occurred.  The drive becomes inaccessible but still shows up in My Computer and the device manager.  Then when I log out of the remote session I use to manage the server I get this...
 
"please wait for system event notification service......"

This is where it locks up and nothing short of a hard reboot will break this....
0
Comment
Question by:AshcorTech
[X]
Welcome to Experts Exchange

Add your voice to the tech community where 5M+ people just like you are talking about what matters.

  • Help others & share knowledge
  • Earn cash & points
  • Learn & ask questions
  • 5
  • 5
10 Comments
 
LVL 47

Expert Comment

by:David
ID: 37842204
I don't know how much more obvious this could be .. REPLACE THE DRIVE!
0
 

Author Comment

by:AshcorTech
ID: 37842393
Had that argument with dell... it went nowhere.
0
 
LVL 47

Expert Comment

by:David
ID: 37842553
The CDB (the hex string) is asking the RD1000 to send status information in immediate mode.   A check condition (0x02) means that the command aborted improperly, as if the command was ignored by the tape.  So this confirms that tape drive went offline.  This is irrefutable evidence of a hardware problem.  Ask for somebody in level 2 support.
0
Free learning courses: Active Directory Deep Dive

Get a firm grasp on your IT environment when you learn Active Directory best practices with Veeam! Watch all, or choose any amount, of this three-part webinar series to improve your skills. From the basics to virtualization and backup, we got you covered.

 

Author Comment

by:AshcorTech
ID: 37842612
dlethe,

MUCH better answer than your first. Thank you!  I'll get them on the horn this afternoon.
0
 
LVL 47

Accepted Solution

by:
David earned 400 total points
ID: 37842767
Level2 is the key.  The official ANSI command (4A) it is sending is called, "GET EVENT STATUS NOTIFICATION", and the 01 means IMMEDIATE, which is self-explanatory. The 02/Check Condition comes with additional data that says the specific nature of the problem, like drive is not ready, or power was lost, etc, but the error message doesn't reveal those extra bytes, so all you can run with is the fact that the command failed.

The other check condition possibilities could mean that the command isn't supported, but if that was the case you would see these messages all of the time.  If the drive was unplugged and not there at all, you wouldn't get a check condition.  So now you have enough supporting information.  Ask how you could get intermittent check conditions on the GET EVENT STATUS NOTIFICATION unless the tape wasn't going offline??  And since this establishes it went offline, you want authorization for a warranty RMA.

(Yes, I do this stuff for a living ;)
0
 

Author Comment

by:AshcorTech
ID: 37842787
"(Yes, I do this stuff for a living ;)"

Clearly :)

The additional info is much appreciated!
0
 
LVL 47

Expert Comment

by:David
ID: 37842904
.. and don't let Dell revert to their standard diagnostic technique .. Telling you it is a registry issue so you should reinstall the O/S ;)
0
 

Author Comment

by:AshcorTech
ID: 37843417
lol, no, too many rodeo's under my belt for that....

thanks
0
 

Author Comment

by:AshcorTech
ID: 37843518
must have dialed extra careful this time because the tech just said,

"Well it's either the drive or cartridge but I can't tell which from that error so we'll send you both...."
0
 
LVL 47

Expert Comment

by:David
ID: 37843541
Was it the "level 2"or the fact that you sounded like you did your homework, or did you just get lucky with an operator who actually cared about customer satisfaction,  just curious.
0

Featured Post

What does it mean to be "Always On"?

Is your cloud always on? With an Always On cloud you won't have to worry about downtime for maintenance or software application code updates, ensuring that your bottom line isn't affected.

Question has a verified solution.

If you are experiencing a similar issue, please ask a related question

Hyper-convergence systems have taken the IT world by storm and have quickly started to change our point of view of how the data center should and could be architected. In this article, I’ll explain the benefits of employing a hyper-converged system …
Know what services you can and cannot, should and should not combine on your server.
Michael from AdRem Software outlines event notifications and Automatic Corrective Actions in network monitoring. Automatic Corrective Actions are scripts, which can automatically run upon discovery of a certain undesirable condition in your network.…
In this video, Percona Solutions Engineer Barrett Chambers discusses some of the basic syntax differences between MySQL and MongoDB. To learn more check out our webinar on MongoDB administration for MySQL DBA: https://www.percona.com/resources/we…
Suggested Courses

610 members asked questions and received personalized solutions in the past 7 days.

Join the community of 500,000 technology professionals and ask your questions.

Join & Ask a Question