Dell iDRAC goes unresponsive

I've got an R710 with an iDRAC 6 Express.  The system is running ESXi 5.1, BIOS 6.3.0, Lifecycle Controller 1.6.0.73, and iDRAC6 1.95.05.  On the network side the iDRAC uses LOM1, which is shared with the primary VMware management interface.


The problem is that the iDRAC goes unresponsive after some amount of uptime.  A reboot usually fixes it for somewhere between a few minutes and a few hours.  The rest of the network (ESXi) is fine.
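
To pin down how long the iDRAC stays up after each reboot, a small probe can log every time it changes state. This is only a minimal sketch, assuming the iDRAC web interface listens on TCP port 443 and that a failed TCP connect is a fair stand-in for "unresponsive". The function names and the address in the usage note are hypothetical placeholders, not anything from Dell's tooling.

```python
import socket
import time
from datetime import datetime


def is_reachable(host, port=443, timeout=3.0):
    """Return True if a TCP connection to host:port succeeds within timeout."""
    try:
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        # Covers refused connections, timeouts, and unreachable hosts.
        return False


def state_changes(samples):
    """Reduce (timestamp, up) samples to the points where `up` flips."""
    changes = []
    previous = None
    for stamp, up in samples:
        if up != previous:
            changes.append((stamp, up))
            previous = up
    return changes


def watch(host, port=443, interval=30.0, checks=None):
    """Poll host:port and print a line on each state change.

    checks=None polls until interrupted; an integer stops after that many polls.
    """
    previous = None
    count = 0
    while checks is None or count < checks:
        up = is_reachable(host, port)
        if up != previous:
            stamp = datetime.now().isoformat(timespec="seconds")
            print(f"{stamp} {host}:{port} {'UP' if up else 'DOWN'}")
            previous = up
        count += 1
        if checks is None or count < checks:
            time.sleep(interval)
```

Calling `watch("192.0.2.10")` (a placeholder address) from a machine on the management network would print a timestamped UP or DOWN line each time the state flips. Comparing those timestamps against the reboot times shows whether the failure window is random or follows a pattern.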


I have an identical system (same hardware, same firmware) that works fine.

Could this be hardware related?  Unfortunately my support just ended so I can't get hardware replacements.  Anything else to try?
shanepresley asked:

Andrew Hancock (VMware vExpert / EE MVE^2), VMware and Virtualization Consultant, commented:
Are you using the OEM version of ESXi 5.1 from the Dell Support site, i.e. the one you download after entering your service tag number?
shanepresley (author) commented:
Yes, OEM Dell version of ESXi 5.1  


I can still download software, and I've already downloaded and applied all the latest firmware.
Philip Elder, Technical Architect - HA/Compute/Storage, commented:
Is the firmware on the server up to date?

Run the iDRAC Firmware update to reseat the code if you can.

Philip
shanepresley (author) commented:
Yes, all firmware is up to date.  And yes, I re-ran the iDRAC firmware utility to re-install the firmware.
Philip Elder, Technical Architect - HA/Compute/Storage, commented:
Okay, if you can schedule some downtime for the server, open it up and remove and re-seat the iDRAC daughter card. Make sure any cables/connectors are re-seated as well.

Boot up. Does that solve the issue?

Any dust bunny colonies of significant size living in the box?

Philip
shanepresley (author) commented:
Thanks for the replies.  Just tried re-seating and reconnecting the card.  No change.  No dust either.
Philip Elder, Technical Architect - HA/Compute/Storage, commented:
Some further steps: http://dell.to/1fOLosi (Troubleshooting iDRAC)

Then: since support/warranty is over, perhaps a small investment in a new iDRAC card via a third party (eBay) might be in order?

If everything appears to be in order and the firmware has been flashed then that is pretty much the only option left.

Philip
Steve commented:
Looks like you've covered most of the obvious bases already. I'm leaning towards a fault on the iDRAC.

I had a similar issue on a client's server, but it was under warranty and replacing the iDRAC card fixed it.
Andrew Hancock (VMware vExpert / EE MVE^2), VMware and Virtualization Consultant, commented:
Does this card show in Hardware Status or Health Status?
shanepresley (author) commented:
Hardware replacement fixed it.