ESX 3.5 Locking up

Posted on 2008-11-15
Last Modified: 2012-08-14

I installed 2 ESX 3.5 Update 2 servers at the same time a few months ago. They are both DL380 G5s 14 GB RAM, 2 x Quad core processors and are attached to an MSA500.

ESX02 as I called it locks up once a week. The guests are in accessible and the Infrastructure client cannot connect to the serve. The only way I can get it back up is to connect to ILO and reboot it. The server is by no means stressed and has plenty of available resources.

I managed to get a look at what it says on the console when the above happens. It read
"kernel 2.4.21-57. Elvmnix on an i686 you probably have a hardware problem with your RAM chips. Please consult hardware error logs"

I booted the server off a diagnostic cd and ran a memory test and it gave the all clear, as usual

I am looking for a way to get the server to send out logs or see what is happening in the back-ground when this happens. I also have the HP Insight Management agents installed and the ILO doesn't have any errors when the server is locked up.

Help much appreciated!

Question by:davewex
    LVL 1

    Expert Comment

    I was told my VMware tech support awhile back that those HP Management Agents can cause problems sometimes.

    I haven't always had good luck testing RAM with software apps like the diagnostic disks.  Try swapping it out for sure.  It's an easy fix if it is indeed the RAM.
    LVL 18

    Expert Comment

    I second azjeep on that this is probably a hw issue, and that your memory is likely the culprit. Checking your memory dimms before going into prod is very important.


    Author Comment

    I installed the agents after I started getting the issue. I am going to run memtest on it this evening as I used HP diagnostics the last time. I guess you guys can't help and its down to trial and error...I was just hoping for an easy fix

    thanks anyway
    LVL 1

    Expert Comment

    It doesn't get much easier than swapping out a couple of DIMMs ;)

    Accepted Solution

    There is 14 GB of RAM in this server and I don't have that spare and the issue only arises once every few weeks.

    Memtest found an error with the RAM in DIMM A so I have replaced this and all seems well


    Write Comment

    Please enter a first name

    Please enter a last name

    We will never share this with anyone.

    Featured Post

    Free Trending Threat Insights Every Day

    Enhance your security with threat intelligence from the web. Get trending threat insights on hackers, exploits, and suspicious IP addresses delivered to your inbox with our free Cyber Daily.

    VMware ESX/ESXi Backup Guide If you have a licensed version of ESX/ESXi, (paid for license) you could purchase the following third party applications to perform backups. If you do not have a licensed version of ESX/ESXi, your options are limited,…
    The original payload size or maximum transmission unit (MTU) of an ethernet frame is 1500 bytes. A jumbo frame has an ethernet frame size of 9000 bytes or over. Common Jumbo Frame sizes are 9000, 9216 bytes (example - HP switches). Enabling Jumb…
    Teach the user how to edit .vmx files to add advanced configuration options Open vSphere Web Client: Edit Settings for a VM: Choose VM Options -> Advanced: Add Configuration Parameters:
    This Micro Tutorial steps you through the configuration steps to configure your ESXi host Management Network settings and test the management network, ensure the host is recognized by the DNS Server, configure a new password, and the troubleshooting…

    779 members asked questions and received personalized solutions in the past 7 days.

    Join the community of 500,000 technology professionals and ask your questions.

    Join & Ask a Question

    Need Help in Real-Time?

    Connect with top rated Experts

    14 Experts available now in Live!

    Get 1:1 Help Now