ESXi 4.0 U1 - Management network becomes unstable after a few days

    Question by:
    On

    Topics:

    I’ve installed ESXi 4.0 Update 1 on two identical machines that reside in the same network segment. On both servers, I’ve created two virtual machines. One runs RedHat Enterprise Linux 5.4 and one runs a small load balancer appliance (Hercules).

    Hardware:
    Dell PowerEdge R210
    Intel Xeon X3450 2.66GHz HT
    8GB RAM
    2x500 GB in RAID 1
    Using ONE port of the internal Broadcom netxtreme II bcm5716 NIC (this port is shared between the management network and the VM’s).
    (all hardware is marked as ‘supported’ by VMware)

    We applied all available patches, including the recent april 1st patch; we’re at build 244038 now.

    The Problem
    After a few days the vSphere client cannot establish a connection to the ESXi hosts anymore. The virtual machines continue to keep running without any problem, however. Only a full reset (applied thru the remote power cycle) restores the connectivity to the management network. We experience this issue on both servers: about three days after power-on/reset, the vSphere client cannot connect anymore.

    Observations:
    • Only the management network suffers from connectivity problems.
    • Restarting the management network (agents) via the physical console doesn’t restore service
    • The physical console offers some basic diagnostics like ‘testing the management network’. The PING tests intermittently fail: about half of the PINGs to the gateway or dns-servers fails. The hardware and the network config MUST be correct, since the management network works for a few days before failing and the VM’s keep running without any problem.
    • We’ve investigated the network traffic from a remote vSphere client that is trying to connect to the ESXi server using a packet sniffer. The remote ESXi hosts resets the connection after initial contact, so there IS packet interchange.

    Given the above, I strongly suspect a problem in the network driver in ESXi, but I don’t know how to diagnose the issue any further. I’ve exhausted all options on the physical ESXi console. I know how to access the (unsupported) commandline console, but don’t know what to look for. Could it be a problem that the management network shares the same NIC as the VM’s?

    I’ve been struggling with this issue for a several weeks now – any help/suggestions is highly appreciated.

    Good Question?
    0
     

    ?

    The member who asked this question verified this comment provided the solution that solved their problem.

    Accepted Solution on 2010-04-09 at 02:56:42ID: 30191013

    According to VMWare's website:
    http://www.vmware.com/resources/compatibility/search.php?action=search&deviceCategory=io&productId=1&advancedORbasic=advanced&maxDisplayRows=50&key=bcm5716&release[]=-1&datePosted=-1&partnerId[]=-1&manufacturer[]=-1&vid=&did=&svid=&ssid=&rorre=0

    Broadcom      NetXtreme II BCM5716 Gigabit Ethernet      is supported up to      ESX 3.5 U5
    Broadcom      NetXtreme II BCM5716S Gigabit Ethernet is supported in ESX / ESXi 4.0 U1

    Top Expert Contributor

    Essential articles and videos from the Experts

    More valuable questions with Expert answers

    201511-LO-Qu-074

    Extend your technology team with the Experts Exchange community.

    — trusted by —

    Who answers my questions?Our community has technology experts around the world.

    Andrew Hancock

    86

    Articles

    Expert in:

    • VMware
    • Virtualization
    • Backup / Restore
    • Server Hardware
    • Storage

    Mr Tortur

    Expert in:

    • Backup / Restore

    Abhilash

    2

    Articles

    Expert in:

    • VMware
    • Virtualization

    Experts Exchange

    5

    Articles

    serialband

    Expert in:

    • Mac OS X
    • Apple OS
    • Linux
    • Apple Hardware
    • Apple Networking

    Dan Lutey

    geek_vj

    Expert in:

    • MS SQL Server
    • MS SQL Server 2005
    • MS SQL Server 2008

    bigeven2002

    Expert in:

    • Outlook
    • Windows 7

    HuaMinChen

    Expert in:

    • MS SQL Server
    • MS SQL Server 2008

    RELATED TOPICS view all topics

    1. Virtualization
      (15,417)
    2. Windows Server 2008
      (79,867)
    3. Storage
      (41,159)
    4. Server Hardware
      (25,987)
    5. Windows 2003 Server
      (129,250)
    6. Linux
      (64,223)
    7. Backup / Restore
      (33,680)
    8. MS Server OS
      (55,732)
    9. MS Virtual Server
      (5,910)
    10. Exchange
      (195,882)