techadmn
asked on
VM Suddenly has no network connectivity
Running on a 2-node VMware ESXi 5.5 cluster. The symptoms are that a VM becomes unreachable: there are no ping replies, and Wireshark shows no replies to any kind of connection to the VM. You can log on locally to the VM through the console connection and reboot it, after which the VM becomes responsive again, but within a few hours it loses network connectivity again. During the problem, the guest operating system (Windows Server 2008 R2) still reports that it has network connectivity.
If you reboot the VM, the problem temporarily goes away.
The problem also travels between ESXi hosts after a vMotion migration. I have even shut the box down, done a cold migration, and restarted it. Within hours the problem returns.
If anyone has seen this before or has any ideas on how to resolve this permanently, I'd be really happy to hear from you.
Thanks.
The host logs could help. Do they report anything useful?
Do you have any IP Address conflicts?
Are you using the VMXNET3 interface (in the VM)?
Can you still ping the host?
ASKER
The host logs show nothing untoward.
The host still responds to pings on the management network, and all the other VMs on the same vSwitch still function as normal. To add more detail, it is not always the same VM that has the issue, but there is always connectivity to the VMs that don't have the issue.
Whichever machine the problem ends up on, a reboot clears the fault, but it comes back later on.
Do you think this could be related to the physical switch? The VMs are uplinked to a trunk port on the physical switch via the ESXi host.
Thanks.
Have you considered the possibility that more than one VM has the same MAC address?
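One way to check for that, as a minimal sketch: export the VM name and MAC address of every virtual NIC, then group the entries by MAC. The inventory below is made-up sample data (not from this cluster), and the function names are only illustrative.

```python
from collections import defaultdict


def normalize_mac(mac: str) -> str:
    """Lower-case a MAC and drop separators so different notations compare equal."""
    return "".join(ch for ch in mac.lower() if ch in "0123456789abcdef")


def find_duplicate_macs(inventory):
    """Return {normalized_mac: [vm names]} for every MAC used by more than one VM."""
    by_mac = defaultdict(list)
    for vm_name, mac in inventory:
        by_mac[normalize_mac(mac)].append(vm_name)
    return {mac: vms for mac, vms in by_mac.items() if len(vms) > 1}


# Hypothetical sample data: (VM name, MAC address) pairs.
inventory = [
    ("app01", "00:50:56:AA:BB:01"),
    ("app02", "00-50-56-aa-bb-02"),
    ("db01", "00:50:56:aa:bb:01"),
]

duplicates = find_duplicate_macs(inventory)
for mac, vms in duplicates.items():
    print(mac, "is used by", ", ".join(vms))
```

Normalizing first matters because the same address can be written with colons, dashes, or mixed case depending on where it was exported from.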
SOLUTION
ASKER CERTIFIED SOLUTION
ASKER
Hello SagiEdoc,
Thanks for your comments. The VLAN configuration you described from your own experience with this problem is very similar to mine. There are 4 VLANs trunked through a 2-port NIC in both ESXi hosts.
That is 2 UTP cables per host, a total of 4 physical uplinks to the switch.
Each VLAN has a vSwitch represented in the network configuration in ESXi. Each VLAN is tagged with an ID.
I will certainly have a look and ensure that all switch ports are configured with the correct VLAN memberships. I will also take your advice on the NIC teaming properties, which, as you correctly pointed out, are currently set to Link Status Only and not Beacon Probe, so I will adjust this and observe the behaviour.
Thanks for your help thus far.
No problem. Take a look at the observed IP ranges on the NICs under the vSwitches. You should be able to pick up pretty quickly whether a VLAN is missing on an uplink, based on the IP ranges you can see.
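If there are too many uplinks to compare by eye, the same check can be sketched in a few lines. This assumes you have copied the observed ranges out of the client by hand as "first-last" strings. The subnets, VLAN IDs, and vmnic names below are hypothetical examples, not values from this thread. Keep in mind that observed ranges only reflect traffic the NIC has actually seen, so treat a "missing" VLAN as a hint to go and check the switch port, not as proof.

```python
import ipaddress


def parse_range(text):
    """Turn 'a.b.c.d-e.f.g.h' into a list of networks covering that range."""
    first, last = (ipaddress.ip_address(part.strip()) for part in text.split("-"))
    return list(ipaddress.summarize_address_range(first, last))


def missing_vlans(expected, observed):
    """Return {nic: [vlan ids]} for VLANs with no observed range on that NIC."""
    result = {}
    for nic, ranges in observed.items():
        seen = [net for r in ranges for net in parse_range(r)]
        missing = [
            vlan_id
            for vlan_id, subnet in expected.items()
            if not any(ipaddress.ip_network(subnet).overlaps(net) for net in seen)
        ]
        if missing:
            result[nic] = missing
    return result


# Hypothetical VLAN ID -> subnet mapping.
expected = {10: "10.0.10.0/24", 20: "10.0.20.0/24", 30: "10.0.30.0/24"}

# Hypothetical observed IP ranges, one list per physical NIC.
observed = {
    "vmnic0": ["10.0.10.1-10.0.10.254", "10.0.20.1-10.0.20.254", "10.0.30.1-10.0.30.254"],
    "vmnic1": ["10.0.10.1-10.0.10.254", "10.0.30.1-10.0.30.254"],
}

print(missing_vlans(expected, observed))
```

With this sample data, vmnic1 is reported as missing VLAN 20, which is the kind of uplink a VM would lose connectivity on if its traffic were moved there.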
ASKER
I've requested that this question be closed as follows:
Accepted answer: 0 points for techadmn's comment #a39876508
for the following reason:
The expert response given resolved the issue and the problem has now gone away. I would like to thank all of those people who kindly replied. The solution was supplied by SagiEDoc, and I would like to extend particular thanks to this expert.
Dear requester, you should not close the question. You should Accept the expert's answer in order to award him.
:( No points? That sucks a bit.
ASKER
Sorry, I am trying to get it adjusted. I clicked on the wrong bit. Apologies.
ASKER
Apologies, I don't often raise questions on Experts Exchange and have made an error in the question closure procedure. I have pressed the request attention button and await a moderator's response.
ASKER
Thank you all for your help, and thanks to the moderators for allowing me to correct the points allocation. Thanks again.