Solved

Windows Unicast NLB virtual machine unable to ping each other ?

Posted on 2014-02-23
8
2,254 Views
Last Modified: 2014-03-30
Hi People,

I am managing a new environment where there are two nodes Unicast NLB Windows Server 2003 Unicast NLB Virtual Machine (let say VM1 and VM2).

Previously before I migrate it into ESXi 5.1u1 host, it was running on two different ESX 4.1 host (two different HP BL 380 G6 servers) and running fine (the NLB status are both converged).

But to my surprise, when I migrated both into two different Blade Server (on two different HP BL 460c G8), they cannot ping each other ?

so based on this article: http://kb.vmware.com/selfservice/microsites/search.do?language=en_US&cmd=displayKC&externalId=1556

I have to put it into the same ESXi host to make sure the NLB converged.

Why is that behaviour now changed on vSphere 5.1 ?
previously it was on the two different hosts and working just fine but now I have to host it on the same host.

Any kind of help and suggestion would be greatly appreciated.

Thanks.
0
Comment
8 Comments
 
LVL 7

Author Comment

by:Senior IT System Engineer
ID: 39880575
Some more details to add as follows,

previously it was like the following and working just fine on two separate ESX host:

ESX 4.1 VMHost 1
VM1 - NLB Unicast Node 1
Local IP 10.1.100.12 (VLAN 100)
NLB IP 10.1.200.5 (VLAN 200)
NLB Cluster Virtual IP 10.1.200.200 (VLAN 200)

ESX 4.1 VMHost 2
VM2 - NLB Unicast Node 2
Local IP 10.1.100.13 (VLAN 100)
NLB IP 10.1.200.6 (VLAN 200)
NLB Cluster Virtual IP 10.1.200.200 (VLAN 200)

but now the configuration like the following is not working:

ESXi 5.1u1 VMHost 1
VM1 - NLB Unicast Node 1
Local IP 172.15.20.25 (VLAN 20)
NLB IP 172.15.20.27 (VLAN 20)
NLB Cluster Virtual IP 172.15.20.29 (VLAN 20)

ESXi 5.1u1 VMHost 2
VM2 - NLB Unicast Node 2
Local IP 172.15.20.26 (VLAN 20)
NLB IP 172.15.20.28 (VLAN 20)
NLB Cluster Virtual IP 172.15.20.29 (VLAN 20)

why the configuration above is not working ?

I had to force it on the same VMhost1 to make it working as normal ?
does changing all of the IP address into the same VLAN causing it to not working ?
0
 
LVL 29

Assisted Solution

by:Rich Weissler
Rich Weissler earned 100 total points
ID: 39882530
I wasn't aware of behaviour changes.  NLB in unicast mode causes each of the servers to take on a common mac address.  When we first brought up a NLB on vmware for the first time (in 3.5, I think), we discovered that rebooting one server caused the switch to suppress the address for five minutes.  The solution there was to either configure static ARP addresses on the switch, or to switch to using multicast addresses rather than unicast.

One way to get two different machines in a NLB cluster to see each other is to give them a separate NIC for that private communication.  That way, the NIC used for that communication keeps their separate MAC addresses.  Looking at your new configuration, I'm not certain if the Local IP addresses aren't on the same NICs as the NLB traffic.

(I'm not certain I've answered the question though... does that help though?  I guess the next logical question would be... do you have one or two NICs in the virtual machines?)
0
 
LVL 19

Assisted Solution

by:compdigit44
compdigit44 earned 300 total points
ID: 39882770
On the NIC teaming software inside the VM's did you try to edit the MAC address as listed in the following article...

http://jhmeier.de/2010/10/19/using-windows-server-2008-r2-network-load-balancing-with-teamed-network-nics-in-a-hp-server/

Also were both you new and old host using the same upstream switch?
0
 
LVL 17

Assisted Solution

by:TimotiSt
TimotiSt earned 100 total points
ID: 39883428
Random guess:
Maybe a promiscuous mode configuration mismatch on the ESXi?

http://petermolnar.eu/linux-tech-coding/vmware-esxi-and-promiscuous-mode/

Tamas
0
New My Cloud Pro Series - organize everything!

With space to keep virtually everything, the My Cloud Pro Series offers your team the network storage to edit, save and share production files from anywhere with an internet connection. Compatible with both Mac and PC, you're able to protect your content regardless of OS.

 
LVL 19

Assisted Solution

by:compdigit44
compdigit44 earned 300 total points
ID: 39883510
Actually TimotiSt, you may be on to something. The vSwitch may have allow forget packets policy enabled.
0
 
LVL 7

Author Comment

by:Senior IT System Engineer
ID: 39884033
Each of the VMs got two vNICs.

Old configuration was connected to uplink switch A
Each vNICs got its own separate VLAN

New configuration was connected to uplink switch B
Each vNICs got the same VLAN

So totally new core switch. I believe that Unicast NLB doesn't require static ARP entry, only multicast needs it.
0
 
LVL 19

Accepted Solution

by:
compdigit44 earned 300 total points
ID: 39884201
Are the vSwitch NIC teaming policies the same?

Some switch may block unicast traffic as a safety measure since it can flood a switch
0
 
LVL 7

Author Comment

by:Senior IT System Engineer
ID: 39914083
comp,

yes the previous ESX host are 2x different HP DL 380 G6 rack servers, connected to a core Cisco Catalyst 6000 series switch (hence I can see the Cisco Discovery Protocol info on the vSwitch balloon).

the newly migrated Terminal Servers are now hosted in 2x different HP Blade servers BL 460c G8 connected to the HP Virtual Connect modules with Flex10 on HP c7000 blade enclosure.

but from VMware wise both vSwitch policy are exactly the same.
0

Featured Post

Optimizing Cloud Backup for Low Bandwidth

With cloud storage prices going down a growing number of SMBs start to use it for backup storage. Unfortunately, business data volume rarely fits the average Internet speed. This article provides an overview of main Internet speed challenges and reveals backup best practices.

Question has a verified solution.

If you are experiencing a similar issue, please ask a related question

Suggested Solutions

When you try to share a printer , you may receive one of the following error messages. Error message when you use the Add Printer Wizard to share a printer: Windows could not share your printer. Operation could not be completed (Error 0x000006…
In this article, I am going to show you how to simulate a multi-site Lab environment on a single Hyper-V host. I use this method successfully in my own lab to simulate three fully routed global AD Sites on a Windows 10 Hyper-V host.
Viewers will learn how to connect to a wireless network using the network security key. They will also learn how to access the IP address and DNS server for connections that must be done manually. After setting up a router, find the network security…
This Micro Tutorial steps you through the configuration steps to configure your ESXi host Management Network settings and test the management network, ensure the host is recognized by the DNS Server, configure a new password, and the troubleshooting…

948 members asked questions and received personalized solutions in the past 7 days.

Join the community of 500,000 technology professionals and ask your questions.

Join & Ask a Question

Need Help in Real-Time?

Connect with top rated Experts

20 Experts available now in Live!

Get 1:1 Help Now