Solved

ARP Storm Taking Down Default Gateway

Posted on 2013-01-05
Last Modified: 2013-01-08
We have been experiencing a problem in our local network where the default gateway is being taken down due to what appears to be an ARP storm.

Originally, the default gateway was a Cisco 2851 router that routed traffic between several VLANs and had its default route pointing to our Cisco ASA. Both the 2851 and the ASA were connected to a Cisco 2560 switch.

When the outage occurred, we lost all routing through the 2851, although we could still access it via Telnet. Clearing the ARP cache would instantly bring all functionality back. We saw a large number of ARP requests coming in (thousands per minute), and routing would go back down within about 15 minutes.

To test, we changed the default gateway (set by DHCP) to the ASA router. We experienced the same behaviour of ARP traffic and it would take down the internal interface of the ASA. Clearing ARP instantly brought all functionality back.

We also tried setting up a temporary internet gateway using a Cradlepoint router hooked to a Verizon aircard. It was connected through an intermediate HP switch that was connected to the 2560 switch. After an hour or so, the Cradlepoint was overwhelmed and also went down.

A little more information: We experienced this behaviour two days in a row. Communication within the same subnet worked fine. Routing would go down around 9:30 AM each day, and everything would settle down and become stable around 4:30 PM.

We think the problem originates from a laptop: it only starts when the employee arrives at work, and it stops when the employee leaves with the laptop.

Is there any other likely cause of this problem? If it is a laptop, what is the best way to handle it? We can wait until it starts happening again on Monday and disconnect switches and ports until we identify the culprit. However, I'd like to prevent any more downtime.

Thanks in advance.
Question by:HunterIT
2 Comments
 
LVL 22

Accepted Solution

by:
mcsween earned 500 total points
ID: 38747231
I would enable storm control on all access ports on all switches. When the culprit starts flooding, the switch should shut down that port (it will show as err-disabled). Either the user will call to complain that they lost connectivity, or you can look on the switches to see which port has been shut down (show interfaces status).

To enable storm control, add the following to every access interface. You can use the interface range command to configure several ports at once. You do not want to set this on your trunks.
storm-control broadcast level 20.00 5.00
storm-control multicast level 50.00 30.00
storm-control action shutdown
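
As a rough sketch of how the lines above could be applied to a block of ports in one pass: the port range FastEthernet0/1 - 24 and the 300-second recovery interval are assumptions for illustration, not values from the original post, and the optional errdisable recovery lines are an addition you may prefer to leave out while you are still hunting for the offending port.

```
! Hypothetical access-port range; substitute your own ports and skip trunks
configure terminal
interface range FastEthernet0/1 - 24
 storm-control broadcast level 20.00 5.00
 storm-control multicast level 50.00 30.00
 storm-control action shutdown
 exit
! Optional (assumption): re-enable a shut-down port automatically after 300 seconds
errdisable recovery cause storm-control
errdisable recovery interval 300
end
! Check the configured thresholds and see which ports have been shut down
show storm-control broadcast
show interfaces status err-disabled
```

One thing to check before relying on the percentages: a few thousand ARP requests per minute is very little bandwidth. At an assumed 6,000 requests per minute (100 packets per second) of 64-byte frames, that is roughly 51 kbps, far below 20% of a 100 Mbps port. If your platform and software version accept packet-per-second thresholds (storm-control broadcast level pps ...), a pps value may be more likely to trip on this kind of traffic.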


More information on storm control here
http://www.cisco.com/en/US/docs/switches/lan/catalyst2950/software/release/12.1_22ea/SCG/swtrafc.html#wp1229873
 
LVL 20

Expert Comment

by:rauenpc
ID: 38747966
Dynamic ARP inspection combined with DHCP snooping will mitigate this scenario, although it can be a lot of work to implement depending on your environment. The bottom line is that you need to find the source of the ARPs and remove it. I would guess a faulty NIC or, more likely, a virus.
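
For reference, a minimal sketch of what that could look like on a Cisco access switch, assuming the feature set on your switch supports it. VLAN 10, the uplink GigabitEthernet0/1, the port range and the 15 pps limit are all placeholders chosen for illustration, not values from this thread.

```
! Assumptions: clients live in VLAN 10, the uplink toward the DHCP server is Gi0/1
configure terminal
ip dhcp snooping
ip dhcp snooping vlan 10
ip arp inspection vlan 10
! Trust the uplink so DHCP replies and upstream ARP traffic are not dropped
interface GigabitEthernet0/1
 ip dhcp snooping trust
 ip arp inspection trust
 exit
! Rate-limit ARP on untrusted access ports (packets per second)
interface range FastEthernet0/1 - 24
 ip arp inspection limit rate 15
 exit
end
! Verify the learned bindings and watch for dropped ARP packets
show ip dhcp snooping binding
show ip arp inspection statistics vlan 10
```

Hosts with static addresses have no DHCP snooping binding, so their ARP packets would be dropped unless you add ARP ACLs or static bindings for them. That is a large part of the implementation work mentioned above.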