Link to home
Start Free TrialLog in
Avatar of Robert Cullen
Robert CullenFlag for United States of America

asked on

Did router reboot cause 5 hour outage to MPLS network

Client HQ lost contact to all branches.  Note: Hub and Spoke network.  Since this is a major problem for a financial services company I recommended they reboot the Level3 edge router.  This is a 10 Meg MPLS connection to the MPLS cloud that connects to the 20 branches.  In my limited and long time ago technical experience there should not have been any issue that would not be resolved after say 15 minutes.

Yes I have warned them that this is a single point of failure in the design.

Background - client is running BGP on a Cisco (1 year old) Core  router behind the Level3 edgar router.  The core router is running BGP and there are 2 other connections - both internet - 100 Mege TWT and 10 Meg Skyriver wireless.

It is interesting to me that Level3 would blame the router reboot as we were down before we rebooted the router.  Client agrees with my logic and is not blaming me.  But, what issues could I have caused.

Thanks
ASKER CERTIFIED SOLUTION
Avatar of Rafael
Rafael
Flag of United States of America image

Link to home
membership
This solution is only available to members.
To access this solution, you must be a member of Experts Exchange.
Start Free Trial
Avatar of Robert Cullen

ASKER

Rafael - this is just overview.  I am management and not hands on technical.  Thanks for your response.  It reassures me.  I hate to screw up a client's operations.
There are two reasons that I can think of.
The first reason could be that running configuration was not saved to startup config. So, when router was reloaded it lost running configuration than later they configured router again  (4 hours and 55 minutes later). :)
The second reason could be that they are running OSPF in their network and that, since it is hub and spoke topology (all areas must be attached to area 0), they have network design error where it is possible for one of spokes routers to become DR. When current DR (hub should be DR in hub and spoke OSPF topology, and there should be no BDR) was rebooted some spoke router become DR, and if DR is misplaced all communication was lost simply because there is no internal route updates.