Switches / Hubs
--
Questions
--
Followers
Top Experts
Before I start to shut down parts of the network to locate the problem, I'd like to hear your opinions on this one:
Layer2 campus network, mostly D-Link DES-3526 and other D-Link DES switches, some DGS-12xxT switches, some 3Com 4400 boxes, one core 3Com 4050.
Protocol of choice is RSTP, except for the DGS boxes, which can only run STP.
Edge ports configured, core 3Com is the root. Not a lot of redundant paths.
Network topology is pretty far from the 3-tier model, at some points diameter is around 7 L2 hops.
Problem: at every exact 5 minutes, most switches directly connected to the core (and 6-8 others further away) report a topology change. Always the same switches, pattern not changing. Some of the switches can't report STP topology changes to syslog, so they might or might not receive TCNs. All switches report the topology change for their root port.
Switches running the latest firmware, L3 routing is provided by a Linux box with Vyatta, attached vlanned to the 3Com 4050 in the core.
Based on the exact 5 minute pattern, I'm starting to think it has something to do with MAC aging, but that doesn't really make sense for STP...
Any ideas appreciated!
Tamas
Zero AI Policy
We believe in human intelligence. Our moderation policy strictly prohibits the use of LLM content in our Q&A threads.
The syslog messages report TCNs from the root ports, so those should be what the root bridge propagates after it receives the initial TCN from the bridge reporting the change, right?
Also, the TCN traveling upstream does not contain the originating bridge identifier in RSTP, right?
Flaky switch might be it, but flaky link with errors in exact 5 minutes for months?
Make sure ALL edge ports are defined.






EARN REWARDS FOR ASKING, ANSWERING, AND MORE.
Earn free swag for participating on the platform.
I'll try to check the STP statistics again on the 3Com core in the morning...
Select menu option (bridge): summ
stpVersion: 2 (RSTP) defaultPathCosts: 802.1D-1998
stpState: enabled agingTime: 300
Time since topology change: 0 hrs 7 mins 43 seconds
Topology Changes: 64741
Bridge Identifier: 1000 000a0496ed40
Designated Root: 1000 000a0496ed40
maxAge: 20 bridgeMaxAge: 20
helloTime: 2 bridgeHelloTime: 2
forwardDelay: 15 bridgeFwdDelay: 15
holdTime: 1 rootCost: 0
rootPort: No Port priority: 4096
Tried setting up snmp traps and syslog, but it does not log the changes...
This one has gone over 5 minutes... barely.
Without a mechanism that reports where the TCNs are coming from, it's really hard to troubleshoot. Have you tried a protocol analyzer?

Get a FREE t-shirt when you ask your first question.
We believe in human intelligence. Our moderation policy strictly prohibits the use of LLM content in our Q&A threads.
I can wireshark, I'm just not sure where... The initial TCN is only propagated upstream on the root ports, so I'd only see it if I put a laptop with 2 bridged NICs in between the core and the problem switch.
Port priority might be a good idea. I don't exactly see the cause-effect logic to develop 5 minute problems, but I'll check anyways.
Thanks for the ideas so far!
Tamas
You're lucky that this is happening every 5 five minutes. Otherwise it could take a long time. :-o






EARN REWARDS FOR ASKING, ANSWERING, AND MORE.
Earn free swag for participating on the platform.
D-Link DGS-3324SR distribution switches (two of them), directly connected to the 3Com 4050 layer2 core.
Protocol set to RSTP on both, all vlans tagged on link. These DLinks are MSTP-capable.
Both DGS-3324SR boxes think they are the root (they have default priority, core is 4096).
Around a dozen DES-3250TG switches connected to them, happily participating in RSTP with the DGS boxes.
Double-checked, no STP protection of any kind on any ports...
I'll try to permit an untagged, empty vlan between them, see what happens...
Protocol set to RSTP on both, all vlans tagged on link.Typically, the native VLAN carries the BPDUs. If all VLANs are tagged, then you don't have a native VLAN.
Both DGS-3324SR boxes think they are the rootThat would indicate the DLinks can't see the BPDU's from the 4050. Which is consistent with the previous point.
One guess would be that the DLinks send BPDUs both with and without tags, while the 3Com does not?

Get a FREE t-shirt when you ask your first question.
We believe in human intelligence. Our moderation policy strictly prohibits the use of LLM content in our Q&A threads.
I agree, but: between the DGS and DES boxes, all links are also fully tagged, no native vlan, but RSTP is still working.Sounds like DLink is doing it their own way. :-)
One guess would be that the DLinks send BPDUs both with and without tags, while the 3Com does not?I think that DLink sends and expects to receive BPDUs on a tagged, native VLAN. Where 3Com is looking for BPDUs on the untagged, native VLAN.
Enabled untagged vlan1 on ports between core 3Com and DLink DGS boxes: RSTP up and running between them.
Topology changes still persist... :(
Tracked it down to a D-Link DGS-1224 rev.C switch; which are known to be barely-manageable and fairly unstable. You can't even define edge ports on them...
Disabled STP temporarily (I know, I know...), budgeted for a replace, the topology is happy once again after 2+ years of constant change...






EARN REWARDS FOR ASKING, ANSWERING, AND MORE.
Earn free swag for participating on the platform.
Switches / Hubs
--
Questions
--
Followers
Top Experts
A switch is a device that filters and forwards packets of data between LAN segments. Switches operate at the data link layer or the network layer of the Open Systems Interconnection (OSI) Reference Model and therefore support any packet protocol. LANs that use switches to join segments are called switched LANs or, in the case of Ethernet networks, switched Ethernet LANs. A hub is a connection point for devices in a network. Hubs are commonly used to connect segments of a LAN. A hub contains multiple ports; when a packet arrives at one port, it is copied to the other ports so that all segments of the LAN can see all packets.