RX error and overruns error on 10Gb Broadcom adaptor in Redhat Linux

Hi, we are receiving RX errors on one port to storage, we have changed the cable and the port on the switch, there are no errors on the switch only on this side.

i have read this could be due to the kernel which is 2.6 not being able to keep up.

is there a way to resolve this?

eth5      Link encap:Ethernet  HWaddr 94:F1:28:A4:04:28
          inet addr:
          inet6 addr: fe80::96g1:28ff:fea2:428/64 Scope:Link
          UP BROADCAST RUNNING MULTICAST  MTU:1500  Metric:1
          RX packets:49090224 errors:171 dropped:0 overruns:171 frame:0
          TX packets:64057242 errors:0 dropped:0 overruns:0 carrier:0
          collisions:0 txqueuelen:1000
          RX bytes:45259514804 (42.1 GiB)  TX bytes:75797694806 (70.5 GiB)
          Interrupt:193 Memory:94000000-947fffff
Adam FlookSenior Technical Services EngineerAsked:
Who is Participating?

[Product update] Infrastructure Analysis Tool is now available with Business Accounts.Learn More

x
I wear a lot of hats...

"The solutions and answers provided on Experts Exchange have been extremely helpful to me over the last few years. I wear a lot of hats - Developer, Database Administrator, Help Desk, etc., so I know a lot of things but not a lot about one thing. Experts Exchange gives me answers from people who do know a lot about one thing, in a easy to use platform." -Todd S.

Gerald NegrotaIT ProfessionalCommented:
Just enable rx/tx flow control on both sides.

Experts Exchange Solution brought to you by

Your issues matter to us.

Facing a tech roadblock? Get the help and guidance you need from experienced professionals who care. Ask your question anytime, anywhere, with no hassle.

Start your 7-day free trial
David FavorLinux/LXD/WordPress/Hosting SavantCommented:
You said, "kernel which is 2.6 not being able to keep up".

Many 2.6 + 3.x kernels have zero days in them.

Also, the 3.13 + 4.X TCP enhancements have fixed many problem + give substantial performance improvements.

If possible, start with an install like Ubuntu Bionic (LTS - long term service) install, which will give you this Kernel...

4.15.X

Open in new window


Which has all zero days fixed + includes all recent TCP bug fixes + performance enhancements.

Note: You'll still have to follow Gerald's suggestion, which will likely fix your immediate problem.
Dr. KlahnPrincipal Software EngineerCommented:
I would not consider 171 errors to be excessive on a 10G adapter after moving 43 GB of data.

If the interface is locked down to 1G, I'd be willing to give good odds that the problem will go away.  Microwaves should be moved through waveguides, not over unshielded copper wires.
Adam FlookSenior Technical Services EngineerAuthor Commented:
Hi, the 171 errors were after 10 mins of run time. in terms of the RX/TX can this be enabled on a live port without downtime, the switch is a HPE 5920.
Gerald NegrotaIT ProfessionalCommented:
In theory, no downtime to enable flow control .
On HPE5920 (CLI), from manual:
# Enable TxRx mode generic flow control on the interface Ten-GigabitEthernet 1/0/1.
<Sysname> system-view
[Sysname] interface ten-gigabitethernet 1/0/1
[Sysname-Ten-GigabitEthernet1/0/1] flow-control


On RHEL box, use ethtool :
ethtool -A eth5 rx on
ethtool -A eth5 tx on


Be sure that you are on 10G link ( the switch support 1G !), because the flow control is supported only in duplex mode.
It's more than this solution.Get answers and train to solve all your tech problems - anytime, anywhere.Try it for free Edge Out The Competitionfor your dream job with proven skills and certifications.Get started today Stand Outas the employee with proven skills.Start learning today for free Move Your Career Forwardwith certification training in the latest technologies.Start your trial today
Linux

From novice to tech pro — start learning today.