A few months ago we got 3 Dell PowerEdge™ R6515 machines and installed Ubuntu 20.04 LTS on them. After a few days, problems started showing up on the network interface for the internal network: the link drops and comes back a few seconds later. We have updated all the server firmware and checked the cables and the switch, but the problem persists. Digging deeper into the logs, we found this in dmesg (full log in the attached file):
[Sat Jan 30 09:57:07 2021] NETDEV WATCHDOG: eno2 (tg3): transmit queue 0 timed out
[Sat Jan 30 09:57:07 2021] tg3 0000:c1:00.1 eno2: transmit timed out, resetting
[Sat Jan 30 09:57:08 2021] tg3 0000:c1:00.1 eno2: Link is down
[Sat Jan 30 09:57:12 2021] tg3 0000:c1:00.1 eno2: Link is up at 1000 Mbps, full duplex
For some reason, the link is disconnected and reconnected a few seconds later. This happens randomly on all 3 servers and does not seem to correlate with the current traffic on that interface. The problem seems very similar to Bug #1331513, but this happens on an up-to-date Ubuntu 20.04 (last updated in the middle of last week).
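To see how often each interface is affected, the reset messages can be counted per interface. The following is only a minimal sketch: the function name `count_tg3_resets` is made up for illustration, and it assumes the log lines look exactly like the `dmesg -T` excerpt above (interface name directly before "transmit timed out, resetting").

```shell
#!/bin/sh
# Hypothetical helper (not part of any package): reads dmesg-style text on
# stdin and prints "<interface> <count>" for every tg3 transmit-timeout reset.
count_tg3_resets() {
    awk '/tg3 .*transmit timed out, resetting/ {
             # The interface name is the field right before "transmit",
             # with a trailing colon that is stripped here.
             for (i = 2; i <= NF; i++)
                 if ($i == "transmit") {
                     ifname = $(i - 1)
                     sub(/:$/, "", ifname)
                     n[ifname]++
                 }
         }
         END { for (k in n) print k, n[k] }' | sort
}
```

It would be run as `dmesg -T | count_tg3_resets` on each of the servers. The NETDEV WATCHDOG line is deliberately not counted, so each reset is only counted once.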
If you need any additional info, feel free to ask.
ethtool --show-offload eno2
Features for eno2:
rx-checksumming: on
tx-checksumming: on
    tx-checksum-ipv4: on
    tx-checksum-ip-generic: off [fixed]
    tx-checksum-ipv6: on
    tx-checksum-fcoe-crc: off [fixed]
    tx-checksum-sctp: off [fixed]
scatter-gather: on
    tx-scatter-gather: on
    tx-scatter-gather-fraglist: off [fixed]
tcp-segmentation-offload: on
    tx-tcp-segmentation: on
    tx-tcp-ecn-segmentation: on
    tx-tcp-mangleid-segmentation: off
    tx-tcp6-segmentation: on
generic-segmentation-offload: on
generic-receive-offload: on
large-receive-offload: off [fixed]
rx-vlan-offload: on [fixed]
tx-vlan-offload: on [fixed]
ntuple-filters: off [fixed]
receive-hashing: off [fixed]
highdma: on
rx-vlan-filter: off [fixed]
vlan-challenged: off [fixed]
tx-lockless: off [fixed]
netns-local: off [fixed]
tx-gso-robust: off [fixed]
tx-fcoe-segmentation: off [fixed]
tx-gre-segmentation: off [fixed]
tx-gre-csum-segmentation: off [fixed]
tx-ipxip4-segmentation: off [fixed]
tx-ipxip6-segmentation: off [fixed]
tx-udp_tnl-segmentation: off [fixed]
tx-udp_tnl-csum-segmentation: off [fixed]
tx-gso-partial: off [fixed]
tx-sctp-segmentation: off [fixed]
tx-esp-segmentation: off [fixed]
tx-udp-segmentation: off [fixed]
fcoe-mtu: off [fixed]
tx-nocache-copy: off
loopback: off [fixed]
rx-fcs: off [fixed]
rx-all: off [fixed]
tx-vlan-stag-hw-insert: off [fixed]
rx-vlan-stag-hw-parse: off [fixed]
rx-vlan-stag-filter: off [fixed]
l2-fwd-offload: off [fixed]
hw-tc-offload: off [fixed]
esp-hw-offload: off [fixed]
esp-tx-csum-hw-offload: off [fixed]
rx-udp_tunnel-port-offload: off [fixed]
tls-hw-tx-offload: off [fixed]
tls-hw-rx-offload: off [fixed]
rx-gro-hw: off [fixed]
tls-hw-record: off [fixed]
Additional info:
------- ------- ------- ------- ------- ---
linux-firmware/now 1.187.8 all
linux-image-5.4.0-64-generic/focal-updates,focal-updates,now 5.4.0-64.72 amd64
linux-image-generic/now 5.4.0.64.67 amd64
------- ------- ------- ------- ------- ---
lspci -nnk | grep -iA2 net
c1:00.0 Ethernet controller [0200]: Broadcom Inc. and subsidiaries NetXtreme BCM5720 2-port Gigabit Ethernet PCIe [14e4:165f]
DeviceName: NIC1
Subsystem: Dell PowerEdge R6515/R7515 LOM [1028:08fd]
--
c1:00.1 Ethernet controller [0200]: Broadcom Inc. and subsidiaries NetXtreme BCM5720 2-port Gigabit Ethernet PCIe [14e4:165f]
DeviceName: NIC2
Subsystem: Dell PowerEdge R6515/R7515 LOM [1028:08fd]
------- ------- ------- ------- ------- ---
modinfo tg3 | grep ^version
version: 3.137
------- ------- ------- ------- ------- ---
lshw -C network
*-network:0
description: Ethernet interface
product: NetXtreme BCM5720 2-port Gigabit Ethernet PCIe
vendor: Broadcom Inc. and subsidiaries
physical id: 0
bus info: pci@0000:c1:00.0
logical name: eno1
version: 00
serial: 34:48:ed:ef:07:0e
size: 1Gbit/s
capacity: 1Gbit/s
width: 64 bits
clock: 33MHz
capabilities: pm vpd msi msix pciexpress bus_master cap_list rom ethernet physical tp 10bt 10bt-fd 100bt 100bt-fd 1000bt 1000bt-fd autonegotiation
configuration: autonegotiation=on broadcast=yes driver=tg3 driverversion=3.137 duplex=full firmware=FFV21.60.16 bc 5720-v1.39 ip=213.133.114.208 latency=0 link=yes multicast=yes port=twisted pair speed=1Gbit/s
resources: irq:160 memory:c7030000-c703ffff memory:c7040000-c704ffff memory:c7050000-c705ffff memory:c7100000-c713ffff
*-network:1
description: Ethernet interface
product: NetXtreme BCM5720 2-port Gigabit Ethernet PCIe
vendor: Broadcom Inc. and subsidiaries
physical id: 0.1
bus info: pci@0000:c1:00.1
logical name: eno2
version: 00
serial: 34:48:ed:ef:07:0f
size: 1Gbit/s
capacity: 1Gbit/s
width: 64 bits
clock: 33MHz
capabilities: pm vpd msi msix pciexpress bus_master cap_list rom ethernet physical tp 10bt 10bt-fd 100bt 100bt-fd 1000bt 1000bt-fd autonegotiation
configuration: autonegotiation=on broadcast=yes driver=tg3 driverversion=3.137 duplex=full firmware=FFV21.60.16 bc 5720-v1.39 ip=10.0.1.6 latency=0 link=yes multicast=yes port=twisted pair speed=1Gbit/s
resources: irq:163 memory:c7000000-c700ffff memory:c7010000-c701ffff memory:c7020000-c702ffff memory:c7140000-c717ffff