Netdev watchdog closes link on PowerEdge R6515 Realtek BCM5720
Affects | Status | Importance | Assigned to | Milestone | |
---|---|---|---|---|---|
linux (Ubuntu) |
Confirmed
|
Undecided
|
Unassigned |
Bug Description
A few months ago we got 3 Dell PowerEdge™ R6515 machines and installed Ubuntu 20.04 LTS. After a few days problems started showing on the network interface for the internal network. The network link is disconnected, and a few seconds later it is connected again. We have updated all the server firmware, checked the cables and switch but the problem persists. Digging deeper in the logs we found this in dmesg (full log in the attached file):
[Sat Jan 30 09:57:07 2021] NETDEV WATCHDOG: eno2 (tg3): transmit queue 0 timed out
[Sat Jan 30 09:57:07 2021] tg3 0000:c1:00.1 eno2: transmit timed out, resetting
[Sat Jan 30 09:57:08 2021] tg3 0000:c1:00.1 eno2: Link is down
[Sat Jan 30 09:57:12 2021] tg3 0000:c1:00.1 eno2: Link is up at 1000 Mbps, full duplex
For some reason, the link is disconnected and reconnected a few seconds later. This happens randomly on all 3 servers and it does not seem to correlate with the current traffic on that interface. The problem seems to be very similar to Bug #1331513 but this happens on the up to date Ubutnu 20.04 (last updated mid last week).
If you need any additional info, feel free to ask.
Additional info:
-------
linux-firmware/now 1.187.8 all
linux-image-
linux-image-
-------
lspci -nnk | grep -iA2 net
c1:00.0 Ethernet controller [0200]: Broadcom Inc. and subsidiaries NetXtreme BCM5720 2-port Gigabit Ethernet PCIe [14e4:165f]
DeviceName: NIC1
Subsystem: Dell PowerEdge R6515/R7515 LOM [1028:08fd]
--
c1:00.1 Ethernet controller [0200]: Broadcom Inc. and subsidiaries NetXtreme BCM5720 2-port Gigabit Ethernet PCIe [14e4:165f]
DeviceName: NIC2
Subsystem: Dell PowerEdge R6515/R7515 LOM [1028:08fd]
-------
modinfo tg3 | grep ^version
version: 3.137
-------
lshw -C network
*-network:0
description: Ethernet interface
product: NetXtreme BCM5720 2-port Gigabit Ethernet PCIe
vendor: Broadcom Inc. and subsidiaries
physical id: 0
bus info: pci@0000:c1:00.0
logical name: eno1
version: 00
serial: 34:48:ed:ef:07:0e
size: 1Gbit/s
capacity: 1Gbit/s
width: 64 bits
clock: 33MHz
resources: irq:160 memory:
*-network:1
description: Ethernet interface
product: NetXtreme BCM5720 2-port Gigabit Ethernet PCIe
vendor: Broadcom Inc. and subsidiaries
physical id: 0.1
bus info: pci@0000:c1:00.1
logical name: eno2
version: 00
serial: 34:48:ed:ef:07:0f
size: 1Gbit/s
capacity: 1Gbit/s
width: 64 bits
clock: 33MHz
resources: irq:163 memory:
-------
ethtool --show-offload eno2
Features for eno2:
rx-checksumming: on
tx-checksumming: on
scatter-gather: on
tcp-segmentatio
generic-
generic-
large-receive-
rx-vlan-offload: on [fixed]
tx-vlan-offload: on [fixed]
ntuple-filters: off [fixed]
receive-hashing: off [fixed]
highdma: on
rx-vlan-filter: off [fixed]
vlan-challenged: off [fixed]
tx-lockless: off [fixed]
netns-local: off [fixed]
tx-gso-robust: off [fixed]
tx-fcoe-
tx-gre-
tx-gre-
tx-ipxip4-
tx-ipxip6-
tx-udp_
tx-udp_
tx-gso-partial: off [fixed]
tx-sctp-
tx-esp-
tx-udp-
fcoe-mtu: off [fixed]
tx-nocache-copy: off
loopback: off [fixed]
rx-fcs: off [fixed]
rx-all: off [fixed]
tx-vlan-
rx-vlan-
rx-vlan-
l2-fwd-offload: off [fixed]
hw-tc-offload: off [fixed]
esp-hw-offload: off [fixed]
esp-tx-
rx-udp_
tls-hw-tx-offload: off [fixed]
tls-hw-rx-offload: off [fixed]
rx-gro-hw: off [fixed]
tls-hw-record: off [fixed]
---
ProblemType: Bug
AlsaDevices:
total 0
crw-rw---- 1 root audio 116, 1 Jan 26 10:58 seq
crw-rw---- 1 root audio 116, 33 Jan 26 10:58 timer
AplayDevices: aplay: device_list:274: no soundcards found...
ApportVersion: 2.20.11-
Architecture: amd64
ArecordDevices: arecord: device_list:274: no soundcards found...
AudioDevicesInUse: Error: [Errno 2] No such file or directory: 'fuser'
CasperMD5CheckR
DistroRelease: Ubuntu 20.04
IwConfig: Error: [Errno 2] No such file or directory: 'iwconfig'
MachineType: Dell Inc. PowerEdge R6515
Package: linux (not installed)
PciMultimedia:
ProcEnviron:
TERM=xterm-
PATH=(custom, no user)
LANG=en_US.UTF-8
SHELL=/bin/bash
ProcFB:
ProcKernelCmdLine: BOOT_IMAGE=
ProcVersionSign
RelatedPackageV
linux-
linux-
linux-firmware 1.187.8
RfKill: Error: [Errno 2] No such file or directory: 'rfkill'
Tags: focal
Uname: Linux 5.4.0-64-generic x86_64
UpgradeStatus: No upgrade log present (probably fresh install)
UserGroups: N/A
_MarkForUpload: True
dmi.bios.date: 08/10/2020
dmi.bios.vendor: Dell Inc.
dmi.bios.version: 1.5.3 [Hetzner 1.0.0]
dmi.board.name: 0R4CNN
dmi.board.vendor: Dell Inc.
dmi.board.version: A00
dmi.chassis.type: 23
dmi.chassis.vendor: Dell Inc.
dmi.modalias: dmi:bvnDellInc.
dmi.product.family: PowerEdge
dmi.product.name: PowerEdge R6515
dmi.product.sku: SKU=NotProvided
dmi.sys.vendor: Dell Inc.
Hi.
I think you've filed this bug in the wrong place. Please file a public kernel bug here:
https:/ /bugs.launchpad .net/ubuntu/ +source/ linux/+ filebug
Thank you,
Jeff