Kernel panic when using KVM and mlx4_en driver (when bonding and sriov enabled)
Affects | Status | Importance | Assigned to | Milestone
---|---|---|---|---
linux (Ubuntu) | Triaged | High | Unassigned |
Xenial | Triaged | High | Unassigned |
Artful | Won't Fix | High | Unassigned |
Bionic | Triaged | High | Unassigned |
Bug Description
##### System information #####
# uname -a
Linux m5c37 4.13.0-36-generic #40~16.04.1-Ubuntu SMP Fri Feb 16 23:25:58 UTC 2018 x86_64 x86_64 x86_64 GNU/Linux
# cat /etc/os-release
NAME="Ubuntu"
VERSION=
ID=ubuntu
ID_LIKE=debian
PRETTY_
VERSION_
HOME_URL="http://
SUPPORT_URL="http://
BUG_
VERSION_
UBUNTU_
# ethtool -i eno1
driver: mlx4_en
version: 4.3-1.0.1
firmware-
expansion-
bus-info: 0000:11:00.0
supports-
supports-test: yes
supports-
supports-
supports-
# ethtool -i bond0
driver: bonding
version: 3.7.1
firmware-
expansion-
bus-info:
supports-
supports-test: no
supports-
supports-
supports-
# ethtool -i vmbr0
driver: bridge
version: 2.3
firmware-
expansion-
bus-info: N/A
supports-
supports-test: no
supports-
supports-
supports-
Mellanox driver was installed from
http://
./mlnxofedi
##### Steps to reproduce #####
This is my /etc/network/
auto lo
iface lo inet loopback
auto openibd
iface openibd inet manual
pre-up /etc/init.d/openibd start
auto bond0
iface bond0 inet manual
pre-up ip link add bond0 type bond || true
pre-up ip link set bond0 down
pre-up ip link set bond0 type bond mode active-backup arp_interval 2000 arp_ip_target 10.36.0.1 arp_validate 3 primary eno1
pre-up ip link set eno1 down
pre-up ip link set eno1d1 down
pre-up ip link set eno1 master bond0
pre-up ip link set eno1d1 master bond0
pre-up ip link set bond0 up
auto vmbr0
iface vmbr0 inet static
address 10.36.128.217
netmask 255.255.0.0
gateway 10.36.0.1
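Before starting a guest, it may help to confirm which interfaces bond0 actually enslaved. The following is a minimal sketch, assuming the bonding driver exposes its usual status file at /proc/net/bonding/bond0; the function name `bond_summary` is hypothetical and not part of the original report.

```shell
#!/bin/sh
# Hypothetical helper (not from the original report): summarize a bonding
# status file. Defaults to /proc/net/bonding/bond0, the bond configured above.
bond_summary() {
    status_file="${1:-/proc/net/bonding/bond0}"
    # Fields in the status file are "Key: value", so split on ": ".
    awk -F': ' '
        /^Bonding Mode:/           { print "mode=" $2 }
        /^Currently Active Slave:/ { print "active=" $2 }
        /^Slave Interface:/        { print "slave=" $2 }
    ' "$status_file"
}
```

Running `bond_summary` on the host prints one line per field found. With the configuration above, both eno1 and eno1d1 should be listed as slaves if the `pre-up` commands succeeded.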
I execute these commands:
wget http://
qemu-
After a few moments the kernel hangs, and these messages appear on the console:
[74390.187908] mlx4_core 0000:11:00.0: bond for multifunction failed
[74390.486476] mlx4_en: eno1d1: Fail to bond device
[74390.750758] cache_from_obj: Wrong slab cache. kmalloc-256 but object is from kmalloc-192
[74391.152326] general protection fault: 0000 [#1] SMP PTI
[74391.410424] cache_from_obj: Wrong slab cache. kmalloc-256 but object is from kmalloc-192
The kernel trace log is attached.
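Because the machine hangs, the messages usually have to be pulled out of a saved console or netconsole capture afterwards. The following is a rough sketch of a filter for the failure signatures quoted above; the function name `mlx4_bond_signatures` and the capture file name `console.log` are assumptions for illustration only.

```shell
#!/bin/sh
# Hypothetical helper (not from the original report): keep only the lines that
# match the failure signatures quoted in this bug. Reads log text on stdin.
mlx4_bond_signatures() {
    grep -E 'bond for multifunction failed|Fail to bond device|cache_from_obj: Wrong slab cache|general protection fault'
}
```

For example, `mlx4_bond_signatures < console.log` prints the matching lines with their timestamps, which makes it easier to compare the order of events between runs.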
ProblemType: Bug
DistroRelease: Ubuntu 16.04
Package: linux-image-
ProcVersionSign
Uname: Linux 4.13.0-36-generic x86_64
ApportVersion: 2.20.1-0ubuntu2.15
Architecture: amd64
Date: Mon Mar 12 19:59:16 2018
ProcEnviron:
TERM=xterm-
PATH=(custom, no user)
LANG=C
SHELL=/bin/bash
SourcePackage: linux-hwe
UpgradeStatus: No upgrade log present (probably fresh install)
summary:
- Kernel panic when using KVM and Mellanox OFED driver (bonding and sriov
+ Kernel panic when using KVM and mlx4_en driver (when bonding and sriov enabled)
description: updated
I tried loading the modules directly instead of using the /etc/init.d/openibd script:
rmmod mlx4_en mlx4_core
modprobe mlx4_core num_vfs=1 port_type_array=2,2 probe_vf=1
The result is the same:
[ 193.469331] mlx4_core 0000:11:00.0: bond for multifunction failed netlink xt_nat xt_tcpudp xt_recent ip_set nfnetlink ip_vs rpcsec_gss_krb5 auth_rpcgss nfsv4 nfs lockd grace sunrpc fscache xt_comment xt_mark netconsole mlx4_ib(OE) mlx4_en(OE) mlx4_core(OE) ipt_MASQUERADE nf_nat_masquerade_ipv4 xfrm_user xfrm_algo iptable_nat nf_conntrack_ipv4 nf_defrag_ipv4 nf_nat_ipv4 xt_addrtype iptable_filter ip_tables xt_conntrack x_tables nf_nat nf_conntrack libcrc32c br_netfilter 8021q garp mrp ib_core(OE) mlx_compat(OE) bridge stp llc bonding ipmi_ssif intel_rapl x86_pkg_temp_thermal intel_powerclamp coretemp kvm_intel kvm irqbypass crct10dif_pclmul crc32_pclmul ghash_clmulni_intel pcbc aesni_intel aes_x86_64 hpilo crypto_simd
[ 193.770126] cache_from_obj: Wrong slab cache. kmalloc-256 but object is from kmalloc-192
[ 194.170171] mlx4_en: eno1d1: Fail to bond device
[ 194.170178] nf_reject_ipv4 ebtable_filter ebtables ip6table_filter ip6_tables xt_set ip_set_list_set ip_set_hash_net veth beegfs(OE) dummy nf_conntrack_
[ 194.170482] glue_helper cryptd intel_cstate mei_me ipmi_si ipmi_devintf ipmi_msghandler intel_rapl_perf shpchp mei acpi_power_meter mac_hid ie31200_edac knem(OE) autofs4 overlay nbd ptp pps_core i915 mgag200 video ttm i2c_algo_bit drm_kms_helper syscopyarea sysfillrect sysimgblt fb_sys_fops drm nvme ahci nvme_core libahci devlink [last unloaded: mlx4_core]
[ 194.170550] CPU: 0 PID: 7 Comm: ksoftirqd/0 Tainted: G W OE 4.13.0-36-generic #40~16.04.1-Ubuntu