Backport mlx5e fix for tunnel offload
Affects | Status | Importance | Assigned to | Milestone
---|---|---|---|---
linux (Ubuntu) | Fix Released | Medium | Unassigned |
linux (Ubuntu Bionic) | Invalid | Undecided | Unassigned |
linux (Ubuntu Focal) | Fix Released | Medium | Tim Gardner |
linux (Ubuntu Groovy) | Fix Released | Medium | Unassigned |
linux (Ubuntu Hirsute) | Fix Released | Medium | Unassigned |
linux-azure (Ubuntu) | Confirmed | Undecided | Unassigned |
linux-azure (Ubuntu Bionic) | Invalid | Medium | Unassigned |
linux-azure (Ubuntu Focal) | Fix Released | Undecided | Tim Gardner |
linux-azure (Ubuntu Groovy) | Fix Released | Undecided | Unassigned |
linux-azure (Ubuntu Hirsute) | Confirmed | Undecided | Unassigned |
Bug Description
[SRU Justification]
We've discovered an issue on Ubuntu 20.04 when used with Kubernetes CNIs that offload Geneve tunnels: on Azure instances with accelerated networking, the kernel panics with the following errors:
[ 307.561223] mlx5_core 0001:00:02.0 enP1s1: Error cqe on cqn 0x200, ci 0x3d4, sqn 0x2c5, opcode 0xd, syndrome 0x2, vendor syndrome 0x68
[ 307.573864] mlx5_core 0001:00:02.0 enP1s1: ERR CQE on SQ: 0x2c5
[ 307.764902] mlx5_core 0001:00:02.0 enP1s1: Error cqe on cqn 0x200, ci 0x3d7, sqn 0x2c5, opcode 0xd, syndrome 0x2, vendor syndrome 0x68
[ 307.777332] mlx5_core 0001:00:02.0 enP1s1: ERR CQE on SQ: 0x2c5
[ 322.814393] mlx5_core 0001:00:02.0 enP1s1: Error cqe on cqn 0x218, ci 0x1a7, sqn 0x2bd, opcode 0xd, syndrome 0x2, vendor syndrome 0x68
[ 322.826685] mlx5_core 0001:00:02.0 enP1s1: ERR CQE on SQ: 0x2bd
NVIDIA fixed this issue in https:/
[Test Plan]
Spin up a Kubernetes CNI that uses Geneve offloading
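While running the test plan, one way to look for the failure signature is to scan the kernel log for the "Error cqe" lines quoted above. The following is a minimal sketch, not part of the original report: the function name `find_error_cqes` and the regular expression are assumptions modeled only on the log lines in this bug, and an empty result does not by itself prove the fix is effective.

```python
import re

# Pattern modeled on the "Error cqe" lines quoted in this report.
# Assumption: other kernel versions may format this message differently.
ERROR_CQE = re.compile(
    r"mlx5_core (?P<pci>\S+) (?P<iface>\S+): Error cqe on cqn (?P<cqn>0x[0-9a-f]+), "
    r"ci (?P<ci>0x[0-9a-f]+), sqn (?P<sqn>0x[0-9a-f]+), "
    r"opcode (?P<opcode>0x[0-9a-f]+), syndrome (?P<syndrome>0x[0-9a-f]+), "
    r"vendor syndrome (?P<vendor>0x[0-9a-f]+)"
)


def find_error_cqes(log_text):
    """Return one dict of parsed fields per 'Error cqe' line in log_text."""
    return [match.groupdict() for match in ERROR_CQE.finditer(log_text)]
```

The function takes kernel log text as a string, for example the captured output of `dmesg`, and returns the interface, send queue number, and syndrome values for each matching line.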
[Where problems could occur]
It's possible some traffic won't get Geneve acceleration. This patch has been backported to v5.10.y and v5.11.y.
description: updated
Changed in linux (Ubuntu Focal):
importance: Undecided → Medium
Changed in linux (Ubuntu Focal):
status: In Progress → Fix Committed
Changed in linux (Ubuntu Groovy):
importance: Undecided → Medium
status: Incomplete → In Progress
Changed in linux (Ubuntu Hirsute):
importance: Undecided → Medium
status: Incomplete → Fix Committed
Changed in linux (Ubuntu Bionic):
status: Confirmed → Invalid
Changed in linux-azure (Ubuntu Bionic):
status: In Progress → Invalid
assignee: Tim Gardner (timg-tpi) → nobody
Changed in linux (Ubuntu Hirsute):
status: Fix Committed → Fix Released
tags: added: verification-done-focal; removed: verification-needed-focal
Changed in linux (Ubuntu Groovy):
status: In Progress → Fix Committed
Can confirm this is also the case on Ubuntu 18.04.