mlx5_core ovs/ovn hardware offload - broken network connectivity between instances
Bug #1907451 reported by James Page
This bug affects 1 person
Affects | Status | Importance | Assigned to | Milestone
---|---|---|---|---
charm-ovn-chassis | New | Undecided | Unassigned |
Bug Description
Ubuntu: Focal
OpenStack: Ussuri
OVN: 20.03.1
OVS: 2.12.1
Mellanox ConnectX-5 Ex cards connected to 100G ToR Cumulus switches in an MLAG configuration.
Leaf and Spine network topology between racks.
Instances: 8 cores, Ubuntu Focal
For some instances, networking works fine; others cannot connect to instances on the same project network that run on other hypervisors.
For example, a ping succeeds for the first packet (which is not offloaded), but subsequent packets are lost, which suggests the issue lies somewhere in the offloaded flows on the eswitch.
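The symptom described above (first reply arrives, everything after it is lost) can be checked mechanically when testing many instance pairs. The following is a minimal sketch, assuming iputils-style `ping` output; the helper names and the sample text are hypothetical and were not captured from the affected systems.

```python
from __future__ import annotations

import re

# Matches the sequence number in reply lines such as
# "64 bytes from 192.0.2.10: icmp_seq=1 ttl=64 time=1.20 ms".
# Error lines ("Destination Host Unreachable") lack "bytes from" and are skipped.
_REPLY_RE = re.compile(r"bytes from .*icmp_seq=(\d+)")


def received_sequences(ping_output: str) -> list[int]:
    """Return the icmp_seq numbers of all echo replies in the ping output."""
    return [int(m.group(1)) for m in _REPLY_RE.finditer(ping_output)]


def only_first_reply(ping_output: str, sent: int) -> bool:
    """True when several requests were sent and only the first was answered."""
    return sent > 1 and received_sequences(ping_output) == [1]


# Hypothetical sample output for illustration only.
SAMPLE = """\
PING 192.0.2.10 (192.0.2.10) 56(84) bytes of data.
64 bytes from 192.0.2.10: icmp_seq=1 ttl=64 time=1.20 ms

--- 192.0.2.10 ping statistics ---
5 packets transmitted, 1 received, 80% packet loss, time 4080ms
"""

print(only_first_reply(SAMPLE, sent=5))  # True
```

A match is only a heuristic hint that the slow path works and the offloaded path does not; it does not prove where the packets are dropped.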
Observation:
With everything on Groovy and ct offload enabled, the problem does not occur. If I disable port security, the problem does occur, so perhaps incorrect flows are being created or the card is interpreting them incorrectly. The offloaded flows do look quite different when toggling port security on and off.
The non-offloaded instances still work fine in either case.
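
To compare the offloaded flows with port security on and off, something like the following sketch may help. It assumes `ovs-appctl dpctl/dump-flows type=offloaded` is available on the hypervisor and that each flow line carries the usual `packets:`, `bytes:` and `used:` counters; the helper names and the sample flow lines are hypothetical and not taken from the affected systems.

```python
from __future__ import annotations

import re
import subprocess

# Counters that change on every dump and would otherwise show up as noise.
_VOLATILE = re.compile(r"\b(?:packets|bytes|used):[^,]*,\s*")


def dump_flows(flow_type: str = "offloaded") -> list[str]:
    """Return datapath flow lines from ovs-appctl (requires OVS on the host)."""
    result = subprocess.run(
        ["ovs-appctl", "dpctl/dump-flows", f"type={flow_type}"],
        capture_output=True, text=True, check=True,
    )
    return result.stdout.splitlines()


def normalize(flows: list[str]) -> set[str]:
    """Strip volatile counters so otherwise identical flows compare equal."""
    return {_VOLATILE.sub("", line).strip() for line in flows if line.strip()}


def diff_flows(before: list[str], after: list[str]) -> tuple[list[str], list[str]]:
    """Return (flows only in 'before', flows only in 'after')."""
    a, b = normalize(before), normalize(after)
    return sorted(a - b), sorted(b - a)


# Hypothetical flow lines for illustration only.
before = [
    "in_port(2),eth_type(0x0800),ipv4(frag=no), packets:10, bytes:980, used:0.3s, actions:3",
]
after = [
    "in_port(2),eth_type(0x0800),ipv4(frag=no), packets:42, bytes:4116, used:1.1s, actions:3",
    "in_port(3),eth_type(0x0806), packets:1, bytes:42, used:2.0s, actions:drop",
]

removed, added = diff_flows(before, after)
print(removed)  # []
print(added)    # ['in_port(3),eth_type(0x0806), actions:drop']
```

On a hypervisor, `dump_flows()` would be called once with port security enabled and once with it disabled, and the two results passed to `diff_flows`.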