Instances with hardware offloaded ovs ports lose access after failed live migrations
Affects | Status | Importance | Assigned to | Milestone
---|---|---|---|---
OpenStack Compute (nova) | Fix Released | Medium | Erlon R. Cruz |
Ussuri | New | Undecided | Unassigned |
Victoria | New | Undecided | Unassigned |
Wallaby | In Progress | Undecided | Unassigned |
Xena | Fix Released | Undecided | Unassigned |
Yoga | Fix Released | Undecided | Unassigned |
neutron | Incomplete | Undecided | Unassigned |
Bug Description
If for some reason a live migration fails for an instance with an SRIOV port during the '_pre_live_
The instance regains connectivity on the source host only after a reboot (we don't know whether there is another way to restore connectivity). As a side effect of this behavior, the pre-live-migration cleanup hook also fails with:
PCI device 0000:3b:10.0 is in use by driver QEMU
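When checking compute logs for this failure, a small helper can extract the affected PCI address from each matching line. This is a minimal sketch that assumes the message has exactly the shape quoted above; the function name `find_pci_in_use` is hypothetical and not part of nova or libvirt.

```python
import re

# Pattern for the message quoted in this report, e.g.
# "PCI device 0000:3b:10.0 is in use by driver QEMU".
# Assumption: the address uses the domain:bus:slot.function form.
PCI_IN_USE = re.compile(
    r"PCI device (?P<addr>[0-9a-fA-F]{4}:[0-9a-fA-F]{2}:[0-9a-fA-F]{2}\.[0-7])"
    r" is in use by driver (?P<driver>\w+)"
)


def find_pci_in_use(lines):
    """Return (address, driver) pairs for every log line that matches."""
    hits = []
    for line in lines:
        match = PCI_IN_USE.search(line)
        if match:
            hits.append((match.group("addr"), match.group("driver")))
    return hits
```

The function accepts any iterable of strings, so an open log file can be passed directly.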
[How to reproduce]
- Create an environment with SRIOV (our case uses switchdev[1])
- Create 1 VM
- Provoke a failure in the _pre_live_migration process (for example creating a directory /var/lib/
- Check the VM's connectivity
- Check the logs for: libvirt.
Full-stack trace[2]
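The "Check the VM's connectivity" step can be made repeatable with a small probe loop that records when the VM stops and resumes answering. The sketch below is only an illustration: it assumes a Linux `ping` that accepts `-c` and `-W`, and the helper names (`ping_once`, `monitor`, `longest_outage`) are hypothetical.

```python
import subprocess
import time


def ping_once(host, timeout_s=1):
    """Send one ICMP echo with the system ping; True if it was answered."""
    # Assumption: Linux iputils ping, where -c is the packet count and
    # -W is the reply timeout in seconds.
    result = subprocess.run(
        ["ping", "-c", "1", "-W", str(timeout_s), host],
        stdout=subprocess.DEVNULL,
        stderr=subprocess.DEVNULL,
    )
    return result.returncode == 0


def monitor(host, duration_s=60, interval_s=1.0, probe=ping_once):
    """Probe host for duration_s seconds; return (timestamp, reachable) samples."""
    samples = []
    deadline = time.monotonic() + duration_s
    while time.monotonic() < deadline:
        samples.append((time.monotonic(), probe(host)))
        time.sleep(interval_s)
    return samples


def longest_outage(samples):
    """Return the longest span in seconds from a first failed probe to the
    next successful one. An outage still open at the end is measured up to
    the last sample."""
    longest = 0.0
    down_since = None
    for stamp, reachable in samples:
        if not reachable and down_since is None:
            down_since = stamp
        elif reachable and down_since is not None:
            longest = max(longest, stamp - down_since)
            down_since = None
    if down_since is not None and samples:
        longest = max(longest, samples[-1][0] - down_since)
    return longest
```

Running `monitor` against the VM's address while triggering the failed migration, then passing the result to `longest_outage`, separates a brief disconnection from the permanent loss described in this report.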
[Expected]
VM connectivity is restored even if it gets a brief disconnection
As happens for non-SRIOV scenarios, after a failure, no leftovers remain (port bindings and instance path files)
[Observed]
VM loses connectivity, which is only restored after the VM status is set to ERROR and the VM is power cycled
Port bindings are not removed
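One way to confirm the leftover bindings is to list the bindings of the instance's port and flag any that point at a host other than the source. The sketch below works on already-fetched data and makes assumptions: each binding is a dict with `host` and `status` keys (as we understand Neutron's port bindings listing to return), and the function name and host names are hypothetical.

```python
def check_bindings(bindings, source_host):
    """Split port bindings into leftovers and a source-host health flag.

    Assumption: each binding is a dict with "host" and "status" keys,
    where an in-use binding has status "ACTIVE".
    """
    # After a failed migration the instance stays on the source host,
    # so a binding for any other host is a leftover.
    leftovers = [b for b in bindings if b.get("host") != source_host]
    source_active = any(
        b.get("host") == source_host and b.get("status") == "ACTIVE"
        for b in bindings
    )
    return leftovers, source_active
```

For example, with bindings for hypothetical hosts `compute-1` (source, ACTIVE) and `compute-2` (INACTIVE), the function returns the `compute-2` binding as a leftover and `True` for the source flag.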
[Environment]
Focal Ussuri with Mellanox ConnectX-5 cards
[1] https:/
[2] https:/
description: updated
Changed in neutron:
status: New → Incomplete
tags: added: sriov-pci-pt
summary:
- Instances with SRIOV ports loose access after failed live migrations
+ Instances with hardware offloaded ovs ports loose access after failed live migrations
description: updated
Changed in nova:
status: Incomplete → In Progress
Changed in nova:
assignee: nobody → Erlon R. Cruz (sombrafam)
summary:
- Instances with hardware offloaded ovs ports loose access after failed live migrations
+ Instances with hardware offloaded ovs ports lose access after failed live migrations
tags: added: yoga-rc-potential
Changed in nova:
importance: Undecided → Medium
https://docs.openstack.org/neutron/latest/admin/config-sriov.html#known-limitations
This doc shows that live migration with SR-IOV ports was not supported before the OpenStack Train release. From Train onwards, please follow the note below to ensure that the guest interface is configured correctly.
"""
Indirect mode SR-IOV interfaces (vnic-type: macvtap or virtio-forwarder) can now be migrated transparently to the guest. Direct mode SR-IOV interfaces (vnic-type: direct or direct-physical) are detached before the migration and reattached after the migration so this is not transparent to the guest. To avoid loss of network connectivity when live migrating with direct mode sriov the user should create a failover bond in the guest with a transparently live migration port type e.g. vnic-type normal or indirect mode SR-IOV.
"""