Activity log for bug #1923668

Date Who What changed Old value New value Message
2021-04-13 21:43:02 Michael Skalka bug added bug
2021-04-13 21:46:22 Michael Skalka tags openstack-upgrade
2021-04-13 21:47:14 Michael Skalka attachment added juju-crashdump-cd63074f-f189-4fa0-a532-9bdf6e6b0280.tar.xz https://bugs.launchpad.net/charm-neutron-openvswitch/+bug/1923668/+attachment/5487443/+files/juju-crashdump-cd63074f-f189-4fa0-a532-9bdf6e6b0280.tar.xz
2021-04-13 21:59:25 Michael Skalka description

Old value:

While upgrading a cloud from Queens to Rocky, I attempted to flush a hypervisor using live-migrate to avoid service disruption on the final unit of nova-compute. The action queues up in the dashboard; however, it completes with the instance remaining on the same host. Looking into the nova-compute logs from that instance, it seems that the target host could not create the tap:

/var/log/nova/nova-compute.log:
    2021-04-13 21:12:50.464 1286276 WARNING nova.compute.resource_tracker [req-b1cea8db-be1e-4252-9e31-c78d097ad671 - - - - -] [instance: e341e106-5bec-4048-a76e-03ef0c70441c] Instance not resizing, skipping migration.
    2021-04-13 21:12:50.658 1286276 INFO nova.compute.resource_tracker [req-b1cea8db-be1e-4252-9e31-c78d097ad671 - - - - -] Final resource view: name=flagler.playground.solutionsqa phys_ram=32123MB used_ram=18432MB phys_disk=361GB used_disk=20GB total_vcpus=12 used_vcpus=1 pci_stats=[]
    2021-04-13 21:13:02.025 1286276 ERROR nova.virt.libvirt.driver [req-06db27eb-b304-4969-b1e2-cbd0d80094ca d966ea789bfe431fb5863da1e72d6e49 80545c41a5db45d98d6adf7083c4914b - 9580fece017f4adf9b4ff1aa2bf836c8 9580fece017f4adf9b4ff1aa2bf836c8] [instance: e341e106-5bec-4048-a76e-03ef0c70441c] Live Migration failure: internal error: Unable to add port tap9c8d13c9-8a to OVS bridge br-int: libvirtError: internal error: Unable to add port tap9c8d13c9-8a to OVS bridge br-int
    2021-04-13 21:13:02.187 1286276 ERROR nova.virt.libvirt.driver [req-06db27eb-b304-4969-b1e2-cbd0d80094ca d966ea789bfe431fb5863da1e72d6e49 80545c41a5db45d98d6adf7083c4914b - 9580fece017f4adf9b4ff1aa2bf836c8 9580fece017f4adf9b4ff1aa2bf836c8] [instance: e341e106-5bec-4048-a76e-03ef0c70441c] Migration operation has aborted
    2021-04-13 21:13:02.364 1286276 INFO nova.compute.manager [req-06db27eb-b304-4969-b1e2-cbd0d80094ca d966ea789bfe431fb5863da1e72d6e49 80545c41a5db45d98d6adf7083c4914b - 9580fece017f4adf9b4ff1aa2bf836c8 9580fece017f4adf9b4ff1aa2bf836c8] [instance: e341e106-5bec-4048-a76e-03ef0c70441c] Swapping old allocation on 5a94928b-fb98-401f-bdd9-aa2f9f08602c held by migration 44727a6b-3417-4df3-9ca9-5b52e2e0f487 for instance
    2021-04-13 21:13:04.381 1286276 WARNING nova.compute.manager [req-2f77835b-38ab-45b9-8acd-38a98ff3fcfc 6cad752c2b9744d6aac17fb26522004c d1aed1922a5a4a7899cae3e3afb6bc90 - c1a08b45ef134260be7501e96bc9ee3d c1a08b45ef134260be7501e96bc9ee3d] [instance: e341e106-5bec-4048-a76e-03ef0c70441c] Received unexpected event network-vif-unplugged-9c8d13c9-8a96-49e0-834a-3c512f1990cb for instance with vm_state active and task_state None.
    2021-04-13 21:13:05.836 1286276 WARNING nova.compute.manager [req-66d4ddc6-4ac8-4c1a-8007-582d599da366 6cad752c2b9744d6aac17fb26522004c d1aed1922a5a4a7899cae3e3afb6bc90 - c1a08b45ef134260be7501e96bc9ee3d c1a08b45ef134260be7501e96bc9ee3d] [instance: e341e106-5bec-4048-a76e-03ef0c70441c] Received unexpected event network-vif-plugged-9c8d13c9-8a96-49e0-834a-3c512f1990cb for instance with vm_state active and task_state None.

Looking at the target unit, the ovs-vswitchd service is not even running on a number of the units: https://pastebin.ubuntu.com/p/YhdTQRRGb4/

Restarting the ovs-vswitchd service on those hosts restores the ability to migrate.

New value:

While upgrading a cloud from Queens to Rocky, I attempted to flush a hypervisor using live-migrate to avoid service disruption on the final unit of nova-compute. The action queues up in the dashboard; however, it completes with the instance remaining on the same host. Looking into the nova-compute logs from that instance, it seems that the target host could not create the tap:

/var/log/nova/nova-compute.log:
    2021-04-13 21:12:50.464 1286276 WARNING nova.compute.resource_tracker [req-b1cea8db-be1e-4252-9e31-c78d097ad671 - - - - -] [instance: e341e106-5bec-4048-a76e-03ef0c70441c] Instance not resizing, skipping migration.
    2021-04-13 21:12:50.658 1286276 INFO nova.compute.resource_tracker [req-b1cea8db-be1e-4252-9e31-c78d097ad671 - - - - -] Final resource view: name=flagler.playground.solutionsqa phys_ram=32123MB used_ram=18432MB phys_disk=361GB used_disk=20GB total_vcpus=12 used_vcpus=1 pci_stats=[]
    2021-04-13 21:13:02.025 1286276 ERROR nova.virt.libvirt.driver [req-06db27eb-b304-4969-b1e2-cbd0d80094ca d966ea789bfe431fb5863da1e72d6e49 80545c41a5db45d98d6adf7083c4914b - 9580fece017f4adf9b4ff1aa2bf836c8 9580fece017f4adf9b4ff1aa2bf836c8] [instance: e341e106-5bec-4048-a76e-03ef0c70441c] Live Migration failure: internal error: Unable to add port tap9c8d13c9-8a to OVS bridge br-int: libvirtError: internal error: Unable to add port tap9c8d13c9-8a to OVS bridge br-int
    2021-04-13 21:13:02.187 1286276 ERROR nova.virt.libvirt.driver [req-06db27eb-b304-4969-b1e2-cbd0d80094ca d966ea789bfe431fb5863da1e72d6e49 80545c41a5db45d98d6adf7083c4914b - 9580fece017f4adf9b4ff1aa2bf836c8 9580fece017f4adf9b4ff1aa2bf836c8] [instance: e341e106-5bec-4048-a76e-03ef0c70441c] Migration operation has aborted
    2021-04-13 21:13:02.364 1286276 INFO nova.compute.manager [req-06db27eb-b304-4969-b1e2-cbd0d80094ca d966ea789bfe431fb5863da1e72d6e49 80545c41a5db45d98d6adf7083c4914b - 9580fece017f4adf9b4ff1aa2bf836c8 9580fece017f4adf9b4ff1aa2bf836c8] [instance: e341e106-5bec-4048-a76e-03ef0c70441c] Swapping old allocation on 5a94928b-fb98-401f-bdd9-aa2f9f08602c held by migration 44727a6b-3417-4df3-9ca9-5b52e2e0f487 for instance
    2021-04-13 21:13:04.381 1286276 WARNING nova.compute.manager [req-2f77835b-38ab-45b9-8acd-38a98ff3fcfc 6cad752c2b9744d6aac17fb26522004c d1aed1922a5a4a7899cae3e3afb6bc90 - c1a08b45ef134260be7501e96bc9ee3d c1a08b45ef134260be7501e96bc9ee3d] [instance: e341e106-5bec-4048-a76e-03ef0c70441c] Received unexpected event network-vif-unplugged-9c8d13c9-8a96-49e0-834a-3c512f1990cb for instance with vm_state active and task_state None.
    2021-04-13 21:13:05.836 1286276 WARNING nova.compute.manager [req-66d4ddc6-4ac8-4c1a-8007-582d599da366 6cad752c2b9744d6aac17fb26522004c d1aed1922a5a4a7899cae3e3afb6bc90 - c1a08b45ef134260be7501e96bc9ee3d c1a08b45ef134260be7501e96bc9ee3d] [instance: e341e106-5bec-4048-a76e-03ef0c70441c] Received unexpected event network-vif-plugged-9c8d13c9-8a96-49e0-834a-3c512f1990cb for instance with vm_state active and task_state None.

Looking at the target unit, the ovs-vswitchd service is not even running on a number of the units: https://pastebin.ubuntu.com/p/YhdTQRRGb4/

Restarting the ovs-vswitchd service on those hosts restores the ability to migrate.

In each attempt, the source of the instance was flagler and the destination was everitt, which are machines 6 and 3 in the attached crashdump, respectively.
2021-04-14 13:26:25 Michael Skalka summary Upgrade from Queens to Rocky results in dead ovs-vsswitchd services Upgrade from Queens to Rocky results in dead ovs-vswitchd services
2021-04-19 02:59:00 Billy Olsen attachment added crash file from system https://bugs.launchpad.net/charm-neutron-openvswitch/+bug/1923668/+attachment/5489633/+files/_usr_lib_openvswitch-switch_ovs-vswitchd.0.crash
2021-06-18 02:21:21 Billy Olsen bug task added openvswitch (Ubuntu)
2021-06-18 02:22:32 Billy Olsen nominated for series Ubuntu Focal
2021-06-18 02:22:32 Billy Olsen bug task added openvswitch (Ubuntu Focal)
2021-06-18 02:22:39 Billy Olsen openvswitch (Ubuntu Focal): status New Fix Released
2021-06-18 02:22:49 Billy Olsen openvswitch (Ubuntu): status New Fix Released
2021-06-18 02:25:49 Billy Olsen bug task added cloud-archive
2021-06-18 02:27:45 Billy Olsen nominated for series cloud-archive/rocky
2021-06-18 02:27:45 Billy Olsen bug task added cloud-archive/rocky
2021-06-18 02:27:55 Billy Olsen cloud-archive: status New Triaged
2021-06-18 02:27:57 Billy Olsen cloud-archive/rocky: status New Triaged
2021-06-18 02:28:02 Billy Olsen charm-neutron-openvswitch: status New Invalid
2021-06-18 02:28:09 Billy Olsen cloud-archive/rocky: importance Undecided High
2021-10-25 08:07:12 Chris MacNaughton cloud-archive/rocky: assignee Chris MacNaughton (chris.macnaughton)
2021-10-25 10:53:47 Launchpad Janitor merge proposal linked https://code.launchpad.net/~chris.macnaughton/ubuntu/+source/openvswitch/+git/openvswitch/+merge/410730
2021-10-25 10:55:08 Launchpad Janitor merge proposal linked https://code.launchpad.net/~chris.macnaughton/ubuntu/+source/openvswitch/+git/openvswitch/+merge/410731
2021-10-25 15:08:05 Chris MacNaughton cloud-archive/rocky: status Triaged Fix Committed
2021-10-25 15:08:07 Chris MacNaughton tags openstack-upgrade openstack-upgrade verification-rocky-needed
2023-05-26 13:47:53 Corey Bryant cloud-archive: status Triaged Invalid