Creating a fresh bug for this as we haven't seen it in a while since the libvirt event rewrite landed but it did just cause a gate failure...
https://storage.bhs.cloud.ovh.net/v1/AUTH_dcaab5e32b234d56b626f72581e3644c/zuul_opendev_logs_5d6/801990/6/gate/tempest-integrated-compute/5d6a7ec/testr_results.html
13 Aug 05 16:01:08.964854 ubuntu-focal-ovh-gra1-0025782813 nova-compute[104299]: ERROR nova.virt.libvirt.driver [None req-54714613-ee7e-4c9f-8e60-8251a77a526e tempest-AttachVolumeNegativeTest-1260221999 tempe st-AttachVolumeNegativeTest-1260221999-project] Waiting for libvirt event about the detach of device vdb with device alias virtio-disk1 from instance ce4b444f-1587-43e9-b708-d2ae2c9cd59f is timed out.
47114 Aug 05 16:01:08.969793 ubuntu-focal-ovh-gra1-0025782813 nova-compute[104299]: DEBUG nova.virt.libvirt.driver [None req-54714613-ee7e-4c9f-8e60-8251a77a526e tempest-AttachVolumeNegativeTest-1260221999 tempe st-AttachVolumeNegativeTest-1260221999-project] Failed to detach device vdb with device alias virtio-disk1 from instance ce4b444f-1587-43e9-b708-d2ae2c9cd59f from the live domain config. Libvirt did not re port any error but the device is still in the config. {{(pid=104299) _detach_from_live_with_retry /opt/stack/nova/nova/virt/libvirt/driver.py:2394}}
47115 Aug 05 16:01:08.970117 ubuntu-focal-ovh-gra1-0025782813 nova-compute[104299]: ERROR nova.virt.libvirt.driver [None req-54714613-ee7e-4c9f-8e60-8251a77a526e tempest-AttachVolumeNegativeTest-1260221999 tempe st-AttachVolumeNegativeTest-1260221999-project] Run out of retry while detaching device vdb with device alias virtio-disk1 from instance ce4b444f-1587-43e9-b708-d2ae2c9cd59f from the live domain config. De vice is still attached to the guest.
47116 Aug 05 16:01:08.971007 ubuntu-focal-ovh-gra1-0025782813 nova-compute[104299]: WARNING nova.virt.block_device [None req-54714613-ee7e-4c9f-8e60-8251a77a526e tempest-AttachVolumeNegativeTest-1260221999 tempe st-AttachVolumeNegativeTest-1260221999-project] [instance: ce4b444f-1587-43e9-b708-d2ae2c9cd59f] Guest refused to detach volume ff8d0867-1e92-4f04-8e52-01bf05f52ac3: nova.exception.DeviceDetachFailed: Devi ce detach failed for vdb: Run out of retry while detaching device vdb with device alias virtio-disk1 from instance ce4b444f-1587-43e9-b708-d2ae2c9cd59f from the live domain config. Device is still attached to the guest.
47117 Aug 05 16:01:09.024848 ubuntu-focal-ovh-gra1-0025782813 nova-compute[104299]: DEBUG oslo_concurrency.lockutils [None req-54714613-ee7e-4c9f-8e60-8251a77a526e tempest-AttachVolumeNegativeTest-1260221999 tem pest-AttachVolumeNegativeTest-1260221999-project] Lock "ce4b444f-1587-43e9-b708-d2ae2c9cd59f" released by "nova.compute.manager.ComputeManager.detach_volume.<locals>.do_detach_volume" :: held 201.180s {{(p id=104299) inner /usr/local/lib/python3.8/dist-packages/oslo_concurrency/lockutils.py:367}}
47118 Aug 05 16:01:09.095134 ubuntu-focal-ovh-gra1-0025782813 nova-compute[104299]: ERROR oslo_messaging.rpc.server [None req-54714613-ee7e-4c9f-8e60-8251a77a526e tempest-AttachVolumeNegativeTest-1260221999 temp est-AttachVolumeNegativeTest-1260221999-project] Exception during message handling: nova.exception.DeviceDetachFailed: Device detach failed for vdb: Run out of retry while detaching device vdb with device alias virtio-disk1 from instance ce4b444f-1587-43e9-b708-d2ae2c9cd59f from the live domain config. Device is still attached to the guest.
$ logsearch storedsearch bug-xxx-detach
Running stored search:
bug-xxx-detach:
branches:
- master
files:
- controller/logs/screen-n-cpu.txt
job-groups:
- nova-devstack
limit: 100
project: openstack/nova
regex: from the live domain config. Device is still attached to the guest.
result: FAILURE
[..]
5d6a7ec36f104f10a1f49fe44909aa23:.logsearch/5d6a7ec36f104f10a1f49fe44909aa23/controller/logs/screen-n-cpu.txt:47115:Aug 05 16:01:08.970117 ubuntu-focal-ovh-gra1-0025782813 nova-compute[104299]: ERROR nova.virt.libvirt.driver [None req-54714613-ee7e-4c9f-8e60-8251a77a526e tempest-AttachVolumeNegativeTest-1260221999 tempest-AttachVolumeNegativeTest-1260221999-project] Run out of retry while detaching device vdb with device alias virtio-disk1 from instance ce4b444f-1587-43e9-b708-d2ae2c9cd59f from the live domain config. Device is still attached to the guest.
5d6a7ec36f104f10a1f49fe44909aa23:.logsearch/5d6a7ec36f104f10a1f49fe44909aa23/controller/logs/screen-n-cpu.txt:47116:Aug 05 16:01:08.971007 ubuntu-focal-ovh-gra1-0025782813 nova-compute[104299]: WARNING nova.virt.block_device [None req-54714613-ee7e-4c9f-8e60-8251a77a526e tempest-AttachVolumeNegativeTest-1260221999 tempest-AttachVolumeNegativeTest-1260221999-project] [instance: ce4b444f-1587-43e9-b708-d2ae2c9cd59f] Guest refused to detach volume ff8d0867-1e92-4f04-8e52-01bf05f52ac3: nova.exception.DeviceDetachFailed: Device detach failed for vdb: Run out of retry while detaching device vdb with device alias virtio-disk1 from instance ce4b444f-1587-43e9-b708-d2ae2c9cd59f from the live domain config. Device is still attached to the guest.
5d6a7ec36f104f10a1f49fe44909aa23:.logsearch/5d6a7ec36f104f10a1f49fe44909aa23/controller/logs/screen-n-cpu.txt:47118:Aug 05 16:01:09.095134 ubuntu-focal-ovh-gra1-0025782813 nova-compute[104299]: ERROR oslo_messaging.rpc.server [None req-54714613-ee7e-4c9f-8e60-8251a77a526e tempest-AttachVolumeNegativeTest-1260221999 tempest-AttachVolumeNegativeTest-1260221999-project] Exception during message handling: nova.exception.DeviceDetachFailed: Device detach failed for vdb: Run out of retry while detaching device vdb with device alias virtio-disk1 from instance ce4b444f-1587-43e9-b708-d2ae2c9cd59f from the live domain config. Device is still attached to the guest.
5d6a7ec36f104f10a1f49fe44909aa23:.logsearch/5d6a7ec36f104f10a1f49fe44909aa23/controller/logs/screen-n-cpu.txt:47170:Aug 05 16:01:09.101298 ubuntu-focal-ovh-gra1-0025782813 nova-compute[104299]: ERROR oslo_messaging.rpc.server nova.exception.DeviceDetachFailed: Device detach failed for vdb: Run out of retry while detaching device vdb with device alias virtio-disk1 from instance ce4b444f-1587-43e9-b708-d2ae2c9cd59f from the live domain config. Device is still attached to the guest.
Builds with matching logs 1/100:
+----------------------------------+---------------------+-----------------------------------+----------+----------------------------+
| uuid | finished | review | pipeline | job |
+----------------------------------+---------------------+-----------------------------------+----------+----------------------------+
| 5d6a7ec36f104f10a1f49fe44909aa23 | 2021-08-05T16:25:36 | https://review.opendev.org/801990 | gate | tempest-integrated-compute |
+----------------------------------+---------------------+-----------------------------------+----------+----------------------------+
[ 0.000000] Linux version 5.3.0-26-generic (buildd@ lgw01-amd64- 039) (gcc version 7.4.0 (Ubuntu 7.4.0-1ubuntu1~ 18.04.1) ) #28~18.04.1-Ubuntu SMP Wed Dec 18 16:40:14 UTC 2019 (Ubuntu 5.3.0-26. 28~18.04. 1-generic 5.3.13) 000-0x000000000 009fbff] usable c00-0x000000000 009ffff] reserved 000-0x000000000 00fffff] reserved 000-0x000000000 7fdcfff] usable 000-0x000000000 7ffffff] reserved 000-0x00000000f fffffff] reserved 0x000f5c9f] 0x07fccfff] 000-0x000000000 7fdcfff] 0x0798dfff] 000-0x000000000 0ffffff] 000-0x000000000 7fdcfff] 000-0x000000000 009efff] 000-0x000000000 7fdcfff] 000-0x000000000 7fdcfff]
[ 0.000000] Command line: LABEL=cirros-rootfs ro console=tty1 console=ttyS0
[ 0.000000] KERNEL supported cpus:
[ 0.000000] Intel GenuineIntel
[ 0.000000] AMD AuthenticAMD
[ 0.000000] Hygon HygonGenuine
[ 0.000000] Centaur CentaurHauls
[ 0.000000] zhaoxin Shanghai
[ 0.000000] x86/fpu: x87 FPU will use FXSAVE
[ 0.000000] BIOS-provided physical RAM map:
[ 0.000000] BIOS-e820: [mem 0x0000000000000
[ 0.000000] BIOS-e820: [mem 0x000000000009f
[ 0.000000] BIOS-e820: [mem 0x00000000000f0
[ 0.000000] BIOS-e820: [mem 0x0000000000100
[ 0.000000] BIOS-e820: [mem 0x0000000007fdd
[ 0.000000] BIOS-e820: [mem 0x00000000fffc0
[ 0.000000] NX (Execute Disable) protection: active
[ 0.000000] SMBIOS 2.8 present.
[ 0.000000] DMI: OpenStack Foundation OpenStack Nova, BIOS 1.13.0-1ubuntu1.1 04/01/2014
[ 0.000000] tsc: Fast TSC calibration using PIT
[ 0.000000] tsc: Detected 2394.452 MHz processor
[ 0.026481] last_pfn = 0x7fdd max_arch_pfn = 0x400000000
[ 0.028543] x86/PAT: Configuration [0-7]: WB WC UC- UC WB WP UC- WT
[ 0.059293] found SMP MP-table at [mem 0x000f5c90-
[ 0.080002] check: Scanning 1 areas for low memory corruption
[ 0.087928] RAMDISK: [mem 0x0798e000-
[ 0.088855] ACPI: Early table checksum verification disabled
[ 0.089662] ACPI: RSDP 0x00000000000F5AA0 000014 (v00 BOCHS )
[ 0.090108] ACPI: RSDT 0x0000000007FE153C 000030 (v01 BOCHS BXPCRSDT 00000001 BXPC 00000001)
[ 0.091295] ACPI: FACP 0x0000000007FE1418 000074 (v01 BOCHS BXPCFACP 00000001 BXPC 00000001)
[ 0.092403] ACPI: DSDT 0x0000000007FE0040 0013D8 (v01 BOCHS BXPCDSDT 00000001 BXPC 00000001)
[ 0.092593] ACPI: FACS 0x0000000007FE0000 000040
[ 0.092735] ACPI: APIC 0x0000000007FE148C 000078 (v01 BOCHS BXPCAPIC 00000001 BXPC 00000001)
[ 0.092800] ACPI: HPET 0x0000000007FE1504 000038 (v01 BOCHS BXPCHPET 00000001 BXPC 00000001)
[ 0.096602] No NUMA configuration found
[ 0.096673] Faking a node at [mem 0x0000000000000
[ 0.097804] NODE_DATA(0) allocated [mem 0x07963000-
[ 0.104051] Zone ranges:
[ 0.104153] DMA [mem 0x0000000000001
[ 0.104229] DMA32 [mem 0x0000000001000
[ 0.104250] Normal empty
[ 0.104279] Device empty
[ 0.104302] Movable zone start for each node
[ 0.104370] Early memory node ranges
[ 0.104422] node 0: [mem 0x0000000000001
[ 0.104679] node 0: [mem 0x0000000000100
[ 0.105220] Zeroed struct page in unavailable ranges: 98 pages
[ 0.105486] Initmem setup node 0 [mem 0x0000000000001
[ 0.113353] ACPI: PM-Timer IO Port: 0x608
[ 0.11...