nova fails to re-create mediated devices after reboot
Affects | Status | Importance | Assigned to | Milestone | |
---|---|---|---|---|---|
OpenStack Nova Compute NVIDIA vGPU Plugin Charm |
New
|
Undecided
|
Unassigned | ||
nova (Ubuntu) |
Confirmed
|
Undecided
|
Unassigned |
Bug Description
OpenStack Xena
Ubuntu 20.04
After a reboot of a nova-compute node with running instances with attached vgpu devices the nova-compute daemon fails to startup due to missing mediated device definitions.
It looks like the code intends to detect the missing devices and then re-create them but the libvirt python module throws an exception due to the missing mediated device when the domain definition is being inspected.
2022-06-08 07:24:27.061 2689 ERROR oslo_service.
2022-06-08 07:24:27.061 2689 ERROR oslo_service.
2022-06-08 07:24:27.061 2689 ERROR oslo_service.
2022-06-08 07:24:27.061 2689 ERROR oslo_service.
2022-06-08 07:24:27.061 2689 ERROR oslo_service.
2022-06-08 07:24:27.061 2689 ERROR oslo_service.
2022-06-08 07:24:27.061 2689 ERROR oslo_service.
2022-06-08 07:24:27.061 2689 ERROR oslo_service.
2022-06-08 07:24:27.061 2689 ERROR oslo_service.
2022-06-08 07:24:27.061 2689 ERROR oslo_service.
2022-06-08 07:24:27.061 2689 ERROR oslo_service.
2022-06-08 07:24:27.061 2689 ERROR oslo_service.
2022-06-08 07:24:27.061 2689 ERROR oslo_service.
2022-06-08 07:24:27.061 2689 ERROR oslo_service.
2022-06-08 07:24:27.061 2689 ERROR oslo_service.
2022-06-08 07:24:27.061 2689 ERROR oslo_service.
2022-06-08 07:24:27.061 2689 ERROR oslo_service.
2022-06-08 07:24:27.061 2689 ERROR oslo_service.
2022-06-08 07:24:27.061 2689 ERROR oslo_service.
2022-06-08 07:24:27.061 2689 ERROR oslo_service.
2022-06-08 07:24:27.061 2689 ERROR oslo_service.
2022-06-08 07:24:27.061 2689 ERROR oslo_service.
2022-06-08 07:24:27.061 2689 ERROR oslo_service.
2022-06-08 07:24:27.061 2689 ERROR oslo_service.
2022-06-08 07:24:27.061 2689 ERROR oslo_service.
2022-06-08 07:24:27.061 2689 ERROR oslo_service.
2022-06-08 07:24:27.061 2689 ERROR oslo_service.
2022-06-08 07:24:27.061 2689 ERROR oslo_service.
2022-06-08 07:24:27.061 2689 ERROR oslo_service.
2022-06-08 07:24:27.061 2689 ERROR oslo_service.
description: | updated |
description: | updated |
summary: |
- mediated devices missing after reboot + nova fails to re-create mediated devices after reboot |
description: | updated |
This is probably related to the libvirt version in use (6.0.0 from focal).
7.3 and upward have features to support persistence of mediated devices between reboots.