update_available_resource will raise DiskNotFound after resize but before confirm
Affects | Status | Importance | Assigned to | Milestone | |
---|---|---|---|---|---|
OpenStack Compute (nova) |
Fix Released
|
Medium
|
Matthew Booth | ||
Ocata |
Triaged
|
Medium
|
Unassigned | ||
Pike |
Fix Released
|
Medium
|
Sasha Andonov | ||
Queens |
Fix Committed
|
Medium
|
Lee Yarwood | ||
Rocky |
Fix Released
|
Medium
|
Lee Yarwood | ||
Stein |
Fix Released
|
Medium
|
Lee Yarwood | ||
Train |
Fix Released
|
Undecided
|
Unassigned | ||
Ubuntu Cloud Archive |
Invalid
|
Undecided
|
Unassigned | ||
Queens |
Fix Released
|
Undecided
|
Unassigned | ||
nova (Ubuntu) |
Invalid
|
Undecided
|
Unassigned | ||
Bionic |
Fix Released
|
High
|
Unassigned |
Bug Description
Original reported in RH Bugzilla: https:/
Tested on OSP12 (Pike), but appears to be still present on master. Should only occur if nova compute is configured to use local file instance storage.
Create instance A on compute X
Resize instance A to compute Y
Domain is powered off
/var/
Domain is *not* undefined
On compute X:
update_
First action is to update self
rt calls driver.
...calls _get_disk_
...iterates over all defined domains, including the ones whose disks we renamed
...fails because a referenced disk no longer exists
Results in errors in nova-compute.log:
2018-05-30 02:17:08.647 1 ERROR nova.compute.
2018-05-30 02:17:08.647 1 ERROR nova.compute.
2018-05-30 02:17:08.647 1 ERROR nova.compute.
2018-05-30 02:17:08.647 1 ERROR nova.compute.
2018-05-30 02:17:08.647 1 ERROR nova.compute.
2018-05-30 02:17:08.647 1 ERROR nova.compute.
2018-05-30 02:17:08.647 1 ERROR nova.compute.
2018-05-30 02:17:08.647 1 ERROR nova.compute.
2018-05-30 02:17:08.647 1 ERROR nova.compute.
2018-05-30 02:17:08.647 1 ERROR nova.compute.
2018-05-30 02:17:08.647 1 ERROR nova.compute.
2018-05-30 02:17:08.647 1 ERROR nova.compute.
2018-05-30 02:17:08.647 1 ERROR nova.compute.
2018-05-30 02:17:08.647 1 ERROR nova.compute.
2018-05-30 02:17:08.647 1 ERROR nova.compute.
2018-05-30 02:17:08.647 1 ERROR nova.compute.
2018-05-30 02:17:08.647 1 ERROR nova.compute.
And resource tracker is no longer updated. We can find lots of these in the gate.
Note that change Icec2769bf42455
=======
[Impact]
See above
[Test Plan]
Deploy Openstack Queens with one compute node.
Create a VM instance. Eg:
openstack server create --wait --image $image_name --flavor $flavor --key-name testkey --nic net-id=${net_id} test-instance-1234
Get the details for that instance and copy the instance_name. Eg:
openstack server show test-instance-1234 -c OS-EXT-
Get the disk location used based on the instance name we retrieved before. Eg:
disk_location=`juju run -a nova-compute -- virsh domblklist $var_name | grep nova | awk -v N=2 '{print $N}'`
Move that file in a different location. Eg:
juju run -a nova-compute -- mv $disk_location "$disk_
Check the nova compute logs on the compute node for a warning. Eg:
juju run -a nova-compute -- grep "DiskNotFound" /var/log/
The output should look like the following:
```
2021-09-22 11:07:46.009 26176 WARNING nova.virt.
```
[Where problems could occur]
Users which were relying on an error could be affected.
Changed in nova: | |
status: | New → Confirmed |
importance: | Undecided → Medium |
assignee: | nobody → jichenjc (jichenjc) |
Changed in nova: | |
status: | Confirmed → In Progress |
Changed in nova: | |
assignee: | jichenjc (jichenjc) → Lee Yarwood (lyarwood) |
Changed in nova: | |
assignee: | Lee Yarwood (lyarwood) → jichenjc (jichenjc) |
Changed in nova: | |
assignee: | jichenjc (jichenjc) → Lee Yarwood (lyarwood) |
Changed in nova: | |
assignee: | Lee Yarwood (lyarwood) → Vladyslav Drok (vdrok) |
Changed in nova: | |
assignee: | Vladyslav Drok (vdrok) → Lee Yarwood (lyarwood) |
Changed in nova: | |
assignee: | Lee Yarwood (lyarwood) → Vladyslav Drok (vdrok) |
Changed in nova: | |
assignee: | Vladyslav Drok (vdrok) → Matthew Booth (mbooth-9) |
description: | updated |
tags: | added: verification-done-bionic verification-needed |
Any news regarding this issue? I've read the whole history here and on RedHat's Bugzilla and I assume that this issue is not fixed yet, am I correct? We are experiencing a quite significant level of log records regarding this issue. Please update info here, thank you.