ironic: n-cpu fails to recover after losing connection to ironic-api and placement-api
Affects | Status | Importance | Assigned to | Milestone | |
---|---|---|---|---|---|
OpenStack Compute (nova) |
Fix Released
|
Low
|
Jim Rollenhagen |
Bug Description
The ironic virt driver does some crazy things when the ironic API goes down - it returns [] from get_available_
If placement is also down at this time, the resource providers will not be properly deleted.
When ironic-api and placement-api return, nova will see nodes, create compute_node records for them, and try to create new resource providers (as they are new compute_node records). This will fail with a name conflict, and the nodes will be unusable.
This is easy to fix, by raising an exception in get_available_
This may be fine but should have a larger discussion. We've added these hacks over the years for some reason, we should look at the bigger picture and decide how we want to handle these cases.
tags: | added: ironic placement |
Changed in nova: | |
assignee: | nobody → Jim Rollenhagen (jim-rollenhagen) |
status: | Triaged → In Progress |
There is a patch in progress for this: https:/ /review. openstack. org/#/c/ 545479/