Resource Tracker performance with Ironic driver
Affects | Status | Importance | Assigned to | Milestone | |
---|---|---|---|---|---|
OpenStack Compute (nova) |
Fix Released
|
High
|
Eric Fried | ||
Rocky |
Fix Committed
|
High
|
Eric Fried | ||
Stein |
Fix Committed
|
High
|
Eric Fried |
Bug Description
The problem is in rocky.
The resource tracker builds the resource provider tree and it's updated 2 times in "_update_
With "_init_
The problem is that the RP tree will contain all the ironic RP and all the tree is flushed to placement (2 times as described above) when the periodic task iterate per Ironic RP.
In our case with 1700 ironic nodes, the period task takes:
1700 x (2 x 7s) = ~6h
+++
mitigations:
- shard nova-compute. Have several nova-computes dedicated to ironic.
Most of the current deployments only use 1 nova-compute to avoid resources shuffle/recreation between nova-computes.
Several nova-computes will be need to accommodate the load.
- why do we need to do the full resource provider tree flush to placement and not only the RP that is being considered?
As a work around we are doing this now!
Changed in nova: | |
importance: | Undecided → High |
tags: | added: ironic performance resource-tracker |
Fix proposed to branch: master /review. openstack. org/637225
Review: https:/