Docs needed for tunables at large scale
Affects | Status | Importance | Assigned to | Milestone
---|---|---|---|---
OpenStack Compute (nova) | Confirmed | Undecided | Unassigned |
Bug Description
Every once in a while, configuration options that need to be tweaked at large scale come up in IRC (Blizzard, CERN, etc.). Once a deployment reaches hundreds or thousands of compute nodes, these options need to be changed to avoid killing the control plane.
One such option is this:
From a Blizzard operator:
(3:04:18 PM) eandersson: mriedem, we had to set heal_instance_
(3:05:41 PM) eandersson: It was getting real heavy on large sites with 1k nodes
(3:06:26 PM) eandersson: We also ended up adding a variance
Similarly, CERN had to totally disable this one:
They rely instead on SIGHUP or a restart of the service when they need to refresh that cache.
We should put these things in the admin docs as we come across them so we don't forget about this stuff when new operators/users come along and hit scaling issues.
Changed in nova: | |
assignee: | nobody → Takashi NATSUME (natsume-takashi) |
status: | New → In Progress |
Changed in nova: | |
assignee: | Takashi NATSUME (natsume-takashi) → nobody |
status: | In Progress → Confirmed |
tags: | added: doc removed: docs |
https://docs.openstack.org/nova/latest/configuration/config.html#DEFAULT.report_interval is another one, as are rpc_response_timeout and long_rpc_timeout.
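For illustration, these three options might appear in nova.conf roughly as sketched below. The values are placeholders chosen only to show the shape of the settings, not recommendations from any operator in this report, and the note about service_down_time refers to an option not discussed above.

```ini
[DEFAULT]
# Placeholder values for illustration only; tune against your own deployment.

# How often (seconds) each service reports its state. A larger value
# reduces periodic load from many compute nodes.
# Assumption: service_down_time (not mentioned in this report) is kept
# larger than report_interval so services are not wrongly marked down.
report_interval = 30

# How long (seconds) to wait for a reply to an RPC call.
rpc_response_timeout = 120

# Timeout (seconds) used for RPC calls that are expected to run long.
long_rpc_timeout = 1800
```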