"nova-manage cell_v2 discover_hosts" failed, causing ncc status to temporarily go into error status
Bug #1720846 reported by
Jason Hobbs
This bug affects 2 people
Affects | Status | Importance | Assigned to | Milestone | |
---|---|---|---|---|---|
OpenStack Nova Cloud Controller Charm |
Confirmed
|
High
|
Corey Bryant |
Bug Description
In a deploy of openstack-base, the the n-c-c unit temporarily went into error state:
http://
2017-09-30 10:35:40 DEBUG juju.worker.uniter agent.go:17 [AGENT-STATUS] error: hook failed: "cloud-
imo, charms shouldn't enter an error state if they can recover on their own. To an external observer (juju wait in this case) this is indistinguishable from a fatal error, because there is no way to know how long to wait for the error to correct itself.
I've attached a crashdump from the test run.
tags: | added: cpe-onsite |
Changed in charm-nova-cloud-controller: | |
status: | Triaged → In Progress |
Changed in charm-nova-cloud-controller: | |
status: | New → Confirmed |
To post a comment you must log in.
FYI, we disable failed hook retries in our CI tests. We believe that the charm should never enter a failed/error state. So any time the application unit workload state is error, we see that as a bug.
As charmers, we don't want to rely on juju's retries to save us. It's nice that it will/might, but we don't lean on that by design.