subordinate juju agent does not start when principal is very busy
Affects | Status | Importance | Assigned to | Milestone
---|---|---|---|---
Canonical Juju | Fix Released | Medium | Andrew Wilkins |
OpenStack HA Cluster Charm | Invalid | Undecided | Unassigned |
OpenStack Keystone Charm | Triaged | Medium | Unassigned |
OpenStack Percona Cluster Charm | Fix Released | Critical | David Ames |
Telegraf Charm | Invalid | Undecided | Unassigned |
Bug Description
When a principal unit is very busy immediately after deployment, its subordinates may never get a Juju agent started.
=====
juju version: 2.2.4
percona-cluster charm: cs:percona-
=Description=
This is a bundle using percona-cluster and hacluster:
http://
It gets stuck, so far for about an hour, in a state where hacluster-pxc is not set up, because the hacluster unit on the mysql/0 host never gets to run the hooks that would set it up:
http://
The mysql/0 unit fires hooks continuously. Here is its log:
http://
=To Reproduce=
Deploy the bundle shown above. I don't yet know for sure whether it happens every time.
=Partial Workaround=
I have been able to work around this in a manual deployment by stopping the jujud-unit-mysql-0 service, letting the hacluster charm do its thing, then restarting the service. Unfortunately, this is difficult to automate.
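The manual steps could be sketched roughly as follows. This is only an illustration, not a tested automation: it assumes the unit agent runs as a systemd service (so `systemctl` is available on the machine), that `juju ssh` can reach the unit, and that a fixed pause is long enough for the hacluster hooks to run. The helper names and the `pause_seconds` value are hypothetical, and detecting when the subordinate has actually finished is the part that remains hard to automate.

```python
import shlex
import subprocess
import time


def agent_service(unit):
    """Map a unit name such as 'mysql/0' to its agent service name."""
    app, number = unit.split("/")
    return f"jujud-unit-{app}-{number}"


def workaround_commands(unit):
    """Return the stop and start commands for the principal's unit agent.

    Assumption: the agent is managed by systemd on the target machine.
    """
    service = agent_service(unit)
    return [
        ["juju", "ssh", unit, f"sudo systemctl stop {service}"],
        ["juju", "ssh", unit, f"sudo systemctl start {service}"],
    ]


def apply_workaround(unit, pause_seconds=600, dry_run=True):
    """Stop the principal's agent, wait, then start it again.

    With dry_run=True the commands are only printed, never executed.
    The fixed pause is a guess, not a signal that hacluster is done.
    """
    stop, start = workaround_commands(unit)
    for step, command in (("stop", stop), ("start", start)):
        print(shlex.join(command))
        if not dry_run:
            subprocess.run(command, check=True)
            if step == "stop":
                time.sleep(pause_seconds)
```

Calling `apply_workaround("mysql/0")` only prints the two `juju ssh` commands; passing `dry_run=False` would run them against the current model.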
tags: | added: foundations-engine |
tags: | added: cdo-qa |
tags: | added: cdo-qa-blocker |
description: | updated |
description: | updated |
tags: | added: uosci |
Changed in charm-percona-cluster: | |
status: | Incomplete → In Progress |
importance: | Undecided → Critical |
assignee: | nobody → David Ames (thedac) |
milestone: | none → 17.11 |
summary: |
- subordinate juju agent does not start when principle is very busy
+ subordinate juju agent does not start when principal is very busy |
Changed in charm-percona-cluster: | |
status: | In Progress → Fix Committed |
description: | updated |
Changed in juju: | |
status: | New → Triaged |
importance: | Undecided → Medium |
Changed in juju: | |
milestone: | none → 2.3-rc1 |
assignee: | nobody → Andrew Wilkins (axwalk) |
status: | Triaged → In Progress |
Changed in juju: | |
milestone: | 2.3-rc1 → 2.3-beta3 |
Changed in juju: | |
status: | In Progress → Fix Committed |
Changed in juju: | |
status: | Fix Committed → Fix Released |
Changed in charm-keystone: | |
milestone: | 18.02 → 18.05 |
Changed in charm-keystone: | |
milestone: | 18.05 → 18.08 |
Changed in charm-keystone: | |
milestone: | 18.08 → 18.11 |
Changed in charm-keystone: | |
milestone: | 18.11 → 19.04 |
Changed in charm-keystone: | |
milestone: | 19.04 → 19.07 |
Changed in charm-keystone: | |
milestone: | 19.07 → 19.10 |
Changed in charm-keystone: | |
milestone: | 19.10 → 20.01 |
Changed in charm-keystone: | |
milestone: | 20.01 → 20.05 |
no longer affects: | charm-telegraf |
Changed in charm-telegraf: | |
status: | New → Invalid |
Changed in charm-keystone: | |
milestone: | 20.05 → 20.08 |
Changed in charm-keystone: | |
milestone: | 20.08 → none |
Hi Jason.
To narrow down the issue, would it be possible to test without telegraf? Looking at the mysql hook executions from your pastebin, I see ~60 telegraf units with a relation to mysql ( http://paste.ubuntu.com/25563847/ )