Error causes deployments to fail
Affects | Status | Importance | Assigned to | Milestone | |
---|---|---|---|---|---|
Autopilot Log Analyser | Fix Committed | Undecided | Francis Ginther | | |
Canonical Juju | Fix Released | High | Christian Muirhead | | |
Landscape Server | New | Undecided | Francis Ginther | | |
Bug Description
As seen here:
http://
The test fails trying to acquire a lock.
util_test.go:136:
c.Assert(err, jc.ErrorIsNil)
... value *errors.Err = &errors.
... error stack:
github.
github.
github.
github.
[LOG] 0:08.862 DEBUG juju.api RPC connection died
Changed in autopilot-log-analyser: | |
status: | New → Fix Committed |
tags: | added: oil |
summary: |
+ Deployment fails and affects landscape `15-20% of the time, TestUniterSteadyStateUpgradeRelations: could not acquire lock |
summary: |
- Deployment fails and affects landscape `15-20% of the time, - TestUniterSteadyStateUpgradeRelations: could not acquire lock + Error causes deployments to fail |
Changed in juju: | |
milestone: | none → 2.1.1 |
assignee: | nobody → Ian Booth (wallyworld) |
Changed in juju: | |
assignee: | Ian Booth (wallyworld) → nobody |
Changed in juju: | |
assignee: | nobody → Christian Muirhead (2-xtian) |
tags: | added: cdo-qa-blocker |
tags: | added: landscape |
Changed in autopilot-log-analyser: | |
assignee: | nobody → Francis Ginther (fginther) |
Changed in juju: | |
status: | Triaged → Fix Committed |
Changed in landscape: | |
assignee: | nobody → Francis Ginther (fginther) |
Changed in juju: | |
status: | Fix Committed → Fix Released |
tags: | removed: cdo-qa-blocker |
I'm seeing instances of this error in our automated testing of Landscape autopilot openstack deployments. The most recent example was with juju 1:2.1~rc1, but I also have runs with 1:2.1~beta4. Also, I first found lp:1604915, which is duped to this bug. I've seen this error four times between beta4 and rc1.
The error message seen in one of the juju unit logs is:
[from build 5173 landscape-0-inner-logs/ceilometer-1/var/log/juju/unit-ceilometer-1.log]
2017-02-02 05:12:48 ERROR juju.worker.dependency engine.go:547 "leadership-tracker" manifold worker returned unexpected error: leadership failure: lease manager stopped
2017-02-02 05:12:48 ERROR juju.worker.uniter agent.go:28 resolver loop error: could not acquire lock: cancelled acquiring mutex
The juju-status message for that unit is also set to "resolver loop error". In each instance of hitting this, it appears to be a different application. The error causes the entire deployment to fail.
Builds associated with this failure:
- https://ci.lscape.net/job/landscape-system-tests/5263
- https://ci.lscape.net/job/landscape-system-tests/5173
- https://ci.lscape.net/job/landscape-system-tests/5139
- https://ci.lscape.net/job/landscape-system-tests/5132