juju-upgrade from 2.2.9 to 2.3.2 fails with state changing too quickly
Affects | Status | Importance | Assigned to | Milestone | |
---|---|---|---|---|---|
Canonical Juju |
Fix Released
|
High
|
John A Meinel | ||
2.2 |
Won't Fix
|
Undecided
|
Unassigned | ||
2.3 |
Fix Released
|
Critical
|
John A Meinel |
Bug Description
# Cloud
MAAS 2.1.5
# Issue
Upgrading our juju environment (3x HA controllers) from 2.1.2 -> 2.2.9 -> 2.3.2 fails during the last upgrade which renders the controllers and juju unusable. The upgrade from 2.1.2 -> 2.2.9 went seemingly fine, however the upgrade from 2.2.9 -> 2.3.2 failed.
# How to reproduce
We manage to reproduce this in our testing environment, bootstrapping a clean 2.1.2 environment, deploying OpenStack, then doing the same upgrade steps (see steps below).
## With a Juju 2.1.2 client
* juju bootstrap devmaas --bootstrap-
* juju enable-ha --constraints tags=juju-
* juju deploy bundle.yaml # deploy custom openstack HA bundle
* juju upgrade-charm ceph-mon # upgrade all openstack-charmers charms to handle new juju version
* juju upgrade-charm ...
## Switch to a Juju 2.3.2 client
* juju upgrade-juju -m controller # upgrade controllers to 2.2.9
* juju upgrade-juju # upgrade default model containing the openstack deployment to 2.2.9
Both the juju controller and openstack model are upgraded to 2.2.9 at this point and work as expected.
* juju upgrade-juju -m controller # try to upgrade controllers to 2.3.2
This last step renders the controllers and juju unusable.
# Errors
## An excerpt from machine-0.log on controller 0
...
2018-01-30 11:15:22 WARNING juju.worker.
2018-01-30 11:15:22 ERROR juju.worker.
2018-01-30 11:15:23 ERROR juju.worker runner.go:392 exited "leadership": upgrading legacy lease documents: state changing too quickly; try again soon
2018-01-30 11:15:23 ERROR juju.worker runner.go:392 exited "singular": upgrading legacy lease documents: state changing too quickly; try again soon
...
## Juju controller not responding from client
12:36 $ juju status
ERROR could not fetch leaders: upgrading legacy lease documents: state changing too quickly; try again soon (excessive contention)
# Logs
See attached logs for a collection of the machine logs from the controllers. These logs are from a clean reproduction running through the steps from # How to reproduce
Changed in juju: | |
milestone: | none → 2.4-beta1 |
assignee: | nobody → John A Meinel (jameinel) |
Changed in juju: | |
status: | Fix Committed → Fix Released |
This could potentially go hand-in-hand with bug # 1742604.
Symptoms are different but difficulty to upgrade are tear-inducing.