Failed upgrade, mixed up HA addresses
Affects | Status | Importance | Assigned to | Milestone | |
---|---|---|---|---|---|
OpenStack HA Cluster Charm |
Invalid
|
Undecided
|
Unassigned | ||
juju-core |
Fix Released
|
High
|
Ian Booth | ||
1.22 |
Fix Released
|
High
|
Tim Penhey | ||
1.24 |
Fix Released
|
High
|
Ian Booth | ||
hacluster (Juju Charms Collection) |
Invalid
|
Undecided
|
Unassigned |
Bug Description
After upgrading juju from 1.20.14 to 1.22.1, one mysql HA container a) failed to upgrade and b) had an ip address overwritten by the hacluster VIP address.
The mysql lxc's physically have those addresses:
mysql/0: 172.20.171.204
mysql/1: 172.20.172.11
mysql/2: 172.20.171.253
The HA-cluster VIP is set to 172.20.168.104.
After the upgrade, mysql/0 still is on the old juju version and has it's address set to the former VIP:
mysql/0:
machine: 0/lxc/0
mysql/1:
machine: 1/lxc/4
mysql/2:
machine: 2/lxc/4
Notes:
* the lxc's are still reachable under the original addresses, ie. the physical networking, sans HA of course, is still intact, eg.:
$ for h in 172.20.171.204 172.20.172.11 172.20.171.253 ; do ssh -l ubuntu $h hostname ; done
Warning: Permanently added '172.20.171.204' (ECDSA) to the list of known hosts.
juju-machine-
Connection to 172.20.171.204 closed.
Warning: Permanently added '172.20.172.11' (ECDSA) to the list of known hosts.
juju-machine-
Connection to 172.20.172.11 closed.
Warning: Permanently added '172.20.171.253' (ECDSA) to the list of known hosts.
juju-machine-
Connection to 172.20.171.253 closed.
* I've hit lp:1441478 during upgrade and applied the manual db hackery mentioned in comment #2
* Also, during the upgrade machine-1 lost DNS resolution for short time due to dhcp flakiness (unrelated)
description: | updated |
Changed in juju-core: | |
milestone: | none → 1.25.0 |
Changed in juju-core: | |
importance: | Undecided → High |
tags: | added: blocker |
Changed in juju-core: | |
status: | New → In Progress |
assignee: | nobody → Eric Snow (ericsnowcurrently) |
tags: | added: ha upgrade-juju |
Changed in juju-core: | |
assignee: | Eric Snow (ericsnowcurrently) → nobody |
status: | In Progress → Triaged |
no longer affects: | juju-core/1.22 |
Changed in juju-core: | |
assignee: | nobody → Ian Booth (wallyworld) |
status: | Triaged → Fix Committed |
Changed in juju-core: | |
status: | Fix Committed → Fix Released |
Changed in hacluster (Juju Charms Collection): | |
status: | New → Invalid |
Changed in charm-hacluster: | |
status: | New → Invalid |
Do you have any logs for machine 0/lxc/0 that we could take a look at?