mariadb multinode upgrade broken ocata-pike
Affects | Status | Importance | Assigned to | Milestone
---|---|---|---|---
kolla-ansible | Fix Released | Critical | Unassigned |
Ocata | Triaged | Critical | Unassigned |
Pike | Fix Released | Critical | Unassigned |
Bug Description
While upgrading from Ocata to master (Pike) in a multinode environment, the mariadb containers keep restarting.
170522 12:33:48 mysqld_safe Starting mysqld daemon with databases from /var/lib/mysql/
170522 12:33:48 mysqld_safe WSREP: Running position recovery with --log_error=
nohup: ignoring input
170522 12:33:48 [Note] /usr/sbin/mysqld (mysqld 10.0.30-
170522 12:33:51 mysqld_safe WSREP: Recovered position 7ca85504-
170522 12:33:51 [Note] WSREP: Read nil XID from storage engines, skipping position init
170522 12:33:51 [Note] WSREP: wsrep_load(): loading provider library '/usr/lib64/
170522 12:33:51 [Note] /usr/sbin/mysqld (mysqld 10.0.30-
170522 12:33:51 [Note] WSREP: wsrep_load(): Galera 25.3.19(r3667) by Codership Oy <email address hidden> loaded successfully.
170522 12:33:51 [Note] WSREP: CRC-32C: using hardware acceleration.
170522 12:33:51 [Note] WSREP: Found saved state: 7ca85504-
170522 12:33:51 [Note] WSREP: Passing config to GCS: base_dir = /var/lib/mysql/; base_host = 192.168.100.186; base_port = 4567; cert.log_conflicts = no; debug = no; evs.auto_evict = 0; evs.delay_margin = PT1S; evs.delayed_
170522 12:33:51 [Note] WSREP: GCache history reset: old(7ca85504-
170522 12:33:51 [Note] WSREP: Assign initial position for certification: 7, protocol version: -1
170522 12:33:51 [Note] WSREP: wsrep_sst_grab()
170522 12:33:51 [Note] WSREP: Start replication
170522 12:33:51 [Note] WSREP: Setting initial position to 7ca85504-
170522 12:33:51 [Note] WSREP: protonet asio version 0
170522 12:33:51 [Note] WSREP: Using CRC-32C for message checksums.
170522 12:33:51 [Note] WSREP: backend: asio
170522 12:33:51 [Note] WSREP: gcomm thread scheduling priority set to other:0
170522 12:33:51 [Warning] WSREP: access file(/var/
170522 12:33:51 [Note] WSREP: restore pc from disk failed
170522 12:33:51 [Note] WSREP: GMCast version 0
170522 12:33:51 [Note] WSREP: (8783922d, 'tcp://
170522 12:33:51 [Note] WSREP: (8783922d, 'tcp://
170522 12:33:51 [Note] WSREP: EVS version 0
170522 12:33:51 [Note] WSREP: gcomm: connecting to group 'openstack', peer '192.168.
170522 12:33:51 [Note] WSREP: (8783922d, 'tcp://
170522 12:33:51 [Note] WSREP: (8783922d, 'tcp://
170522 12:33:51 [Note] WSREP: declaring 87cf69fd at tcp://192.
170522 12:33:51 [Warning] WSREP: no nodes coming from prim view, prim not possible
170522 12:33:51 [Note] WSREP: view(view_
8783922d,0
87cf69fd,0
} joined {
} left {
} partitioned {
})
170522 12:33:55 [Note] WSREP: (8783922d, 'tcp://
170522 12:34:22 [ERROR] WSREP: failed to open gcomm backend connection: 110: failed to reach primary view: 110 (Connection timed out)
at gcomm/src/
170522 12:34:22 [ERROR] WSREP: gcs/src/
170522 12:34:22 [ERROR] WSREP: gcs/src/
170522 12:34:22 [ERROR] WSREP: gcs connect failed: Connection timed out
170522 12:34:22 [ERROR] WSREP: wsrep::
170522 12:34:22 [ERROR] Aborting
170522 12:34:22 [Note] WSREP: Service disconnected.
170522 12:34:23 [Note] WSREP: Some threads may fail to exit.
170522 12:34:23 [Note] /usr/sbin/mysqld: Shutdown complete
170522 12:34:23 mysqld_safe mysqld from pid file /var/lib/
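For anyone triaging a similar log, here is a minimal sketch (not part of kolla-ansible) that pulls the timestamped `[Level]` lines out of a mysqld log and reports how long the node waited between "Start replication" and the first `[ERROR]`. The function names and the `SAMPLE` excerpt are made up for illustration; the sample lines are copied from the log above. Lines without a bracketed level (for example the `mysqld_safe` lines) are simply skipped.

```python
import re
from datetime import datetime

# Matches lines such as: "170522 12:33:51 [Note] WSREP: Start replication"
LINE_RE = re.compile(r"^(\d{6} \d{2}:\d{2}:\d{2}) \[(\w+)\] (.*)$")

# Four lines copied from the log in this report.
SAMPLE = """\
170522 12:33:51 [Note] WSREP: Start replication
170522 12:33:51 [Warning] WSREP: no nodes coming from prim view, prim not possible
170522 12:34:22 [ERROR] WSREP: failed to open gcomm backend connection: 110: failed to reach primary view: 110 (Connection timed out)
170522 12:34:22 [ERROR] Aborting
"""


def parse_log(text):
    """Return (timestamp, level, message) tuples for lines with a [Level] tag."""
    entries = []
    for line in text.splitlines():
        match = LINE_RE.match(line.strip())
        if match:
            stamp = datetime.strptime(match.group(1), "%y%m%d %H:%M:%S")
            entries.append((stamp, match.group(2), match.group(3)))
    return entries


def seconds_to_first_error(entries):
    """Seconds between 'Start replication' and the first ERROR, or None."""
    start = next((t for t, _, msg in entries if "Start replication" in msg), None)
    error = next((t for t, lvl, _ in entries if lvl == "ERROR"), None)
    if start is None or error is None:
        return None
    return int((error - start).total_seconds())


if __name__ == "__main__":
    parsed = parse_log(SAMPLE)
    for stamp, level, message in parsed:
        if level in ("Warning", "ERROR"):
            print(stamp.time(), level, message)
    print("seconds until first error:", seconds_to_first_error(parsed))
```

On the excerpt above this reports 31 seconds between the start of replication and the gcomm connection timeout.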
Changed in kolla-ansible:
importance: Undecided → Critical
summary:
- mariadb multinode fail to upgrade ocata->pike
+ mariadb multinode fail to upgrade ocata->pike due missing serial
summary:
- mariadb multinode fail to upgrade ocata->pike due missing serial
+ mariadb multinode fail to upgrade ocata->pike
summary:
- optimize reconfiguration breaks mariadb upgrade
+ optimize reconfiguration breaks multinode mariadb upgrade
Tried several times today with Ansible 2.3 and Ansible 2.1.0.
I have not been able to find the root cause of this issue.
Debug logs for the deployment: http://paste.openstack.org/show/610628/
The mariadb package version is the same in both the Ocata and master images (mariadb 10.0.30).