RBR error on IST not zeroing grastate
Affects | Status | Importance | Assigned to | Milestone
---|---|---|---|---
Galera | Status tracked in 3.x | | |
2.x | Fix Committed | High | Yan Zhang |
3.x | Fix Released | High | Yan Zhang |
Percona XtraDB Cluster moved to https://jira.percona.com/projects/PXC | Status tracked in 5.6 | | |
5.5 | Fix Released | Undecided | Unassigned |
5.6 | Fix Released | Undecided | Unassigned |
Bug Description
130516 10:02:30 [Note] WSREP: SST received: f9ae5241-
130516 10:02:30 [Note] /usr/sbin/mysqld: ready for connections.
Version: '5.5.30' socket: '/var/lib/
.r3843
130516 10:02:30 [Note] WSREP: Receiving IST: 24484 writesets, seqnos 43045-67529
130516 10:02:30 [ERROR] Slave SQL: Could not execute Delete_rows event on table test.sbtest1; Can't find record in 'sbtest1', Error_code: 1032; handler error HA_ERR_
s 1193, Error_code: 1032
130516 10:02:30 [Warning] WSREP: RBR event 6 Delete_rows apply warning: 120, 43046
130516 10:02:30 [ERROR] WSREP: receiving IST failed, node restart required: Failed to apply app buffer: seqno: 43046, status: WSREP_FATAL
at galera/
at galera/
I was able to get a node stuck in this state: on every restart it retried IST and hit the same error. grastate.dat was not being zeroed after the failure, so the node kept requesting IST instead of falling back to a full SST.
[root@perconadbt mysql]# rpm -qa | grep -i percona
percona-
Percona-
Percona-
percona-
Percona-
Percona-
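To tell whether a node is in the stuck state described above, it can help to inspect grastate.dat before restarting. The following is a minimal sketch, assuming the usual `key: value` layout of that file and assuming that "zeroed" means an all-zero UUID with seqno -1. The helper names `parse_grastate` and `is_zeroed` are hypothetical, and the UUID in the usage example is a placeholder, not the one from the log.

```python
from dataclasses import dataclass

# Assumption: a zeroed (invalidated) state is an all-zero UUID with seqno -1.
ZERO_UUID = "00000000-0000-0000-0000-000000000000"
UNDEFINED_SEQNO = -1


@dataclass
class GraState:
    uuid: str
    seqno: int


def parse_grastate(text: str) -> GraState:
    """Parse the 'key: value' lines of a grastate.dat file (hypothetical helper)."""
    fields = {}
    for line in text.splitlines():
        line = line.strip()
        if not line or line.startswith("#"):
            continue  # skip blank lines and the comment header
        key, _, value = line.partition(":")
        fields[key.strip()] = value.strip()
    return GraState(uuid=fields["uuid"], seqno=int(fields["seqno"]))


def is_zeroed(state: GraState) -> bool:
    """True if the saved state has been invalidated, so IST cannot be requested."""
    return state.uuid == ZERO_UUID and state.seqno == UNDEFINED_SEQNO


if __name__ == "__main__":
    # Placeholder content; on a real node, read the file from the data directory.
    sample = (
        "# GALERA saved state\n"
        "uuid:    11111111-2222-3333-4444-555555555555\n"
        "seqno:   43045\n"
    )
    state = parse_grastate(sample)
    print("zeroed" if is_zeroed(state) else "still valid, node will retry IST")
```

If the file still carries a valid UUID and seqno after the failed IST, the node will ask for IST again on the next start, which matches the loop reported here.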
Changed in percona-xtradb-cluster: milestone: 5.5.31-25 → 5.5.31-24.8
Changed in percona-xtradb-cluster: milestone: 5.5.31-23.7.5 → 5.5.31-25
Changed in galera: milestone: 23.2.6 → 23.2.7
Changed in percona-xtradb-cluster: milestone: 5.5.33-23.7.6 → future-5.5
This seems to be a Galera bug: grastate invalidation code does not cover all code paths.
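One way to make invalidation cover every failure path is to invalidate the saved state before applying anything and restore it only after the last writeset succeeds. The sketch below illustrates that pattern in Python with an in-memory state. It is not Galera's actual code or its actual fix, and `SavedState`, `ApplyError`, and `receive_ist` are hypothetical names chosen for illustration.

```python
ZERO_UUID = "00000000-0000-0000-0000-000000000000"


class ApplyError(Exception):
    """Raised by the (hypothetical) applier when a writeset cannot be applied."""


class SavedState:
    """In-memory stand-in for the persisted state (hypothetical)."""

    def __init__(self, uuid, seqno):
        self.uuid = uuid
        self.seqno = seqno

    def invalidate(self):
        # Zero the state so the next start cannot request IST from it.
        self.uuid = ZERO_UUID
        self.seqno = -1


def receive_ist(state, writesets, apply):
    """Apply (seqno, writeset) pairs; the state stays zeroed on any failure."""
    group_uuid, last_applied = state.uuid, state.seqno
    # Invalidate first: every exit path below that does not reach the end
    # (exception, early return, crash) leaves the state zeroed.
    state.invalidate()
    for seqno, writeset in writesets:
        apply(seqno, writeset)  # may raise ApplyError
        last_applied = seqno
    # Reached only if every writeset was applied successfully.
    state.uuid = group_uuid
    state.seqno = last_applied
```

With this ordering there is no per-error-path invalidation call to forget: a failure at seqno 43046, as in the log above, would leave the state zeroed without any extra handling.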