Reproducible crash of Percona XtraDB Cluster using large updates
Affects | Status | Importance | Assigned to | Milestone
---|---|---|---|---
Galera | New | Undecided | Unassigned |
Percona XtraDB Cluster (moved to https://jira.percona.com/projects/PXC) | Status tracked in 5.6 | | |
5.5 | Invalid | Medium | Unassigned |
5.6 | Invalid | Medium | Unassigned |
Bug Description
We operate a Percona XtraDB Cluster of three servers, version: 5.5.34-31.1, Percona XtraDB Cluster (GPL), Release 31.1, wsrep_25.9.r3928
One of the databases has a table with the following structure:
CREATE TABLE `docStatsDetail` (
`date` date NOT NULL,
`adId` varchar(36) COLLATE utf8_slovenian_ci NOT NULL,
`userId` varchar(100) COLLATE utf8_slovenian_ci DEFAULT NULL,
`adImpressions` int(11) DEFAULT '0',
`adClicks` int(11) DEFAULT '0',
`ts` timestamp NOT NULL DEFAULT CURRENT_TIMESTAMP ON UPDATE CURRENT_TIMESTAMP,
PRIMARY KEY (`date`,`adId`),
KEY `TS` (`ts`),
KEY `_adId` (`adId`),
KEY `_userId` (`userId`)
) ENGINE=InnoDB DEFAULT CHARSET=utf8 COLLATE=
The table has 120 million records. If we try to delete more than 20 million records from it using: DELETE FROM docStatsDetail LIMIT 20000000; the server crashes with the following crash report:
10:57:55 UTC - mysqld got signal 7 ;
This could be because you hit a bug. It is also possible that this binary
or one of the libraries it was linked against is corrupt, improperly built,
or misconfigured. This error can also be caused by malfunctioning hardware.
We will try our best to scrape up some info that will hopefully help
diagnose the problem, but since we have already crashed,
something is definitely wrong and this may fail.
Please help us make Percona XtraDB Cluster better by reporting any
bugs at https:/
key_buffer_
read_buffer_
max_used_
max_threads=102
thread_count=10
connection_count=10
It is possible that mysqld could use up to
key_buffer_size + (read_buffer_size + sort_buffer_
Hope that's ok; if not, decrease some variables in the equation.
Thread pointer: 0x5b7cdb0
Attempting backtrace. You can use the following information to find out
where mysqld died. If you see no messages after this, something went
terribly wrong...
stack_bottom = 7f31a953ae70 thread_stack 0x40000
/usr/sbin/
/usr/sbin/
/lib/x86_
/lib/x86_
/usr/lib/
/usr/lib/
/usr/lib/
/usr/lib/
/usr/lib/
/usr/sbin/
/usr/sbin/
/usr/sbin/
/usr/sbin/
/usr/sbin/
/usr/sbin/
/usr/sbin/
/usr/sbin/
/usr/sbin/
/usr/sbin/
/usr/sbin/
/lib/x86_
/lib/x86_
Trying to get some variables.
Some pointers may be invalid and cause the dump to abort.
Query (7f317c004b90): is an invalid pointer
Connection ID (thread ID): 48
Status: NOT_KILLED
You may download the Percona XtraDB Cluster operations manual by visiting
http://
in the manual which will help you identify the cause of the crash.
140116 11:57:55 mysqld_safe Number of processes running now: 0
140116 11:57:55 mysqld_safe WSREP: not restarting wsrep node automatically
140116 11:57:55 mysqld_safe mysqld from pid file /var/run/
The servers have the following settings:
# -------
# wsrep provider configuration: basic wsrep options
# -------
wsrep_on = 1
wsrep_debug = 0
wsrep_provider = /usr/lib/
wsrep_provider_
# An empty gcomm address is used only when the cluster is being bootstrapped. Don't use it later: it will form a new cluster!
#wsrep_
wsrep_cluster_
wsrep_cluster_name = bolha_cluster
wsrep_node_name = as-02.bolha.com
wsrep_node_address = ae-02
wsrep_sst_method = xtrabackup-v2
wsrep_sst_auth = "sstuser:mXXXXXX9"
wsrep_slave_threads = 8
wsrep_notify_cmd = /etc/mysql/
# Added by AP
wsrep_max_ws_rows = 128K
wsrep_max_ws_size = 4096M
# -------
# InnoDB
# -------
innodb_
innodb_
innodb_
innodb_
innodb_
innodb_
innodb_open_files = 131072
innodb_io_capacity = 1000
innodb_flush_method = O_DIRECT
# Added by AP
innodb_
innodb_
innodb_
innodb_
innodb_
The servers run on SSDs, and the error is reproducible on every node.
The servers are Linux-based:
Linux XXX 3.2.0-57-generic #87-Ubuntu SMP Tue Nov 12 21:35:10 UTC 2013 x86_64 x86_64 x86_64 GNU/Linux
lsb_release -a
No LSB modules are available.
Distributor ID: Ubuntu
Description: Ubuntu 12.04.4 LTS
Release: 12.04
Codename: precise
PLEASE HELP!
a) Looks like a crash in galera, so adding galera project here as well.
b) Also, this looks similar to https://bugs.launchpad.net/codership-mysql/+bug/1254179
c)
wsrep_max_ws_rows = 128K
wsrep_max_ws_size = 4096M
The large value of wsrep_max_ws_size may be causing issues here.
@Ales,
d) Does this happen only when a certain limit (in this case 20000000) is exceeded, as in DELETE FROM docStatsDetail LIMIT 20000000? If so, it looks more like c). Also, I see a buffer reserve/resize at the last leg of the crash.
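To put points c) and d) in numbers, here is a minimal Python sketch that expands the two configured writeset limits and compares them with the size of the failing DELETE. It assumes the usual option-file convention that the K/M/G suffixes are multiples of 1024. The helper names (parse_size, batches_needed, batched_delete_sql) are hypothetical and exist only for this illustration. Splitting the delete into batches is an assumption offered as a way to test d), not something confirmed in this report, and whether this release actually enforces the limits is not established here.

```python
import math

# Multipliers for the K/M/G suffixes used in MySQL option files (powers of 1024).
_SUFFIXES = {"K": 1024, "M": 1024**2, "G": 1024**3}


def parse_size(value: str) -> int:
    """Convert an option value such as '128K' or '4096M' to an integer."""
    value = value.strip()
    suffix = value[-1].upper()
    if suffix in _SUFFIXES:
        return int(value[:-1]) * _SUFFIXES[suffix]
    return int(value)


def batches_needed(total_rows: int, batch_rows: int) -> int:
    """Number of DELETE statements needed if each removes at most batch_rows rows."""
    return math.ceil(total_rows / batch_rows)


def batched_delete_sql(table: str, batch_rows: int) -> str:
    """One bounded DELETE; it would be repeated until it affects zero rows."""
    return f"DELETE FROM {table} LIMIT {batch_rows};"


# Values taken from the configuration and the failing statement in this report.
max_ws_rows = parse_size("128K")    # wsrep_max_ws_rows
max_ws_size = parse_size("4096M")   # wsrep_max_ws_size, in bytes
rows_to_delete = 20_000_000         # DELETE ... LIMIT 20000000

print(max_ws_rows)
print(max_ws_size)
print(batches_needed(rows_to_delete, max_ws_rows))
print(batched_delete_sql("docStatsDetail", max_ws_rows))
```

Under these assumptions the single DELETE touches roughly 150 times more rows than wsrep_max_ws_rows allows for, and the configured wsrep_max_ws_size works out to exactly 2^32 bytes. Running the bounded statement repeatedly, with varying batch sizes, would be one way to check whether the crash depends on the transaction size as asked in d).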