Galera sst not finished before timeout of starting mysql in Pacemaker
Affects | Status | Importance | Assigned to | Milestone | |
---|---|---|---|---|---|
Fuel for OpenStack |
Fix Released
|
High
|
Sergii Golovatiuk | ||
6.1.x |
Fix Released
|
High
|
Rodion Tikunov | ||
7.0.x |
Fix Released
|
High
|
Rodion Tikunov | ||
8.0.x |
Fix Committed
|
High
|
Sergii Golovatiuk |
Bug Description
Description: If one of the nodes fail in the MySQL galera cluster there is sometimes the need for full state transfer which takes more than 10 minutes if the database is large e.g. when using Zabbix.
Current situation:
The mysql database is unable to start because it gets killed before full state transfer is finished.
Expected situation:
Instead of increasing the timeout we should do a special check to see whether a state transfer is still in progress if so notify pacemaker to not yet kill the mysql starting process. IF this is not possible we can consider changing timeout from 600 seconds to higher, but i suggest instead we decrease the timeout and use a specific check to see if SST is in process.
Steps to reproduce:
1. Create an HA environment
2. Import 10+Gb of data into the mysql database
3. Let one of the nodes fail on purpose and require full state transfer. Detailed steps for this:
kill mysqld_safe and mysqld process && rm /var/lib/mysql/* -rf
4. Wait for corosync/pacemaker to restart the mysql process on specified node.
5. The log on failed node should show: WSREP_SST: [INFO] Evaluating socat -u TCP-LISTEN:
Current workaround:
crm configure edit p_mysql
Chanage "op start interval=0 timeout=600"
to "op start interval=0 timeout=1600"
description: | updated |
Changed in fuel: | |
milestone: | none → 7.0 |
assignee: | nobody → Fuel Library Team (fuel-library) |
status: | New → Confirmed |
importance: | Undecided → High |
Changed in fuel: | |
assignee: | Fuel Library Team (fuel-library) → Sergii Golovatiuk (sgolovatiuk) |
Changed in fuel: | |
status: | Confirmed → In Progress |
Changed in fuel: | |
assignee: | Denis Egorenko (degorenko) → Sergii Golovatiuk (sgolovatiuk) |
tags: | added: on-verification |
no longer affects: | fuel/mitaka |
Changed in fuel: | |
milestone: | 7.0 → 9.0 |
tags: | added: area-library |
tags: | added: team-bugfix |
tags: | added: area-docs release-notes |
tags: | added: on-verification |
tags: | added: on-verification |
tags: | added: on-verification |
tags: |
added: release-notes-done removed: release-notes |
Assigned to Denis per discussion with Andrey Maksimov.