pacemaker-controld crash on double free
Affects | Status | Importance | Assigned to | Milestone
---|---|---|---|---
pacemaker (Ubuntu) | Fix Released | Undecided | Unassigned |
Jammy | Fix Released | Undecided | Michał Małoszewski |
Bug Description
[Impact]
* pacemaker-controld is Pacemaker's coordinator: it maintains a consistent view of the cluster membership and orchestrates all the other components.
* Users of mysql clusters migrating from bionic to jammy reported a crash.
* This crash is caused by lrmd_dispatch_
[Test Plan]
lxc launch ubuntu:22.04 node1
lxc shell node1
apt update && apt dist-upgrade -y
apt install pcs mysql-server resource-agents -y
echo hacluster:hacluster | chpasswd
mysql -e "CREATE USER 'replicator'
mysql -e "GRANT RELOAD, PROCESS, SUPER, REPLICATION SLAVE, REPLICATION CLIENT ON *.* TO 'replicator'
systemctl disable mysql.service
systemctl stop mysql.service
exit
lxc copy node1 node2
lxc start node2
lxc shell node1
pcs host auth node1 node2 -u hacluster -p hacluster
pcs cluster setup --force mysqlclx node1 node2 transport udpu
pcs cluster enable --all
pcs cluster start --all
pcs property set stonith-
pcs property set no-quorum-
pcs resource create p_mysql ocf:heartbeat:mysql \
replication
test_user=root \
op demote interval=0s timeout=120 monitor interval=20 timeout=30 monitor \
interval=10 role=Master timeout=30 monitor interval=30 role=Slave timeout=30 \
notify interval=0s timeout=90 promote interval=0s timeout=120 start \
interval=0s timeout=120 stop interval=0s timeout=120 meta notify=true
pcs resource promotable p_mysql p_mysql-master notify=true
Example of failed output:
A crash file is present in /var/crash/ on at least one of the nodes.
Example of successful output:
No crash file at /var/crash/.
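
The pass/fail check above can be scripted. The following is a minimal sketch, not part of the original test plan: `has_crash_files` is a hypothetical helper name, and it assumes that Apport crash reports in /var/crash/ carry a `.crash` suffix.

```shell
# Hypothetical helper: exit 0 if DIR holds at least one *.crash file, else exit 1.
# DIR defaults to /var/crash, the location named in the test plan.
# Assumption: Apport crash reports use the .crash suffix.
has_crash_files() {
    dir="${1:-/var/crash}"
    for f in "$dir"/*.crash; do
        # An unmatched glob stays literal, so confirm the file really exists.
        [ -e "$f" ] && return 0
    done
    return 1
}
```

Run inside each node, for example: `if has_crash_files; then echo "FAIL: crash file found"; else echo "PASS: no crash file"; fi`.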
[Where problems could occur]
* The patch modifies only the lrmd code, so regressions should be limited to the behavior of lrmd.
* Since the code changes affect event dispatching and memory allocation, potential regressions would most likely appear in those areas.
-------
After migrating a mysql cluster from bionic to jammy (pacemaker 2.1.2-1ubuntu3), pacemaker started to malfunction because pacemaker-controld crashes. It is easy to reproduce by putting the promoted node into standby.
The Apport crash report is attached; this is the same bug that was reported to Red Hat at https:/
It was fixed in this commit: https:/
Please provide an update for pacemaker, because it is unusable this way.
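
The reproduction step in the report (a standby of the promoted node) might look like the sketch below. It is an illustration only: `standby_cycle` is a hypothetical helper, the `PCS` and `SETTLE` variables are assumptions added to allow a dry run, and the promoted node's name has to be read from `pcs status` first.

```shell
# Sketch: put a node into standby, wait for the cluster to react, then bring it back.
# PCS may be overridden (e.g. PCS=echo) for a dry run; SETTLE is the wait in seconds.
# Both variables are hypothetical conveniences, not pcs features.
standby_cycle() {
    node="$1"
    pcs_cmd="${PCS:-pcs}"
    "$pcs_cmd" node standby "$node" || return 1
    sleep "${SETTLE:-30}"
    "$pcs_cmd" node unstandby "$node"
}
```

For example, if `pcs status` shows node1 as promoted, run `standby_cycle node1` and then look for a crash file in /var/crash/ on each node.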
Related branches
- git-ubuntu bot: Approve
- Christian Ehrhardt (community): Approve
- Canonical Server Reporter: Pending requested
Diff: 68 lines (+46/-0), 3 files modified
- debian/changelog (+8/-0)
- debian/patches/jammy-avoid-double-free-during-notify-operation.patch (+37/-0)
- debian/patches/series (+1/-0)
Changed in pacemaker (Ubuntu Jammy):
assignee: nobody → Michał Małoszewski (michal-maloszewski99)
description: updated
tags: added: verification-done; removed: verification-needed
Thanks for taking the time to report this bug.
The issue was introduced in 2.1.2 by https://github.com/ClusterLabs/pacemaker/commit/31c7fa8a3a9c72c05bafdac1841c1c0c5f003797.
As mentioned, it was fixed in 2.1.3 by https://github.com/ClusterLabs/pacemaker/commit/ed8b2c86ab77aaa3d7fd688c049ad5e1b922a9c6.
Therefore, only jammy is affected.
A workaround suggested in the Red Hat bug is to remove the notify=true entry.