* When a rabbit node goes down, its status remains 'running'
in the mnesia db for a while, so a few retries (50 sec in total) are
required in order to kick and forget this node from the cluster.
This also requires +50 sec for the stop & demote action timeouts.
* The rabbit master score in the CIB is retained after the current
master is moved manually. This is wrong, and the score must be reset
ASAP for post-demote and post-stop as well.
* The demoted node must be kicked from the cluster by other nodes
on post-demote processing.
* Post-demote should stop the rabbit app at the node being demoted, as
this node should be kicked from the cluster by other nodes.
Instead, it stops the app at the *other* nodes and causes full
cluster downtime.
* The join check should be done only at post-start and not at
post-promote; otherwise the node being promoted may think it
is clustered with some node while the join check reports it as
already clustered with another one.
(The regression was caused by https://review.openstack.org/184671)
* Change the `hostname` call to `crm_node -n` via $THIS_PCMK_NODE
everywhere to ensure we are using the correct pacemaker node name.
* Handle empty values for OCF_RESKEY_CRM_meta_notify_* by reporting
the resource as not running. This will rerun the resource and
eventually restore its state.
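
The retry behaviour described in the first item could look roughly like the sketch below. It is not taken from the OCF script itself: the helper name retry_cmd, the split of the 50 sec into 10 tries of 5 sec, and the node name in the usage comment are all assumptions made for illustration.

```shell
#!/bin/sh
# Hypothetical helper (not from the actual OCF script): run a command up to
# $1 times, sleeping $2 seconds between attempts.
# Returns 0 as soon as the command succeeds, 1 if every attempt fails.
retry_cmd() {
    tries=$1
    delay=$2
    shift 2
    attempt=1
    while [ "$attempt" -le "$tries" ]; do
        # Run the remaining arguments as the command to retry.
        if "$@"; then
            return 0
        fi
        # Do not sleep after the final failed attempt.
        if [ "$attempt" -lt "$tries" ]; then
            sleep "$delay"
        fi
        attempt=$((attempt + 1))
    done
    return 1
}

# Assumed usage: 10 tries x 5 sec gives roughly the 50 sec total mentioned
# above. The node name is a placeholder.
# retry_cmd 10 5 rabbitmqctl forget_cluster_node "rabbit@node-2"
```

Because the retries can take up to the full window, the stop and demote timeouts have to grow by the same amount, which is why the first item asks for +50 sec.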
Reviewed: https://review.openstack.org/185044
Committed: https://git.openstack.org/cgit/stackforge/fuel-library/commit/?id=085fe8c5a2255d4274cdcee5c2a74c15c443c0db
Submitter: Jenkins
Branch: master
commit 085fe8c5a2255d4274cdcee5c2a74c15c443c0db
Author: Bogdan Dobrelya <email address hidden>
Date: Fri May 22 16:40:16 2015 +0200
Fix rabbit OCF demote/stop/promote actions
Closes-bug: #1436812
Closes-bug: #1455761
Change-Id: Ib01c1731b4f06e6b643a4bca845828f7db507ad3
Signed-off-by: Bogdan Dobrelya <email address hidden>