Dynamic corosync node removal must be safe
Affects | Status | Importance | Assigned to | Milestone | |
---|---|---|---|---|---|
Fuel for OpenStack |
Fix Committed
|
Medium
|
Dmitry Ilyin | ||
6.0.x |
Invalid
|
Undecided
|
Unassigned |
Bug Description
Current implementation of dynamic node removal with corosync-cmapctl is unsafe. There is a case when nodes marked in UI for removal, but deploy was run via CLI for all nodes (just for some reason). Given circumstances, pacemaker provider will issue nodes remove commands for CMAP on live nodes, running pacemaker with corosync, which is wrong and never should be allowed. Doing so, would end up with broken corosync cluster.
The solution is to check if the provider are about to delete its own node from CMAP and either to skip this action, or at least stop pacemaker and corosync locally prior to issue any remove actions to cmap tool
The complete solution could be to prohibit (or warn and skip) in CLI the deploy action for nodes marked for removal
Changed in fuel: | |
milestone: | none → 6.1 |
status: | New → Confirmed |
importance: | Undecided → Critical |
assignee: | nobody → Dmitry Ilyin (idv1985) |
description: | updated |
tags: | added: module-client pacemaker |
Changed in fuel: | |
milestone: | 7.0 → 6.1-updates |
status: | Confirmed → Fix Committed |
Related fix proposed to branch: master /review. openstack. org/182623
Review: https:/