[10.0 swarm] task "rabbitmq" fails
Affects | Status | Importance | Assigned to | Milestone | |
---|---|---|---|---|---|
Mirantis OpenStack |
Confirmed
|
High
|
MOS Oslo |
Bug Description
Detailed bug description:
The issue was found by https:/
Steps to reproduce:
1. Create a cluster
2. Add 3 nodes with controller role
3. Add 1 node with compute role
4. Deploy the cluster
2017-02-27 01:32:22 INFO [17257] Cluster[]: All nodes are finished. Failed tasks: Task[rabbitmq/5] Stopping the deployment process!
Additional info: rabbitmq cluster seems alive http://
And this bug doesn't look similar to https:/
Changed in mos: | |
status: | New → Confirmed |
assignee: | nobody → MOS Oslo (mos-oslo) |
milestone: | none → 10.0 |
tags: | added: area-oslo |
tags: |
added: swarm-blocker removed: swarm-fail |
tags: |
added: swarm-fail removed: swarm-blocker |
Changed in mos: | |
milestone: | 10.0 → 9.x-updates |
puppet manifest was adding rabbit users right at the moment when pacemaker decided to restart some rabbits. Funny thing is that adding users in puppet is a useless operation in presence of pacemaker: those created users will be lost during resets/joins performed by OCF script.
We need to disable this user-creation activity completely - and the ONLY thing that puppet should do is to install package and drop 2 config files into their proper locations (i.e. no user management, no (re)starting/ stopping/ enabling of systemd unit, etc.)