[10.0 swarm] Network verification is not possible because some nodes are not available via mcollective
Bug #1673743 reported by
Vladimir Khlyunev
This bug report is a duplicate of:
Bug #1672964: Redeployment failed on tasks[system_provision].
Edit
Remove
This bug affects 1 person
Affects | Status | Importance | Assigned to | Milestone | |
---|---|---|---|---|---|
Fuel for OpenStack |
Invalid
|
High
|
Vladimir Sharshov |
Bug Description
ISO 1481
Scenario:
1. Create cluster
2. Add 3 node with controller role
3. Verify network
4. Deploy cluster
5. Stop deployment
6. Add 2 nodes with compute role
7. Re-deploy cluster
Result:
Deployment successful, but node-3 was not deployed:
[root@nailgun ~]# fuel node
...
3 | discover | slave-03_controller | 1 | 10.109.15.6 | 64:c8:c9:b1:3b:90 | controller |
[root@nailgun ~]# ssh node-3
ssh: Could not resolve hostname node-3: Name or service not known
[root@nailgun ~]# ssh 10.109.15.6
Warning: Permanently added '10.109.15.6' (ECDSA) to the list of known hosts.
root@bootstrap:~#
tags: | added: swarm-fail |
Changed in fuel: | |
status: | New → Confirmed |
tags: | added: area-python |
Changed in fuel: | |
assignee: | Fuel Sustaining (fuel-sustaining-team) → Vladimir Kozhukalov (kozhukalov) |
Changed in fuel: | |
assignee: | Vladimir Kozhukalov (kozhukalov) → Vladimir Sharshov (vsharshov) |
summary: |
- After deployment stop and restart node still in bootstrap/discover state + [10.0 swarm] Network verification is not possible because some nodes are + not available via mcollective |
To post a comment you must log in.
there is the error in the asute.log
2017-03-28 00:39:12 ERROR [17836] Error running RPC method verify_networks: Network verification not available because nodes ["1"] not available via mcollective, trace: share/gems/ gems/astute- 10.0.0/ lib/astute/ orchestrator. rb:218: in `validate_ nodes_access' ", share/gems/ gems/astute- 10.0.0/ lib/astute/ orchestrator. rb:170: in `check_dhcp'", share/gems/ gems/astute- 10.0.0/ lib/astute/ server/ dispatcher. rb:126: in `check_dhcp'", share/gems/ gems/astute- 10.0.0/ lib/astute/ server/ dispatcher. rb:110: in `block in verify_networks'", share/gems/ gems/astute- 10.0.0/ lib/astute/ server/ dispatcher. rb:108: in `each'", share/gems/ gems/astute- 10.0.0/ lib/astute/ server/ dispatcher. rb:108: in `verify_networks'", share/gems/ gems/astute- 10.0.0/ lib/astute/ server/ server. rb:172: in `dispatch_ message' ", share/gems/ gems/astute- 10.0.0/ lib/astute/ server/ server. rb:131: in `block in dispatch'", share/gems/ gems/astute- 10.0.0/ lib/astute/ server/ task_queue. rb:64:in `call'", share/gems/ gems/astute- 10.0.0/ lib/astute/ server/ task_queue. rb:64:in `block in each'", share/gems/ gems/astute- 10.0.0/ lib/astute/ server/ task_queue. rb:56:in `each'", share/gems/ gems/astute- 10.0.0/ lib/astute/ server/ task_queue. rb:56:in `each'", share/gems/ gems/astute- 10.0.0/ lib/astute/ server/ server. rb:128: in `each_with_index'", share/gems/ gems/astute- 10.0.0/ lib/astute/ server/ server. rb:128: in `dispatch'", share/gems/ gems/astute- 10.0.0/ lib/astute/ server/ server. rb:106: in `block in perform_main_job'"]
["/usr/
"/usr/
"/usr/
"/usr/
"/usr/
"/usr/
"/usr/
"/usr/
"/usr/
"/usr/
"/usr/
"/usr/
"/usr/
"/usr/
"/usr/
there are no errors in mcollective.log file on the node which was not available. Looks like the reason why this issue appears is that network connection is unstable (maybe due to high cpu load on the host at that time).