[2.0 beta 2] Nodes fail to remain powered after Trusty commission with "Allow SSH" selected
Affects | Status | Importance | Assigned to | Milestone
---|---|---|---|---
MAAS | Won't Fix | Critical | Unassigned |
cloud-init | Fix Released | Medium | Unassigned |
cloud-init (Ubuntu) | Fix Released | Medium | Unassigned |
Trusty | Confirmed | Undecided | Unassigned |
Bug Description
Build Version/Date: MAAS 2.0 Beta2
Environment used for testing: Xenial
Summary:
When commissioning nodes with the "Allow SSH" option selected, at least 50% of nodes reach the "Ready" state but do not remain powered on
Steps to Reproduce:
Enlist 5+ nodes
Commission all nodes at once with "Allow SSH" selected
Expected result:
All nodes Ready and powered
Actual result:
50-75% of nodes are Ready but powered off
Syslog shows the following errors
Apr 14 19:03:41 donphan sh[28839]: 2016-04-14 19:03:41+0000 [RemoteOriginRe
Apr 14 19:05:37 donphan sh[28839]: 2016-04-14 19:05:37+0000 [RemoteOriginRe
Apr 14 19:05:37 donphan sh[28839]: 2016-04-14 19:05:37+0000 [RemoteOriginRe
Apr 14 19:07:12 donphan sh[28839]: 2016-04-14 19:07:12+0000 [RemoteOriginRe
Apr 14 19:07:48 donphan sh[28839]: 2016-04-14 19:07:48+0000 [RemoteOriginRe
Apr 14 19:08:08 donphan sh[28839]: 2016-04-14 19:08:08+0000 [RemoteOriginRe
Apr 14 19:11:37 donphan sh[28839]: 2016-04-14 19:11:37+0000 [RemoteOriginRe
Apr 14 19:11:44 donphan sh[28839]: 2016-04-14 19:11:44+0000 [RemoteOriginRe
Apr 14 19:11:47 donphan sh[28839]: 2016-04-14 19:11:47+0000 [RemoteOriginRe
Apr 14 19:12:24 donphan sh[28839]: 2016-04-14 19:12:24+0000 [RemoteOriginRe
Apr 14 19:12:24 donphan sh[28839]: 2016-04-14 19:12:24+0000 [RemoteOriginRe
Apr 14 19:13:14 donphan sh[28839]: 2016-04-14 19:13:14+0000 [RemoteOriginRe
Apr 14 19:13:17 donphan sh[28575]: Failure: twisted.
Apr 14 19:13:18 donphan sh[28575]: Failure: twisted.
Apr 14 19:43:41 donphan sh[28839]: 2016-04-14 19:43:41+0000 [RemoteOriginRe
Apr 14 19:43:50 donphan sh[28839]: 2016-04-14 19:43:50+0000 [RemoteOriginRe
Apr 14 19:43:57 donphan sh[28839]: 2016-04-14 19:43:57+0000 [RemoteOriginRe
Apr 14 19:44:05 donphan sh[28839]: 2016-04-14 19:44:05+0000 [RemoteOriginRe
Apr 14 19:44:06 donphan sh[28839]: 2016-04-14 19:44:06+0000 [RemoteOriginRe
Apr 14 19:45:10 donphan sh[28839]: 2016-04-14 19:45:10+0000 [RemoteOriginRe
Apr 14 19:46:19 donphan sh[28575]: #011twisted.
Apr 14 21:34:08 donphan sh[28839]: 2016-04-14 21:34:08+0000 [RemoteOriginRe
Apr 14 21:34:09 donphan sh[28839]: 2016-04-14 21:34:09+0000 [RemoteOriginRe
Apr 14 21:34:20 donphan sh[28839]: 2016-04-14 21:34:20+0000 [RemoteOriginRe
Apr 14 21:34:35 donphan sh[28839]: 2016-04-14 21:34:35+0000 [RemoteOriginRe
Apr 14 21:34:36 donphan sh[28839]: 2016-04-14 21:34:36+0000 [RemoteOriginRe
Apr 14 21:35:05 donphan sh[28575]: Failure: twisted.
Apr 14 21:35:46 donphan sh[28839]: 2016-04-14 21:35:46+0000 [RemoteOriginRe
Apr 14 21:36:51 donphan sh[28575]: #011twisted.
Apr 14 21:37:00 donphan sh[28575]: #011twisted.
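The excerpt above mixes output from two processes (sh[28839] and sh[28575]), and the lines are truncated, so it may help to tally them per PID before digging into the full log. A minimal sketch, assuming the standard syslog prefix shown above; the function name tally_by_pid and the "twisted"/"other" labels are hypothetical and only for illustration, not part of MAAS:

```python
import re
from collections import Counter

# Matches the syslog prefix seen in the report:
# "Apr 14 19:03:41 donphan sh[28839]: <message>"
LINE_RE = re.compile(
    r"^(?P<stamp>\w{3}\s+\d{1,2} \d{2}:\d{2}:\d{2}) (?P<host>\S+) "
    r"(?P<proc>[\w.-]+)\[(?P<pid>\d+)\]: (?P<msg>.*)$"
)


def tally_by_pid(lines):
    """Count log lines per (pid, kind), where kind is a coarse label."""
    counts = Counter()
    for line in lines:
        m = LINE_RE.match(line.strip())
        if m is None:
            continue  # skip lines that do not look like syslog entries
        msg = m.group("msg")
        # "twisted." shows up in the Failure / traceback lines of the excerpt.
        kind = "twisted" if "twisted." in msg else "other"
        counts[(m.group("pid"), kind)] += 1
    return counts


# Three lines copied from the excerpt above, truncated as in the report.
sample = [
    "Apr 14 19:13:14 donphan sh[28839]: 2016-04-14 19:13:14+0000 [RemoteOriginRe",
    "Apr 14 19:13:17 donphan sh[28575]: Failure: twisted.",
    "Apr 14 19:46:19 donphan sh[28575]: #011twisted.",
]

for (pid, kind), n in sorted(tally_by_pid(sample).items()):
    print(pid, kind, n)
```

Run against the full syslog (for example by passing an open file object instead of sample), this would show which process emits the twisted failures and how often, which is easier to compare across commissioning runs than the raw lines.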
Changed in maas: | |
importance: | Undecided → Critical |
status: | Incomplete → Triaged |
Changed in cloud-init: | |
importance: | Undecided → Medium |
status: | Confirmed → Fix Released |
Changed in cloud-init (Ubuntu): | |
importance: | Undecided → Medium |
status: | New → Fix Released |
tags: | added: internal |
I've tried to reproduce this on 2 different MAAS clusters with 2 different types of machines, and I have been unable to. Questions:
Are you trying to access the commissioning environment because something is failing? It may be that commissioning, or something within it, is failing, which prevents the SSH key from being imported into the commissioning environment and prevents the machine from being told not to power off.
What commissioning image are you using? Xenial or Trusty?