MAAS 3.4: Deployment fails on LXD VMs
Affects | Status | Importance | Assigned to | Milestone | |
---|---|---|---|---|---|
MAAS |
Fix Released
|
High
|
Christian Grabowski |
Bug Description
running MAAS 3.4.0~alpha1 installed from snap, in production configuration, with Postgres 14
Cannot deploy LXD VMs composed from LXD VM host:
1. Commissioning completes, but takes a very long time; rackd.log is idle after squashfs is downloaded, and the lxd console for the machine shows no further activity after machine initially boots from NBP. Machine eventually (suddenly?) reaches the "Ready" state and can be moved to "Allocated".
2. Deployment can be started, but does not complete. Again, rackd.log is idle after squashfs is downloaded, and the lxd console for the machine shows no further activity after machine initially boots from NBP. UI message hangs at "Loading ephemeral". MAAS eventually stops the machine, long before deployment times out.
There are no relevant log messages during this long timeout, and there is absolutely no lxd console output for the machine during Commissioning or Deployment.
Contrast this with 3.3, in which normal commissioning and takes place, including several rackd.log messages along the way, and the expected flood of curtin and cloud-init messages in the lxd console throughout the entire processes.
Related branches
- MAAS Lander: Approve
- Alberto Donato: Approve
-
Diff: 96 lines (+48/-11)2 files modifiedsrc/maasserver/rpc/boot.py (+11/-10)
src/maasserver/rpc/tests/test_boot.py (+37/-1)
Changed in maas: | |
status: | New → Triaged |
importance: | Undecided → High |
milestone: | none → 3.4.0 |
Changed in maas: | |
status: | In Progress → Fix Committed |
Changed in maas: | |
milestone: | 3.4.0 → 3.4.0-beta1 |
Changed in maas: | |
status: | Fix Committed → Fix Released |
This one may be related: https:/ /bugs.launchpad .net/maas/ +bug/1908452