Activity log for bug #1497991

Date Who What changed Old value New value Message
2015-09-21 13:19:16 Blake Rouse bug added bug
2015-09-21 13:24:49 Blake Rouse description During a deployment with MAAS, Juju, and Landscape one of the nodes failed to PXE boot and the BIOS then chose to boot from the local disk. That disk had a previous deployment and the cloud-init datasource for MAAS setup. The node then contacted the MAAS server with those credentials which were valid the last deployment and the node was marked deployed. This is a big problem the node did not deploy at all and now all the old data is on that node. In this case the node should be marked "failed deployment" or even better MAAS would try to restart the node and try again, then after a few tries get marked "failed deployment". If a node contacts the cloud-init metadata service *not the one used by curtin* before netboot_off=True, then the node should be marked "failed deployment". This signals that curtin did not finish its entire installation process and the node should not have made it to this point of using cloud-init. The node should then be powered off so that no user or Juju will try to ssh into that node, because it will have the previous deployment SSH keys. During a deployment with MAAS, Juju, and Landscape one of the nodes failed to PXE boot and the BIOS then chose to boot from the local disk. That disk had a previous deployment and the cloud-init datasource for MAAS setup. The node then contacted the MAAS server with those credentials which were valid the last deployment and the node was marked deployed. This is a big problem the node did not deploy at all and now all the old data is on that node. In this case the node should be marked "failed deployment" or even better MAAS would try to restart the node and try again, then after a few tries get marked "failed deployment". If a node contacts the cloud-init metadata service *not the one used by curtin* before netboot_off=True, then the node should be marked "failed deployment". This signals that curtin did not finish its entire installation process and the node should not have made it to this point of using cloud-init. The node should then be powered off so that no user or Juju will try to ssh into that node, because it will have the previous deployment SSH keys. How to reproduce: Deploy a node with MAAS. Wait for the deployment to finish and be successful. Release the node and wait for it to go back to ready. Change the boot order on the node to boot from local disk first instead of PXE. *This needs to be done on a power type that doesn't change the boot order on power up, like virsh.* Deploy the node again, it will transition to "Deployed" very fast and will be the previous deployment not the new one.
2015-09-21 13:26:30 Adam Collard bug added subscriber Landscape
2016-02-18 15:40:30 Christian Reis maas: importance Critical High
2016-02-18 15:40:33 Christian Reis maas: milestone 1.9.0 1.9.1
2016-03-01 19:49:26 Andres Rodriguez maas: milestone 1.9.1 1.9.2
2016-04-27 20:20:12 Andres Rodriguez maas: milestone 1.9.2 1.9.3
2016-05-17 05:01:04 Andres Rodriguez maas: milestone 1.9.3 1.9.4
2016-07-13 13:31:45 Andres Rodriguez maas: milestone 1.9.4 1.9.5
2017-11-03 22:13:15 Andres Rodriguez maas: status Triaged Won't Fix