Machine cloud-init takes a long time when one agent binary URL times out

Bug #1622531 reported by Andrew Wilkins
This bug affects 2 people
Affects: Canonical Juju
Status: Fix Released
Importance: High
Assigned to: Andrew Wilkins

Bug Description

When machine agents are deployed to private networks separate from the controller's, as is the case with any Azure controller, the machine agents started in non-controller models are slow to download the agent binaries.

The reason for this is that the cloud-init script attempts to download from each of the controller's addresses in series, starting with the cloud-local addresses. Those addresses are never usable by machine agents outside the controller's own model, so each attempt against one has to time out before the next address is tried.

We should do at least one of the following things:
 1. attempt to download the agent from each of the addresses in parallel
 2. specify a shorter timeout to curl
 3. update providers to ignore addresses they know they can never use (e.g. the cloud-local address for azure, except when in the controller model)
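
As an illustration of option 1 only, here is a minimal Python sketch of trying every address concurrently and taking the first success. It is not Juju's code and not what cloud-init runs: `first_successful` and the `fetch` callable are hypothetical names, and a real script would shell out to curl rather than call a Python function.

```python
import concurrent.futures
from typing import Callable, Iterable


def first_successful(addresses: Iterable[str],
                     fetch: Callable[[str], bytes]) -> bytes:
    """Call fetch(address) for every address at once; return the first success.

    `fetch` is a hypothetical callable that returns the downloaded bytes or
    raises an exception (for example on a connection timeout).
    """
    addresses = list(addresses)
    if not addresses:
        raise ValueError("no addresses to try")

    errors = []
    pool = concurrent.futures.ThreadPoolExecutor(max_workers=len(addresses))
    try:
        futures = {pool.submit(fetch, addr): addr for addr in addresses}
        # as_completed yields futures in the order they finish, so one
        # unreachable address does not hold up a reachable one.
        for future in concurrent.futures.as_completed(futures):
            try:
                return future.result()
            except Exception as exc:
                errors.append((futures[future], exc))
    finally:
        # Do not wait for attempts that are still timing out.
        pool.shutdown(wait=False)
    raise RuntimeError(f"all addresses failed: {errors}")
```

The trade-off is visible even in a sketch this small: the concurrent version needs its own error collection and cleanup, which a serial loop gets for free.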

Ian Booth (wallyworld)
summary: - Machine cloud-init hould attempt to download agent using all addresses
+ Machine cloud-init should attempt to download agent using all addresses
in parallel
Changed in juju:
milestone: 2.0-rc1 → 2.0-rc2
Curtis Hovey (sinzui)
Changed in juju:
milestone: 2.0-rc2 → none
Changed in juju:
milestone: none → 2.0.0
Changed in juju:
milestone: 2.0.0 → 2.1.0
Andrew Wilkins (axwalk)
Changed in juju:
status: Triaged → In Progress
Anastasia (anastasia-macmood) wrote : Re: Machine cloud-init should attempt to download agent using all addresses in parallel
Andrew Wilkins (axwalk) wrote :

I've closed the above-mentioned PR in favour of a simpler approach: specify a connection timeout to curl of 20 seconds. This does mean that there's still a 20 second overhead, but it's better than what we have now (several minutes) and minimises the risk that would be introduced by reinventing the wheel.
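
For illustration, a small Python sketch of what a per-attempt connection timeout does to the serial download loop. `--connect-timeout` is a real curl option; the other flags, the helper names, and the URL are assumptions made for the example and are not taken from the Juju cloud-init script.

```python
from typing import List

# Value quoted in the comment above.
CONNECT_TIMEOUT_SECONDS = 20


def curl_command(url: str, dest: str,
                 connect_timeout: int = CONNECT_TIMEOUT_SECONDS) -> List[str]:
    """Build a curl argv that gives up if no connection is made in time.

    -sSf (quiet, show errors, fail on HTTP errors) and -o are assumed flags
    for this example; only --connect-timeout comes from the bug discussion.
    """
    return ["curl", "-sSf",
            "--connect-timeout", str(connect_timeout),
            "-o", dest, url]


def worst_case_delay(unreachable_first: int,
                     connect_timeout: int = CONNECT_TIMEOUT_SECONDS) -> int:
    """Upper bound, in seconds, spent on unreachable addresses tried in
    series before the first reachable one."""
    return unreachable_first * connect_timeout


# Hypothetical URL and destination, for illustration only.
cmd = curl_command("https://192.0.2.10/agent.tgz", "/tmp/agent.tgz")
```

With one unreachable cloud-local address ahead of a usable one, `worst_case_delay(1)` is 20, which is the 20 second overhead described above.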

summary: - Machine cloud-init should attempt to download agent using all addresses
- in parallel
+ Machine cloud-init takes a long time when one agent binary URL times out
John A Meinel (jameinel) wrote :
Andrew Wilkins (axwalk)
Changed in juju:
status: In Progress → Fix Committed
Curtis Hovey (sinzui)
Changed in juju:
status: Fix Committed → Fix Released