l3 agent downtime can cause tenant VM outages during upgrade
Bug #1671504 reported by
Steven Hardy
This bug affects 1 person
Affects | Status | Importance | Assigned to | Milestone | |
---|---|---|---|---|---|
tripleo |
Invalid
|
High
|
Marios Andreou |
Bug Description
We currently upgrade the l3 agent on the controller during the upgrade_tasks steps, which isn't batched (we take down the services on all nodes at the same time).
It would be better to instead use upgrade_batch_tasks to ensure minimal downtime (which will stop, upgrade, then start the service on each node one by one.
There is a question over package dependencies if we do this, but provided there aren't too many it may be possible to simply move the tasks to upgrade_batch_tasks
Changed in tripleo: | |
status: | New → Triaged |
importance: | Undecided → High |
milestone: | none → pike-1 |
Changed in tripleo: | |
assignee: | nobody → Marios Andreou (marios-b) |
status: | Triaged → In Progress |
Changed in tripleo: | |
milestone: | pike-1 → pike-2 |
To post a comment you must log in.
WIP - https:/ /review. openstack. org/445494 for possibly getting this into newton to ocata workflow... still being tested.