charm gets stuck on config-changed hook and holds juju machine-lock while trying to create br-ex
Affects | Status | Importance | Assigned to | Milestone | |
---|---|---|---|---|---|
OpenStack Neutron Open vSwitch Charm |
New
|
Undecided
|
Unassigned |
Bug Description
This issue was faced by a customer on their first small bionic-stein cloud
(9 nodes, 7 nova-compute + 2 neutron-gateway).
They did not face this issue in any of their larger bionic-stein clouds
(39 nodes, 37 nova-compute + 2 neutron-gateways).
I was able to reproduce this in an OrangeBox (10 node - all nova-compute).
The bundle I used to reproduce this issue - https:/
The juju status (problem is on the octavia LXD containers): https:/
Output of ps aux | grep juju on one of the units shows that there is a python3 process running the config-changed hook of the neutron-ovs : https:/
juju machine-lock.log: https:/
Also I saw that the ovs-vswitchd service fails to run.
I was led to believe that this issue may be due to this bug: https:/
limits.
However after applying the patch and upgrading the charm and recreating the containers, the result is still the same so I think it is a diffrent issue altogether.
Here are the crashdumps before the patch:https:/
and after the patch: https:/
Crash file in juju crashdump has the same signature as LP#1906280. Marking as duplicate