kube-ovn controller pod is in crashloop backoff after cidr-expansion
Affects | Status | Importance | Assigned to | Milestone | |
---|---|---|---|---|---|
Charmed Kubernetes Testing |
Fix Released
|
High
|
George Kraft | ||
Kubernetes Control Plane Charm |
Fix Released
|
High
|
George Kraft |
Bug Description
while running the end-to-end validation tests from jenkins [0], a test failure usually manifests in the wrong spot after the cidr expansion tests rather than during. This was revealed by a CrashLoopBackoff Pod in the kube-ovn-controller deployment [1]. I checked the deployment config [2] and it had the right expanded cidr
Steps I believe are necessary to reproduce:
```
juju add-model kubernetes-ovn
juju deploy charmed-kubernetes --overlay overlays/
juju-wait
tox -e py -- .tox/py/bin/pytest jobs/integratio
```
[0]: https:/
[1]: https:/
[2]: https:/
Changed in charmed-kubernetes-testing: | |
status: | In Progress → Fix Committed |
Changed in charm-kubernetes-master: | |
status: | In Progress → Fix Committed |
Changed in charmed-kubernetes-testing: | |
milestone: | 1.25+ck3 → 1.26 |
Changed in charm-kubernetes-master: | |
milestone: | 1.25+ck3 → 1.26 |
Changed in charmed-kubernetes-testing: | |
status: | Fix Committed → Fix Released |
Changed in charm-kubernetes-master: | |
status: | Fix Committed → Fix Released |
After investigating the crash, and while writing this bug report, eventually the deployment stabilized and kube-ovn-controller was up without crashes. This may be why the rest of the tests in the suite continue normally. Perhaps there is a longer recovery time than expected after the cidr expansion is begun?