[kolla] periodic: container build job can fail during push
Affects | Status | Importance | Assigned to | Milestone | |
---|---|---|---|---|---|
kolla |
Fix Released
|
Medium
|
Mark Goddard | ||
Queens |
Fix Committed
|
Medium
|
Mark Goddard | ||
Rocky |
Fix Committed
|
Medium
|
Mark Goddard | ||
Stein |
Fix Committed
|
Medium
|
Mark Goddard | ||
Train |
Fix Released
|
Medium
|
Mark Goddard | ||
tripleo |
Fix Released
|
Medium
|
Unassigned |
Bug Description
```
2019-09-19 01:13:00 | ERROR:kolla.
2019-09-19 01:13:00 | Traceback (most recent call last):
2019-09-19 01:13:00 | File "/home/
2019-09-19 01:13:00 | self.push_
2019-09-19 01:13:00 | File "/home/
2019-09-19 01:13:00 | for response in self.dc.
2019-09-19 01:13:00 | File "/usr/lib/
2019-09-19 01:13:00 | for chunk in json_stream(
2019-09-19 01:13:00 | File "/usr/lib/
2019-09-19 01:13:00 | for data in stream_
2019-09-19 01:13:00 | File "/usr/lib/
2019-09-19 01:13:00 | for data in stream:
2019-09-19 01:13:00 | File "/usr/lib/
2019-09-19 01:13:00 | data = reader.read(1)
2019-09-19 01:13:00 | File "/usr/lib/
2019-09-19 01:13:00 | raise IncompleteRead(
2019-09-19 01:13:00 | File "/usr/lib64/
2019-09-19 01:13:00 | self.gen.
2019-09-19 01:13:00 | File "/usr/lib/
2019-09-19 01:13:00 | raise ReadTimeoutErro
2019-09-19 01:13:00 | ReadTimeoutError: UnixHTTPConnect
2019-09-19 01:13:00 |INFO:kolla.
2019-09-19 01:13:00 |INFO:kolla.
2019-09-19 01:13:40 |INFO:kolla.
2019-09-19 01:13:40 |INFO:kolla.
2019-09-19 01:13:41 |INFO:kolla.
2019-09-19 01:13:41 |INFO:kolla.
2019-09-19 01:13:42 |INFO:kolla.
2019-09-19 01:13:42 |INFO:kolla.
```
Apparently the code does not have any retry mechanism on it, I think it should retry at least 3 times withing 10 minutes before failing, so we can avoid failing the entire job just because an external service is restarted or the network connectivity is bit flaky.
The retry should be implemented around https:/
tags: | added: alert |
summary: |
- periodic: container build job can fail during push + [koilla] periodic: container build job can fail during push |
summary: |
- [koilla] periodic: container build job can fail during push + [kolla] periodic: container build job can fail during push |
Changed in kolla: | |
importance: | Undecided → Medium |
status: | New → Triaged |
description: | updated |
Changed in tripleo: | |
status: | New → Triaged |
milestone: | none → ussuri-2 |
importance: | Undecided → Medium |
Changed in tripleo: | |
milestone: | ussuri-2 → ussuri-3 |
Changed in tripleo: | |
status: | Triaged → Fix Released |
It seems to be a glitch - all push attempts before and after this error ended with "Pushed successfully". RDO registry logs have been checked - ho issue found.
I recommend to retry the push.