container quay-push jobs are failing push quay.rdoproject.org with 502 Bad Gateway

Bug #1998762 reported by Marios Andreou
6
This bug affects 1 person
Affects Status Importance Assigned to Milestone
tripleo
Fix Released
Critical
Unassigned

Bug Description

At [1][2][3][4] the periodic-tripleo-ci-build-containers- quay-push-<branch> jobs are failing to push to quay.rdoproject.org with trace like:

 * 2022-12-04 21:41:27 | Stderr: 'Getting image source signatures\nCopying blob sha256:b7b591e3443f17f9d8272b8d118b6c031ca826deb09d4b44f296ba934f1b6e57\nCopying blob sha256:1ffe7e5e3ec6d4681c808baeb071f69c94c572e1cac5f5af77d9784d908ee782\nCopying blob sha256:ccf04fbd6e1943f648d1c2980e96038edc02b543c597556098ab2bcaa4fd1fa8\nCopying blob sha256:237660821a7ca594556f4c64b736b4e874d6307c3b75c0db939e9b2d1c91948b\nerror pushing image "quay.rdoproject.org/tripleowallabycentos8/openstack-tripleoclient:47c581dacb7a7b35ad3aa337c08e1703" to "docker://quay.rdoproject.org/tripleowallabycentos8/openstack-tripleoclient:47c581dacb7a7b35ad3aa337c08e1703": error copying layers and metadata from "containers-storage:[overlay@/var/lib/containers/storage+/run/containers/storage:overlay.mountopt=nodev,metacopy=on]quay.rdoproject.org/tripleowallabycentos8/openstack-tripleoclient:47c581dacb7a7b35ad3aa337c08e1703" to "docker://quay.rdoproject.org/tripleowallabycentos8/openstack-tripleoclient:47c581dacb7a7b35ad3aa337c08e1703": Error writing blob: Error initiating layer upload to /v2/tripleowallabycentos8/openstack-tripleoclient/blobs/uploads/ in quay.rdoproject.org: received unexpected HTTP status: 502 Bad Gateway\n': RuntimeError: 

Noting the 502 Bad Gateway this may be an issue with the registry, if so we can close this out once fixed/resolved.

This has been hitting us since 04th December, e.g. there are three examples there [5] from 04th for periodic-tripleo-ci-build-containers-centos-8-quay-push-wallaby

Obviously promotion blocker.

[1] https://logserver.rdoproject.org/openstack-periodic-integration-stable1/opendev.org/openstack/tripleo-ci/master/periodic-tripleo-ci-build-containers-centos-8-quay-push-wallaby/94991d4/logs/build.log
[2] https://logserver.rdoproject.org/openstack-periodic-integration-main/opendev.org/openstack/tripleo-ci/master/periodic-tripleo-ci-build-containers-ubi-9-quay-push-master/aea1339/logs/build.log
[3] https://logserver.rdoproject.org/openstack-periodic-integration-stable1/opendev.org/openstack/tripleo-ci/master/periodic-tripleo-ci-build-containers-ubi-9-quay-push-wallaby/860e767/logs/build.log
[4] https://logserver.rdoproject.org/openstack-periodic-integration-stable1/opendev.org/openstack/tripleo-ci/master/periodic-tripleo-ci-build-containers-centos-8-quay-push-wallaby/7176ded/logs/build.log
[5] https://review.rdoproject.org/zuul/builds?job_name=periodic-tripleo-ci-build-containers-centos-8-quay-push-wallaby

Revision history for this message
Marios Andreou (marios-b) wrote :

strangely this did not affect train, at [1] green runs 03 04 05 December

[1] https://review.rdoproject.org/zuul/builds?job_name=periodic-tripleo-ci-build-containers-centos-8-quay-push-train

Revision history for this message
Marios Andreou (marios-b) wrote :

OK, this looks like it is resolving? will monitor today and close the bug if so.

Zed [1] failed on 04th with this bug [2] but green today 05th

        * 2022-12-05 03:14:23 SUCCESS
        * 2022-12-04 13:22:07 FAILURE

[1] https://review.rdoproject.org/zuul/builds?job_name=periodic-tripleo-ci-build-containers-ubi-9-quay-push-zed
[2] https://logserver.rdoproject.org/openstack-periodic-integration-zed-centos9/opendev.org/openstack/tripleo-ci/master/periodic-tripleo-ci-build-containers-ubi-9-quay-push-zed/9fd5adc/logs/build.log

Revision history for this message
Marios Andreou (marios-b) wrote :

11:25 < dpawlik> marios|ruck: quay was out of disk space due
11:26 < dpawlik> zed repository was not added into the pruner
11:26 < dpawlik> that's why was 502

Revision history for this message
Marios Andreou (marios-b) wrote :

https://softwarefactory-project.io/r/c/software-factory/sf-infra/+/26900 Add api-centos9-zed into Quay pruner DLRN endpoints

Revision history for this message
daniel.pawlik (daniel-pawlik) wrote :

Our Quay registry have an issue described here: https://issues.redhat.com/browse/PROJQUAY-2451 that has been fixed in https://github.com/quay/quay/pull/961 , but it requires upgrade to 3.7.0.

Revision history for this message
daniel.pawlik (daniel-pawlik) wrote :

Update to 3.7.10 done, but space is cleaning up very slow.
The error 502 might happen once again.

Revision history for this message
Marios Andreou (marios-b) wrote (last edit ):

this is still hitting us

example at [1] killed the c9/wallaby run this morning and [2] from the train line:

        * 2022-12-06 04:48:05 | 2022-12-06 04:48:05.119 39306 ERROR tripleoclient.v2.tripleo_container_image.Build Stderr: 'Getting image source signatures\nCopying blob sha256:ce9d08ae8f9927d284c3e9cd8e01fb1ccb35f3ccb58316e99172a48fb539f003\nCopying blob sha256:96b52c5f75bb0a72743ceab116e64ec82732f4b832df631ad97da614fb5ab311\nCopying blob sha256:ae04de67f65bc7e47c74b6897f995ceaaecfe3cd8bdd373f9f5409dd7d4e3b53\nCopying blob sha256:4e5109037c82ecb446faa91efeea0b88fd48579ea872a7df1a28a151f63a6814\nError: pushing image "quay.rdoproject.org/tripleowallabycentos9/openstack-ironic-base:b6edcbfef9c90c6251a7e5d45568f1ac" to "docker://quay.rdoproject.org/tripleowallabycentos9/openstack-ironic-base:b6edcbfef9c90c6251a7e5d45568f1ac": writing blob: initiating layer upload to /v2/tripleowallabycentos9/openstack-ironic-base/blobs/uploads/ in quay.rdoproject.org: received unexpected HTTP status: 502 Bad Gateway\n'

[1] https://logserver.rdoproject.org/openstack-periodic-integration-stable1/opendev.org/openstack/tripleo-ci/master/periodic-tripleo-ci-build-containers-ubi-9-quay-push-wallaby/50728b1/logs/build.log
[2] https://logserver.rdoproject.org/openstack-periodic-integration-stable4/opendev.org/openstack/tripleo-ci/master/periodic-tripleo-ci-build-containers-centos-8-quay-push-train/7cffaa1/logs/build.log

Revision history for this message
Marios Andreou (marios-b) wrote :

moving this fix-released

dpawlik added a new volume yesterday
issue is resolved

Changed in tripleo:
status: Triaged → Fix Released
To post a comment you must log in.
This report contains Public information  
Everyone can see this information.

Other bug subscribers

Remote bug watches

Bug watches keep track of this bug in other bug trackers.