autopkgtest success rate dropped inhibiting proposed migration
Affects | Status | Importance | Assigned to | Milestone | |
---|---|---|---|---|---|
systemd (Ubuntu) |
Fix Released
|
Medium
|
Unassigned | ||
Bionic |
Fix Released
|
Medium
|
Dan Streetman | ||
Focal |
Fix Released
|
Medium
|
Dan Streetman | ||
Groovy |
Fix Released
|
Undecided
|
Unassigned |
Bug Description
[impact]
autopkgtests are failing/flaky and prevent other packages from migrating to -updates
[test case]
check autopkgtest history
[regression potential]
in regard to the changed test cases, any regression would likely result in either an incorrectly passed test, or an incorrectly failed test.
[scope]
for systemd, this is needed for x, b, and f.
tests in g appear to be mostly stable, but I've opened MR (linked from this bug) to update the tests there as well.
i don't plan to update x, as it's reaching ESM in ~6 months, and backporting the test fixes is more work than just a simple code copy, since there are additional differences/changes needed in the older version of systemd (and python3). the failing/flaky tests in x have been like that forever, and people have just retried them; we can keep retrying them until x moves into ESM next year.
[original description]
Hi,
we had such cases in the past like bug 1817721 for bionic and maybe bug 1892130 is about the same as well. There were more but I didn't want to search for all of them - what I checked is that there are no open ones clearly pointing out the recent further drop in already flaky subtests.
In particular the tests "tests-in-lxd" and "systemd-fsckd" were known to be flaky before, but got even worse.
Here stats of the last 40 runs, it might be a coincidences that this is after 246-2ubuntu1 landed. Could as well be any other change
groovy
amd64
tests-in-lxd (F 42% S 0% B 10% => P 45%/) FFFFBFFFFFFFB.
build-login (F 0% S 0% B 10% => P 87%/) ....B..
unit-config (F 0% S 0% B 10% => P 87%/) ....B..
networkd-testpy (F 0% S 0% B 10% => P 87%/) ....B..
boot-and-services (F 0% S 0% B 10% => P 87%/) ....B..
boot-smoke (F 0% S 0% B 10% => P 87%/) ....B..
logind (F 0% S 0% B 10% => P 87%/) ....B..
storage (F 0% S 0% B 10% => P 87%/) ....B..
upstream (F 35% S 0% B 10% => P 52%/) ..FFB.FFF.
udev (F 0% S 0% B 10% => P 87%/) ....B..
systemd-fsckd (F 37% S 0% B 10% => P 50%/) FFFFBFFFFFFFB.
root-unittests (F 0% S 0% B 10% => P 87%/) ....B..
ppc64el
tests-in-lxd (F 25% S 0% B 0% => P 75%/) FFFF...
systemd-fsckd (F 35% S 0% B 0% => P 65%/) FFFFFFF.
root-unittests (F 2% S 0% B 0% => P 97%/) .......
s390x
tests-in-lxd (F 52% S 0% B 0% => P 47%/) FFFFFFF.
timedated (F 2% S 0% B 0% => P 97%/) .......
upstream (F 17% S 0% B 0% => P 82%/) .....F.
systemd-fsckd (F 32% S 0% B 0% => P 67%/) FFFFFFF.
root-unittests (F 10% S 0% B 0% => P 90%/) .......
arm64
tests-in-lxd (F 40% S 0% B 2% => P 57%/) FFFFF.B.
logind (F 2% S 0% B 2% => P 95%/) ......B.
upstream (F 22% S 0% B 2% => P 75%/) ...F.FB.
root-unittests (F 12% S 0% B 2% => P 85%/) ......B.
(I'm sure LP will make this unreadable, but is is nice in monospace)
Whatever the root cause is - the success rate of these has reduced so much that the (even formerly questionable) practice of retry-until-success won't work anymore.
I have run the two tests in a local VM and systemd-fsckd works there while tests-in-lxd seems to trip over the old flaky fellow being "boot-and-
We had the discussion in the past, but I think I need to again bring up the suggestion to skip "tests-in-lxd" and "systemd-fsckd" until they are on reasonable success rates.
Related branches
- Ubuntu Stable Release Updates Team: Pending requested
-
Diff: 13 lines (+3/-0)1 file modifiedubuntu-sru (+3/-0)
tags: | added: rls-gg-incoming |
description: | updated |
Changed in systemd (Ubuntu Bionic): | |
assignee: | nobody → Dan Streetman (ddstreet) |
importance: | Undecided → Medium |
status: | New → In Progress |
Changed in systemd (Ubuntu Focal): | |
assignee: | nobody → Dan Streetman (ddstreet) |
importance: | Undecided → Medium |
status: | Confirmed → In Progress |
description: | updated |
no longer affects: | build-essential (Ubuntu) |
no longer affects: | build-essential (Ubuntu Focal) |
no longer affects: | glib2.0 (Ubuntu) |
no longer affects: | iputils (Ubuntu) |
no longer affects: | kbd (Ubuntu) |
no longer affects: | linux-meta (Ubuntu) |
no longer affects: | linux-meta (Ubuntu Bionic) |
no longer affects: | linux-meta (Ubuntu Focal) |
no longer affects: | ntpsec (Ubuntu) |
no longer affects: | qemu (Ubuntu) |
no longer affects: | qemu (Ubuntu Focal) |
no longer affects: | util-linux (Ubuntu) |
no longer affects: | util-linux (Ubuntu Focal) |
Changed in systemd (Ubuntu): | |
importance: | Undecided → Medium |
Tests-in-lxd fails because LXD images ship invalid /etc/fstab again making systemd- remount- fs.service fail.
I'm marking systemd-fsckd test as flaky to not block others' work.