Bug#1072512: mutter: flaky autopgktest
Simon McVittie
smcv at debian.org
Sat Sep 7 10:37:49 BST 2024
On Fri, 06 Sep 2024 at 23:40:15 +0200, Paul Gevers wrote:
> On Mon, 3 Jun 2024 10:12:07 -0400 =?UTF-8?Q?Jeremy_B=C3=ADcha?=
> <jeremy.bicha at canonical.com> wrote:
> > The tests passed on retry and libxcursor is no longer blocked from
> > migrating. Sorry that the autopkgtests are flaky. :(
>
> The ratio at which mutter (at least on amd64) is failing on ci.d.n is
> unacceptable. (The ratio where I'm starting to doubt about severity is about
> 1/6 to 1/8 failures, but recent history is more like 1/2.)
Looking at the recent failure logs, the failures all seem to be in the
"stacking" test suite, which has two semi-common failure modes:
- sometimes mutter tries to create a texture with a negative size for
reasons that I tried to investigate but do not understand
(#1077800 aka #1078359)
- and sometimes the stacking (window management) tests just aren't reliable
(#1050022, #1050023, similar random failures)
None of the individual test failures are particularly frequent, but there
are enough affected test cases that there's a high probability of at least
one of them failing in any given buildd build or autopkgtest run.
To mitigate this, I'm going to try something I had been considering for
a while: treat the whole "stacking" test suite as flaky. Unfortunately it
just isn't stable enough to be used as a QA gate.
smcv
More information about the pkg-gnome-maintainers
mailing list