[all][TC] Stats about rechecking patches without reason given

Slawek Kaplonski skaplons at redhat.com
Thu Jun 30 14:53:19 UTC 2022


On Thursday, 30 June 2022 15:37:47 CEST Sean Mooney wrote:
> On Thu, 2022-06-30 at 13:06 +0000, Jeremy Stanley wrote:
> > On 2022-06-30 14:57:44 +0200 (+0200), Dmitriy Rabotyagov wrote:
> > > Is it possible to adjust the script a bit in the future to add the
> > > amount of changes pushed/merged or some ratio of the amount of
> > > rechecks per merged patch? I think it would also be an interesting
> > > stat to see in addition to the amount of rechecks to understand how CI
> > > is stable or not.
> > [...]
> > 
> > Recheck comment volume doesn't really provide an accurate measure of
> > CI stability, all it tells you is how often people requested
> > rerunning tests. Their reasons for doing it can be myriad, from not
> > believing actual failures their changes are causing, to repeatedly
> > rechecking successful results in hopes of reproducing some rare
> > failure condition.
> Yep, we also recheck successful results if we think we have fixed an intermittent
> CI failure that we could not reproduce reliably but created a patch for based on code inspection.
> In such a case we usually recheck 3 times, looking for at least 3 consecutive check +1s before we +2w.
> Rarely, I also recheck if a patch is old and the logs have rotated when I'm reviewing others' work,
> but generally I just click the rebase button in that case. For example, I will tend to do +2 recheck
> if there are already cherry-picks of the patch, to avoid those having to be updated. But as I said, this is
> rare, as we don't often have bug fixes that sit around for 3+ months and still actually apply without a merge conflict,
> but it does happen.
> So recheck is not a great proxy for CI stability without knowing the reason, which is why not doing bare rechecks is important.

That's true. The reason I wrote the script to check "bare" rechecks is to see how often people just do "recheck" without even checking the reason for the failures.
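
To illustrate what counting "bare" rechecks could look like, here is a minimal sketch. It is not the actual script: the function names are hypothetical, and it assumes that a comment may start with a "Patch Set N:" header and that a recheck is "bare" when nothing follows the word "recheck".

```python
import re

# Assumption: review comments may begin with a "Patch Set N:" header.
_HEADER = re.compile(r"^Patch Set \d+:\s*")
# A recheck comment starts with the word "recheck"; anything after it is the reason.
_RECHECK = re.compile(r"^recheck\b(.*)$", re.IGNORECASE | re.DOTALL)


def recheck_reason(message):
    """Return None for a non-recheck comment, '' for a bare recheck,
    or the stated reason otherwise. (Hypothetical helper.)"""
    body = _HEADER.sub("", message.strip(), count=1).strip()
    match = _RECHECK.match(body)
    if match is None:
        return None
    return match.group(1).strip()


def bare_recheck_ratio(messages):
    """Fraction of recheck comments that give no reason at all."""
    reasons = [r for r in map(recheck_reason, messages) if r is not None]
    if not reasons:
        return 0.0
    bare = sum(1 for r in reasons if r == "")
    return bare / len(reasons)
```

A ratio close to 1.0 would mean that almost nobody explains why they are rechecking.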

For CI stability, some time ago I wrote another script [1] which checks only merged patches and counts the number of "Failed build" comments from Zuul on the last, merged patch set. That is certainly not a perfect metric either, but IMO it can give a better view of CI stability, as it will not count rechecks of a passed CI run done to look for intermittent failures, nor failures caused by the patch itself.
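
The counting idea can be sketched roughly as below. This is only an illustration under stated assumptions, not the code from [1]: the `Comment` type and function names are hypothetical, the last patch set is taken to be the highest patch set number seen in the comments, and the CI account name and failure text are passed in by the caller rather than hard-coded.

```python
from dataclasses import dataclass


@dataclass
class Comment:
    # Hypothetical, simplified view of a review comment.
    author: str
    patch_set: int
    message: str


def failed_builds_on_last_patch_set(comments, ci_author, failure_marker):
    """Count CI failure comments on the last patch set of one merged change."""
    if not comments:
        return 0
    # Assumption: the merged patch set is the highest-numbered one.
    last_patch_set = max(c.patch_set for c in comments)
    return sum(
        1
        for c in comments
        if c.patch_set == last_patch_set
        and c.author == ci_author
        and failure_marker in c.message
    )


def average_failed_builds(changes, ci_author, failure_marker):
    """Average number of failed builds per merged change."""
    if not changes:
        return 0.0
    total = sum(
        failed_builds_on_last_patch_set(c, ci_author, failure_marker)
        for c in changes
    )
    return total / len(changes)
```

Failures on earlier patch sets are ignored on purpose, since those may have been caused by the patch itself rather than by unstable CI.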


Slawek Kaplonski
Principal Software Engineer
Red Hat

[1] https://github.com/slawqo/rechecks-stats/blob/main/rechecks_stats/rechecks.py

More information about the openstack-discuss mailing list