ci: Daily CI to soak flaky tests #5382

robin-aws · 2024-04-30T21:16:24Z

Description

Setting up a daily scheduled build to run the LSP tests 5 times on each platform, in an effort to proactively uncover flaky test failures that show up at the worst possible time as you're trying to get your wonderful PR you've been working on for weeks merged or even worse that break the nightly build for the third day in a row forcing the team to drop everything to click Retry Failed Jobs two or three times before ANY PRs can be merged not that I'm frustrated or anything. :)

The PR CI was previously running the LSP tests twice every time for the same reason. Given this daily job will soak them more deeply, and that the osx unit test job on PRs in particular has been taking almost 45 minutes and becoming the limiting factor, I've opted to just run them once on PRs now.

Note that unlike the original double run, it was simpler to use matrices just as we do for integration tests, only in this case just iterating the same test run in parallel 5 times. It's possible running twice on the same runner might trigger more failures, but it seems unlikely since the second run happens in a fresh process and no state leaks between the runs AFAICT.

How has this been tested?

Dry-run on my fork: https://github.com/robin-aws/dafny/actions/runs/8914692810

I've run it a few times without any failures yet, but we'll have to see what happens after a week or so of scheduled runs.

By submitting this pull request, I confirm that my contribution is made under the terms of the MIT license.

…dafny into daily-ci-to-soak-flaky-tests

alex-chew

LGTM

robin-aws added 2 commits April 30, 2024 10:49

First cut

0369e4d

Parameterize unit tests workflow

60eaa2b

robin-aws added the run-deep-tests Tells CI to run all tests label Apr 30, 2024

Poke CI

f5ab9bd

robin-aws removed the run-deep-tests Tells CI to run all tests label May 1, 2024

robin-aws added 6 commits May 1, 2024 11:44

Merge branch 'master' into daily-ci-to-soak-flaky-tests

5a59976

Fix working directory and population steps

614bb03

Merge branch 'daily-ci-to-soak-flaky-tests' of github.com:dafny-lang/…

ef2c490

…dafny into daily-ci-to-soak-flaky-tests

Debugging on fork

649e65b

Make os explicit to fix matrix calculation

582a649

Better name

e1c017f

robin-aws closed this May 1, 2024

robin-aws added 2 commits May 1, 2024 14:00

Restore scheduling guard

b349c91

Better scheduled time

ac84fc7

robin-aws reopened this May 1, 2024

robin-aws marked this pull request as ready for review May 1, 2024 21:14

alex-chew approved these changes May 1, 2024

View reviewed changes

Merge branch 'master' into daily-ci-to-soak-flaky-tests

90c5ed0

robin-aws enabled auto-merge (squash) May 1, 2024 23:12

robin-aws merged commit 79c323d into master May 1, 2024
21 checks passed

robin-aws deleted the daily-ci-to-soak-flaky-tests branch May 1, 2024 23:40

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

ci: Daily CI to soak flaky tests #5382

ci: Daily CI to soak flaky tests #5382

robin-aws commented Apr 30, 2024 •

edited

Loading

alex-chew left a comment

ci: Daily CI to soak flaky tests #5382

ci: Daily CI to soak flaky tests #5382

Conversation

robin-aws commented Apr 30, 2024 • edited Loading

Description

How has this been tested?

alex-chew left a comment

Choose a reason for hiding this comment

robin-aws commented Apr 30, 2024 •

edited

Loading