[gettext] fix flaky test on windows #11940

picnixz · 2024-02-04T14:15:41Z

This is a PR for triggering the CI/CD. I'll try to see if it changes anything (and to check if a special statement caused the regular issues). So I'll likely leave that PR open and try to retrigger CI/CD a few times to make it fail.

jayaddison · 2024-02-04T14:56:43Z

Something I've been wondering about: is the .mo file genuinely a source dependency of the project during rebuilds?

It seems like the failing test is trying to say that "no, modifying the .mo file should not result in detection of files-to-update".

But I think that some of the logic in the codebase (notably here) does seem to consider mo files as dependencies (at least when the builder is configured accordingly).

picnixz · 2024-02-04T14:58:39Z

Well, the issue is that on Windows the test sometimes passes and sometimes fails. I suspect that something goes wrong when we query the last modification time, so I put some debug prints here and there to check.

As for the dependency, I don't know but I think yes.

picnixz · 2024-02-04T16:33:43Z

Ok, so I know what happens:

When we read the documents, we store a timestamp to indicate when each one was read. We use time.time_ns() // 1000, which gives us a lower bound on the real time since the epoch in microseconds. In particular, the real read time of the document is either the one that we stored or something a bit later.
When we want to test for the last modification time, we take the last modification time as given by the OS in nanoseconds and round it up to the next microsecond. So, we do -(ns // -1000).

Now, the failure that I could extract has a time.time_ns() equal to T = 1707062261168486400. So, we will store T // 1000 = 1707062261168486. The exact value would be 1707062261168486.4. Now, if we compare it to a document for which os.stat(file).st_mtime_ns also returns T, we actually compute T' = -(T // -1000) = 1707062261168487, and here is the little issue: T' is one microsecond greater than the stored value, so the file looks as if it was modified after it was read, even though both timestamps describe the same instant.
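The arithmetic above can be reproduced in a few lines. This is only a sketch: the helper names `floor_us` and `ceil_us` are made up for illustration, and the final comparison assumes the "outdated" check is a strict "newer than" test, which the thread does not spell out.

```python
# Timestamp reported in the comment above, in nanoseconds since the epoch.
T = 1707062261168486400


def floor_us(ns: int) -> int:
    """Round nanoseconds down to whole microseconds (read-time side)."""
    return ns // 1000


def ceil_us(ns: int) -> int:
    """Round nanoseconds up to whole microseconds (mtime side)."""
    return -(ns // -1000)


read_time = floor_us(T)  # 1707062261168486
mtime = ceil_us(T)       # 1707062261168487

# Assumption: a file counts as modified when its mtime is strictly greater
# than the stored read time. Same instant, yet the file looks newer.
looks_modified = mtime > read_time
print(read_time, mtime, looks_modified)
```

The two roundings only agree when the nanosecond value is an exact multiple of 1000, so any shared timestamp with a non-zero sub-microsecond part produces this one-microsecond gap.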

It should be fine to err on the side of caution for the release build, because better safe than sorry. But for the tests, since those failures appear quite often, we keep getting notifications that the workflow failed. Also, people might get confused because they cannot always tell whether what they did affected this component or not. So I suggest that, for the tests only, we could mock time.time_ns for the specific tests and only for Windows platforms.
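As a rough illustration of the mocking idea (not necessarily how this PR ends up doing it), `unittest.mock.patch` can freeze `time.time_ns` for the duration of a test. The function `read_timestamp_us` and the constant `FROZEN_NS` are hypothetical stand-ins, and the platform check is left out for brevity.

```python
import time
from unittest import mock


def read_timestamp_us() -> int:
    """Hypothetical stand-in for the code that records a document's read time."""
    return time.time_ns() // 1000


# Hypothetical frozen clock value: an exact multiple of 1000 ns, so that
# rounding down and rounding up to microseconds give the same result.
FROZEN_NS = 1707062261168486000

with mock.patch("time.time_ns", return_value=FROZEN_NS):
    frozen_us = read_timestamp_us()

ceil_us = -(FROZEN_NS // -1000)
print(frozen_us, ceil_us)
```

In a pytest suite the same effect could be had with the `monkeypatch` fixture, guarded by a `sys.platform == "win32"` check if the mock should only apply on Windows.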

Actually the whole issue stems from the fact that time.time_ns() has a resolution of 318 microseconds on Windows vs 84ns on Unix-based systems. That's likely the reason why my two files end up having the same st_mtime_ns value on Windows but not on Unix (where the timestamps reflect that they are effectively created at different points in time). If Windows had a good time.time_ns() resolution, then this issue would likely disappear.
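To get a feel for the clock granularity on a given machine, something like the following can be run locally. It is a crude probe, not a benchmark: `smallest_tick_ns` is a hypothetical helper, and the numbers it prints depend on the platform and Python build, so they will not necessarily match the figures quoted above.

```python
import time


def smallest_tick_ns(samples: int = 100_000) -> int:
    """Return the smallest non-zero gap between consecutive time.time_ns() calls.

    Returns 0 if the clock never advanced during the sampling loop.
    """
    smallest = None
    previous = time.time_ns()
    for _ in range(samples):
        current = time.time_ns()
        delta = current - previous
        if delta > 0 and (smallest is None or delta < smallest):
            smallest = delta
        previous = current
    return smallest if smallest is not None else 0


print("observed tick (ns):", smallest_tick_ns())
# Resolution of the clock behind time.time(), as reported by Python itself.
print("reported resolution (s):", time.get_clock_info("time").resolution)
```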

picnixz · 2024-02-04T16:46:11Z

Closing since I'm done with my investigation.

picnixz · 2024-02-10T11:48:32Z

So let's work again on that one.

picnixz · 2024-02-10T12:03:51Z

I think it now works?

picnixz · 2024-02-10T20:06:01Z

Merging this one should (I hope) fix the recurring failures that we had on Windows.

tests/test_intl/test_intl.py

jayaddison · 2024-02-11T01:15:24Z

FYI: I've got a branch where I've attempted to keep all the 'fix' parts of this, while filtering out the refactor-like changes.

That helped me to understand that the key parts are both the time-mocking and also the very-specific modified-time patch after the BOM file has been written but before it is re-read.
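For readers unfamiliar with adjusting modification times by hand, here is one way such a patch could look using `os.utime`. This is a sketch under assumptions: the thread does not show the actual change, the helper `backdate_mtime` and the file name are hypothetical, and the 10-second offset is arbitrary.

```python
import os
import tempfile
import time


def backdate_mtime(path: str, offset_ns: int) -> None:
    """Hypothetical helper: move a file's modification time offset_ns into the past."""
    st = os.stat(path)
    os.utime(path, ns=(st.st_atime_ns, st.st_mtime_ns - offset_ns))


with tempfile.TemporaryDirectory() as tmp:
    path = os.path.join(tmp, "example.mo")  # hypothetical file name
    with open(path, "wb") as f:
        f.write(b"\x00")

    # Record a "read time" the same way as described earlier in the thread.
    read_time_us = time.time_ns() // 1000

    # Push the mtime well into the past so clock granularity cannot matter.
    backdate_mtime(path, offset_ns=10_000_000_000)  # 10 seconds

    mtime_us = -(os.stat(path).st_mtime_ns // -1000)
    is_older = mtime_us < read_time_us

print(is_older)
```

Setting the timestamp explicitly removes the dependency on how quickly the OS clock advances between the write and the subsequent read.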

It's currently visible for comparison at: master...jayaddison:sphinx:issue-11941/pr-11940-review-experimentation#files_bucket (although I'll probably remove that branch at some point)

I did also make one or two changes during that - nothing too significant. I'll try to remember to provide the relevant things as code-review feedback.

tests/test_intl/test_intl.py

picnixz · 2024-02-24T12:28:49Z

I think I'll merge this one next since it's not hard to revert it if there are issues. It's not blocking per se but at least it would reduce the number of random CI/CD failures.

jayaddison · 2024-02-24T13:29:45Z

Nice work @picnixz!

picnixz added 4 commits February 4, 2024 14:55

try fixing flaky test

131f1a8

improve test logic

f468b13

improve test logic

7f0ec64

fix test

dc0fda3

picnixz added the DO NOT MERGE label Feb 4, 2024

picnixz added 2 commits February 4, 2024 15:55

reduce test suite for investigation

565511f

fix lint

cd04281

picnixz added 7 commits February 4, 2024 16:26

little hack for repeating tests

d2d0d22

add more messages

01755f8

fast fail

238b916

add hacky print

265ec9c

add prec

58cb6b2

add more info

88852fb

.

08ae601

picnixz mentioned this pull request Feb 4, 2024

[CI/CD] Improve test_gettext_dont_rebuild_mo on Windows platforms. #11941

Closed

picnixz closed this Feb 4, 2024

jayaddison mentioned this pull request Feb 5, 2024

[tests] _setup_intl autouse fixture writes mo files to unused paths #11953

Closed

Merge branch 'master' into fix/windows-test-gettext

10aba70

picnixz reopened this Feb 10, 2024

picnixz added 5 commits February 10, 2024 12:50

try monkeypatch

591ac66

reduce wf

6653759

remove lint

6c3aae7

add offset

40c62d1

update wf

7edf523

picnixz added type:tests type:performance labels Feb 10, 2024

picnixz requested review from jayaddison and AA-Turner February 10, 2024 20:04

picnixz mentioned this pull request Feb 10, 2024

tests: cleanup: remove unused _setup_intl autouse fixture #11954

Closed