test(robot-server): Fix race condition in integration test #13707

SyntaxColoring · 2023-10-03T16:37:59Z

Overview

This fixes a test that was flaky because of a race condition.

We were waiting for a run to succeed by:

Polling until its status indicated "done."
Checking that the final status was succeeded.

But we would erroneously proceed from step 1 to step 2 when the run transitioned from running to finishing, and finishing is of course not succeeded, so the test would sometimes fail depending on timing.

Test Plan

Just make sure CI keeps passing.

Risk assessment

No risk.

CaseyBatten · 2023-10-05T18:13:21Z

Looks good to me, appropriate change to increase the scope of that check. Thinking on potential alternative cases, would it be beneficial to adjust the check to include "stop-requested" as a third valid pre-end state on this check? so this:

if latest_status not in {"running", "stop-requested", "finishing"}:
            return latest_status  # type: ignore[no-any-return]

SyntaxColoring · 2023-10-05T19:45:59Z

Looks good to me, appropriate change to increase the scope of that check. Thinking on potential alternative cases, would it be beneficial to adjust the check to include "stop-requested" as a third valid pre-end state on this check? so this:
if latest_status not in {"running", "stop-requested", "finishing"}:
            return latest_status  # type: ignore[no-any-return]

That would be more appropriate, yes.

And there's a very similar polling loop in a different test file:

opentrons/robot-server/tests/integration/http_api/runs/test_labware_offsets_on_compatible_modules.py

Lines 35 to 49 in cd064e6

 async def poll_until_run_succeeds(robot_client: RobotClient, run_id: str) -> Any: 

 """Wait until a run completes, and then assert that it succeeded. 

  Return the completed run response. 

  """ 

 completed_run_statuses = {"stopped", "failed", "succeeded"} 

 while True: 

 run = (await robot_client.get_run(run_id=run_id)).json() 

 status = run["data"]["status"] 

 if status in completed_run_statuses: 

 assert status == "succeeded" 

 return run 

 else: 

 # The run is still ongoing. Wait a beat, then poll again. 

 await asyncio.sleep(RUN_POLL_INTERVAL)

That one takes the approach of listing out the "done" cases, instead of the "not done" cases.

This should be a shared function somewhere. I will do that in this PR.

Edit: Done.

SyntaxColoring · 2023-10-05T20:55:05Z

Got a 👍 from @CaseyBatten in person.

"Finishing" counts as "running."

2187457

SyntaxColoring force-pushed the fix_poll_until_not_running branch from 118d216 to 2187457 Compare October 3, 2023 16:38

SyntaxColoring requested a review from a team October 3, 2023 16:40

SyntaxColoring marked this pull request as ready for review October 3, 2023 16:40

SyntaxColoring requested a review from a team as a code owner October 3, 2023 16:40

CaseyBatten approved these changes Oct 5, 2023

View reviewed changes

SyntaxColoring marked this pull request as draft October 5, 2023 19:46

SyntaxColoring added 2 commits October 5, 2023 16:31

Centralize the logic to pull until the run completes.

d5369e3

Mark STARTUP_WAIT and SHUTDOWN_WAIT as private.

d26f810

SyntaxColoring marked this pull request as ready for review October 5, 2023 20:32

Revert accidental formatting change.

901dd14

SyntaxColoring merged commit 6e19c20 into edge Oct 5, 2023
7 checks passed

SyntaxColoring deleted the fix_poll_until_not_running branch October 5, 2023 20:55

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

test(robot-server): Fix race condition in integration test #13707

test(robot-server): Fix race condition in integration test #13707

SyntaxColoring commented Oct 3, 2023 •

edited

Loading

CaseyBatten commented Oct 5, 2023

SyntaxColoring commented Oct 5, 2023 •

edited

Loading

SyntaxColoring commented Oct 5, 2023

test(robot-server): Fix race condition in integration test #13707

test(robot-server): Fix race condition in integration test #13707

Conversation

SyntaxColoring commented Oct 3, 2023 • edited Loading

Overview

Test Plan

Risk assessment

CaseyBatten commented Oct 5, 2023

SyntaxColoring commented Oct 5, 2023 • edited Loading

SyntaxColoring commented Oct 5, 2023

SyntaxColoring commented Oct 3, 2023 •

edited

Loading

SyntaxColoring commented Oct 5, 2023 •

edited

Loading