Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[FLINK-8887][tests] Add single retry in MiniClusterClient #5657

Closed
wants to merge 2 commits into from

Conversation

zentol
Copy link
Contributor

@zentol zentol commented Mar 7, 2018

What is the purpose of the change

This PR presents a test workaround for race-conditions in FLIP-6 (most notably FLINK-8887). Basically, every MiniClusterClient call is retried once after 500ms in case of certain exceptions.

This is only a band-aid until a proper fix is in place so we can finally continue merging more test ports.

Brief change log

  • add guardWithSingleRetry convenience method
  • add ScheduledExecutor to MiniClusterClient
  • guard all calls to the MiniCluster

Verifying this change

The change can be verified by cherry-picking this branch and running the AbstractOperatorRestoreTestBase. Before this change there was always 1-2 tests failing, whereas now none should fail.

/cc @aljoscha @GJL

@aljoscha
Copy link
Contributor

aljoscha commented Mar 7, 2018

+1 I think this makes sense if it unblocks merging test ports to master. I'm guessing we'll leave FLINK-8887 open until we have the proper fix?

zentol added a commit to zentol/flink that referenced this pull request Mar 8, 2018
@zentol
Copy link
Contributor Author

zentol commented Mar 8, 2018

merging.

asfgit pushed a commit that referenced this pull request Mar 9, 2018
@asfgit asfgit closed this in 94e959f Mar 9, 2018
@zentol zentol deleted the 8887_bandaid branch March 9, 2018 09:17
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
3 participants