[deflakey] Fix the Redis failure detection and deflakey the test #37259

Merged
merged 4 commits into from
Jul 11, 2023

Conversation

Contributor

@fishbone fishbone commented Jul 10, 2023

Signed-off-by: Yi Cheng [email protected]

Why are these changes needed?

The test is flaky for several reasons:

  • The Redis cluster is not ready
  • The Redis recovery has not finished
  • GCS takes longer to exit than expected

This PR fixes these issues. It also fixes a corner case where the callback is none.
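
For readers unfamiliar with this class of flakiness, here is a minimal sketch of the two ideas in Python. It is not the code from this PR. `wait_until` and `run_callback` are hypothetical helper names used only for illustration. The first polls a readiness condition (for example "the Redis cluster is ready" or "GCS has exited") against a deadline instead of sleeping for a fixed time; the second skips a callback that was never provided.

```python
import time
from typing import Callable, Optional


def wait_until(predicate: Callable[[], bool], timeout_s: float = 10.0,
               interval_s: float = 0.1) -> bool:
    """Poll `predicate` until it returns True or `timeout_s` elapses.

    Hypothetical helper: the predicate stands in for any readiness check.
    """
    deadline = time.monotonic() + timeout_s
    while True:
        if predicate():
            return True
        if time.monotonic() >= deadline:
            return False
        time.sleep(interval_s)


def run_callback(callback: Optional[Callable[[str], None]], reply: str) -> bool:
    """Invoke `callback` with `reply` if one was given; report whether it ran.

    Hypothetical helper: guards the case where no callback was supplied.
    """
    if callback is None:
        return False
    callback(reply)
    return True
```

A test would then assert on the return value, e.g. `assert wait_until(cluster_is_ready, timeout_s=30)`, where `cluster_is_ready` is whatever check the test defines. This way a slow start fails with a clear timeout rather than an unrelated error later on.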

Related issue number

Closes #36678

Checks

  • I've signed off every commit(by using the -s flag, i.e., git commit -s) in this PR.
  • I've run scripts/format.sh to lint the changes in this PR.
  • I've included any doc changes needed for https://docs.ray.io/en/master/.
    • I've added any new APIs to the API Reference. For example, if I added a
      method in Tune, I've added it in doc/source/tune/api/ under the
      corresponding .rst file.
  • I've made sure the tests are passing. Note that there might be a few flaky tests, see the recent failures at https://flakey-tests.ray.io/
  • Testing Strategy
    • Unit tests
    • Release tests
    • This PR is not tested :(

Signed-off-by: Yi Cheng <[email protected]>
@fishbone fishbone marked this pull request as ready for review July 10, 2023 20:12
@fishbone fishbone requested a review from a team as a code owner July 10, 2023 20:12
@fishbone fishbone merged commit 4849c55 into ray-project:master Jul 11, 2023
114 of 121 checks passed
Bhav00 pushed a commit to Bhav00/ray that referenced this pull request Jul 11, 2023
Signed-off-by: Bhavpreet Singh <[email protected]>
SongGuyang pushed a commit to alipay/ant-ray that referenced this pull request Jul 12, 2023
Signed-off-by: 久龙 <[email protected]>
Bhav00 pushed a commit to Bhav00/ray that referenced this pull request Jul 24, 2023
Signed-off-by: Bhavpreet Singh <[email protected]>
NripeshN pushed a commit to NripeshN/ray that referenced this pull request Aug 15, 2023
Signed-off-by: NripeshN <[email protected]>
harborn pushed a commit to harborn/ray that referenced this pull request Aug 17, 2023
Signed-off-by: harborn <[email protected]>
harborn pushed a commit to harborn/ray that referenced this pull request Aug 17, 2023
arvind-chandra pushed a commit to lmco/ray that referenced this pull request Aug 31, 2023
Signed-off-by: e428265 <[email protected]>
Development

Successfully merging this pull request may close these issues.

[CI] linux://python/ray/tests:test_gcs_fault_tolerance is failing/flaky on master.
4 participants