Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[ASoC 2022] Enable data caching cross jobs to boost job performance with high memory efficiency #252

Open
yhalpha opened this issue May 30, 2022 · 0 comments
Assignees
Labels
asoc2022 Alibaba Summer of Code, 2022 community Community discussions enhancement New feature or request

Comments

@yhalpha
Copy link
Collaborator

yhalpha commented May 30, 2022

What would you like to be added:

  1. Refactor the caching API to support inter-job caching, which means the lifecycle of datasets should be independent of training jobs.
  2. Implement a caching policy that interacts with the distributed cache runtime to retain popular datasets in memory, such that the cache efficiency is maximized.

Why is this needed:
Caching datasets in memory of the local cluster helps to accelerate the training jobs. Typically, popular and public datasets might be used by multiple jobs. Therefore, it helps improve the caching efficiency to make datasets sharable across training jobs with a well-designed caching policy.

@SimonCqk SimonCqk added enhancement New feature or request asoc2022 Alibaba Summer of Code, 2022 community Community discussions labels May 30, 2022
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
asoc2022 Alibaba Summer of Code, 2022 community Community discussions enhancement New feature or request
Projects
None yet
Development

No branches or pull requests

3 participants