[ASoC 2022] Enable data caching cross jobs to boost job performance with high memory efficiency #252

yhalpha · 2022-05-30T04:33:25Z

What would you like to be added:

Refactor the caching API to support inter-job caching, which means the lifecycle of datasets should be independent of training jobs.
Implement a caching policy that interacts with the distributed cache runtime to retain popular datasets in memory, such that the cache efficiency is maximized.

Why is this needed:
Caching datasets in memory of the local cluster helps to accelerate the training jobs. Typically, popular and public datasets might be used by multiple jobs. Therefore, it helps improve the caching efficiency to make datasets sharable across training jobs with a well-designed caching policy.

yhalpha assigned jian-he May 30, 2022

SimonCqk added enhancement New feature or request asoc2022 Alibaba Summer of Code, 2022 community Community discussions labels May 30, 2022

SimonCqk mentioned this issue May 30, 2022

🧑‍💻 🏕 Alibaba Summer of Code (ASOC) 2022 #249

Open

zjchenn mentioned this issue Jul 29, 2022

feat: enable data caching cross jobs #263

Merged

3 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[ASoC 2022] Enable data caching cross jobs to boost job performance with high memory efficiency #252

[ASoC 2022] Enable data caching cross jobs to boost job performance with high memory efficiency #252

yhalpha commented May 30, 2022

[ASoC 2022] Enable data caching cross jobs to boost job performance with high memory efficiency #252

[ASoC 2022] Enable data caching cross jobs to boost job performance with high memory efficiency #252

Comments

yhalpha commented May 30, 2022