This repository has been archived by the owner on Jan 15, 2024. It is now read-only.

Checkpoint management #305

Closed
eric-haibin-lin opened this issue Aug 19, 2018 · 1 comment
Labels
enhancement New feature or request

Comments

@eric-haibin-lin
Member

The large word language model scripts produce checkpoints larger than 100 GB. We want to add logic that automatically keeps the most recent K checkpoints and removes older ones, so that the checkpoints don't exceed disk capacity.
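
As a rough illustration of the keep-the-last-K logic described above, here is a minimal sketch in plain Python. It is not taken from the scripts themselves: the function name `prune_checkpoints`, the `keep_last_k` parameter, and the `<prefix>-<epoch>.params` file naming are all assumptions made for the example.

```python
import os
import re


def prune_checkpoints(ckpt_dir, prefix, keep_last_k):
    """Delete all but the newest `keep_last_k` checkpoints in `ckpt_dir`.

    Hypothetical naming scheme: '<prefix>-<epoch>.params', where <epoch>
    is an integer. Files that do not match are left untouched.
    Returns the list of removed file paths.
    """
    if keep_last_k < 1:
        raise ValueError("keep_last_k must be at least 1")

    pattern = re.compile(r"^%s-(\d+)\.params$" % re.escape(prefix))
    found = []
    for name in os.listdir(ckpt_dir):
        match = pattern.match(name)
        if match:
            found.append((int(match.group(1)), os.path.join(ckpt_dir, name)))

    found.sort()  # oldest epoch first
    stale = found[:-keep_last_k]  # everything except the newest K

    removed = []
    for _, path in stale:
        os.remove(path)
        removed.append(path)
    return removed
```

Calling something like `prune_checkpoints("checkpoints", "lm", 3)` right after each save would bound disk usage at roughly K checkpoints, plus the one being written.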

@szha added the enhancement label Aug 21, 2018
@eric-haibin-lin
Member Author

We can use the checkpoint handler in apache/mxnet#14629 for new scripts.
