-
Notifications
You must be signed in to change notification settings - Fork 1k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Run eval harness during training #367
Conversation
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
It looks like it's locking on piqa. We need to check out the HF dataset builder internals and make sure it's thread safe.
Notes on this PR's behavior:
|
mathqa + piqa has been running for over 8 hours without any problems. I’m wondering if the core problem is about mixing HF and non-HF tasks |
Hey @StellaAthena can you test out the above change, and let me know if it makes a difference? |
It's on today's TODO list :) |
Just some peace of mind for @StellaAthena that this is definitely working :) |
addresses #366
The generation tasks aren't that fast - so best to stick to the log likelihood tasks (you can add them to the yaml with the "eval_tasks" parameter.
e.g