-
Notifications
You must be signed in to change notification settings - Fork 1.5k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Implement the StoryCloze evaluation #8
Labels
Projects
Comments
Form to get access is here: |
I have the download links and can provide them to anyone who DM’s me on Discord. |
StellaAthena
added
Eval Set
and removed
feature request
A feature that isn't implemented yet.
labels
Oct 23, 2020
Implementing Evaluations
automation
moved this from In progress
to Data integrated, Eval not done
Oct 24, 2020
StellaAthena
added
feature request
A feature that isn't implemented yet.
good first issue
Good for newcomers
labels
Jan 5, 2021
leogao2
moved this from To do, Evaluations to Implement
to Deferred
in Implementing Evaluations
Feb 8, 2021
leogao2
moved this from Deferred
to To do, Evaluations to Implement
in Implementing Evaluations
Feb 11, 2021
leogao2
moved this from To do, Evaluations to Implement
to Deferred
in Implementing Evaluations
Jun 12, 2021
Implemented in #300 |
qmdnls
pushed a commit
to qmdnls/lm-evaluation-harness
that referenced
this issue
Aug 17, 2023
Pytest update
LZY-the-boys
pushed a commit
to LZY-the-boys/lm-evaluation-harness-fast
that referenced
this issue
Sep 12, 2023
Pytest update
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Labels
From the GPT-3 paper
The evaluation code should be modeled after the interface in
lm_eval/base.py
and the example of theBoolQ
task inlm_eval/tasks/suerglue.py
The text was updated successfully, but these errors were encountered: