-
Notifications
You must be signed in to change notification settings - Fork 1.6k
Issues: EleutherAI/lm-evaluation-harness
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Author
Label
Projects
Milestones
Assignee
Sort
Issues list
Support richer example-packing functionality.
feature request
A feature that isn't implemented yet.
#31
by zphang
was closed Jan 4, 2021
Support writing out predictions
feature request
A feature that isn't implemented yet.
#32
by zphang
was closed Jan 4, 2021
Implement the DROP evaluation
feature request
A feature that isn't implemented yet.
good first issue
Good for newcomers
#19
by StellaAthena
was closed Mar 7, 2021
1 of 2 tasks
Implement the Penn Tree Bank evaluation
feature request
A feature that isn't implemented yet.
good first issue
Good for newcomers
#5
by StellaAthena
was closed Mar 25, 2023
Implement the LAMBADA evaluation
feature request
A feature that isn't implemented yet.
#6
by StellaAthena
was closed Jan 29, 2021
Implement the OpenBookQA evaluation
feature request
A feature that isn't implemented yet.
good first issue
Good for newcomers
#16
by StellaAthena
was closed Feb 9, 2021
2 tasks done
Implement the Adversarial Natural Language Inference (ANLI) evaluation
feature request
A feature that isn't implemented yet.
#24
by StellaAthena
was closed Jan 30, 2021
1 of 2 tasks
Implement the WSC273 Winograd Schemas Challenge evaluation
feature request
A feature that isn't implemented yet.
good first issue
Good for newcomers
#12
by StellaAthena
was closed Feb 3, 2021
2 tasks done
Implement the SQuAD evaluation
feature request
A feature that isn't implemented yet.
good first issue
Good for newcomers
#20
by StellaAthena
was closed Mar 28, 2021
1 of 2 tasks
Implement the RACE evaluation
feature request
A feature that isn't implemented yet.
good first issue
Good for newcomers
#21
by StellaAthena
was closed Jan 30, 2021
2 tasks done
Implement the HellaSwag evaluation
feature request
A feature that isn't implemented yet.
good first issue
Good for newcomers
#7
by StellaAthena
was closed Feb 8, 2021
2 tasks done
Implement the StoryCloze evaluation
feature request
A feature that isn't implemented yet.
good first issue
Good for newcomers
#8
by StellaAthena
was closed Apr 1, 2022
1 of 2 tasks
Implement the symbolic manipulations evaluation
feature request
A feature that isn't implemented yet.
good first issue
Good for newcomers
#26
by StellaAthena
was closed Feb 26, 2021
2 tasks done
Implement the Natural Questions evaluation
feature request
A feature that isn't implemented yet.
good first issue
Good for newcomers
#9
by StellaAthena
was closed Aug 21, 2023
1 of 2 tasks
Implement the WebQuestions evaluation
feature request
A feature that isn't implemented yet.
good first issue
Good for newcomers
#10
by StellaAthena
was closed Feb 8, 2021
1 of 2 tasks
Implement the CoQA evaluation
feature request
A feature that isn't implemented yet.
good first issue
Good for newcomers
#17
by StellaAthena
was closed Feb 14, 2021
1 of 2 tasks
Implement the TriviaQA evaluation
feature request
A feature that isn't implemented yet.
good first issue
Good for newcomers
#11
by StellaAthena
was closed Jan 30, 2021
2 tasks done
Implement the ARC Challenge evaluation
feature request
A feature that isn't implemented yet.
good first issue
Good for newcomers
#15
by StellaAthena
was closed Feb 5, 2021
2 tasks done
Implement arithmetic evaluations
feature request
A feature that isn't implemented yet.
good first issue
Good for newcomers
#25
by StellaAthena
was closed Jan 28, 2021
2 tasks done
Implement the Novel Word evaluation
feature request
A feature that isn't implemented yet.
good first issue
Good for newcomers
#30
by StellaAthena
was closed Nov 21, 2022
2 tasks
Previous Next
ProTip!
Type g p on any issue or pull request to go back to the pull request listing page.