Issues: EleutherAI/lm-evaluation-harness
Implement the DROP evaluation
feature request
A feature that isn't implemented yet.
good first issue
Good for newcomers
#19
by StellaAthena
was closed Mar 7, 2021
1 of 2 tasks
Implement the Penn Tree Bank evaluation
feature request
A feature that isn't implemented yet.
good first issue
Good for newcomers
#5
by StellaAthena
was closed Mar 25, 2023
Implement the LAMBADA evaluation
feature request
A feature that isn't implemented yet.
#6
by StellaAthena
was closed Jan 29, 2021
Implement the OpenBookQA evaluation
feature request
A feature that isn't implemented yet.
good first issue
Good for newcomers
#16
by StellaAthena
was closed Feb 9, 2021
2 tasks done
Implement the WSC273 Winograd Schemas Challenge evaluation
feature request
A feature that isn't implemented yet.
good first issue
Good for newcomers
#12
by StellaAthena
was closed Feb 3, 2021
2 tasks done
Implement the SQuAD evaluation
feature request
A feature that isn't implemented yet.
good first issue
Good for newcomers
#20
by StellaAthena
was closed Mar 28, 2021
1 of 2 tasks
Implement the RACE evaluation
feature request
A feature that isn't implemented yet.
good first issue
Good for newcomers
#21
by StellaAthena
was closed Jan 30, 2021
2 tasks done
Implement the HellaSwag evaluation
feature request
A feature that isn't implemented yet.
good first issue
Good for newcomers
#7
by StellaAthena
was closed Feb 8, 2021
2 tasks done
Implement the StoryCloze evaluation
feature request
A feature that isn't implemented yet.
good first issue
Good for newcomers
#8
by StellaAthena
was closed Apr 1, 2022
1 of 2 tasks
Implement the Natural Questions evaluation
feature request
A feature that isn't implemented yet.
good first issue
Good for newcomers
#9
by StellaAthena
was closed Aug 21, 2023
1 of 2 tasks
Implement the CoQA evaluation
feature request
A feature that isn't implemented yet.
good first issue
Good for newcomers
#17
by StellaAthena
was closed Feb 14, 2021
1 of 2 tasks
Implement the TriviaQA evaluation
feature request
A feature that isn't implemented yet.
good first issue
Good for newcomers
#11
by StellaAthena
was closed Jan 30, 2021
2 tasks done
Implement the adversarially-mined Winogrande evaluation
feature request
A feature that isn't implemented yet.
good first issue
Good for newcomers
#13
by StellaAthena
was closed Feb 3, 2021
2 tasks done
Implement the SAT evaluation
feature request
A feature that isn't implemented yet.
good first issue
Good for newcomers
#27
by StellaAthena
was closed Jan 8, 2021
2 tasks done
Implement the Natural Language Inference (NLI) evaluation
feature request
A feature that isn't implemented yet.
good first issue
Good for newcomers
#23
by StellaAthena
was closed Feb 12, 2021
1 of 2 tasks
New Evaluation: Math
feature request
A feature that isn't implemented yet.
good first issue
Good for newcomers
#77
by StellaAthena
was closed Feb 25, 2022
2 tasks
Implement WikiText for GPT-2 replication
feature request
A feature that isn't implemented yet.
good first issue
Good for newcomers
#40
by anishthite
was closed Jun 12, 2021
1 of 2 tasks
Add flag to allow the evaluations to be carried out on a subset of the eval tasks
feature request
A feature that isn't implemented yet.
#60
by StellaAthena
was closed Nov 23, 2020
Implement the QASPER evaluation
feature request
A feature that isn't implemented yet.
#184
by leogao2
was closed Feb 22, 2022
Implement the ASDiv Evaluation
feature request
A feature that isn't implemented yet.
#190
by leogao2
was closed Jan 4, 2022
BUG: TypeError: create_from_arg_string() takes 2 positional arguments but 3 were given
#255
by mrseeker
was closed Feb 12, 2022
Multilingual StoryCloze
feature request
A feature that isn't implemented yet.
good first issue
Good for newcomers
#241
by StellaAthena
was closed May 11, 2023