Skip to content

Issues: EleutherAI/lm-evaluation-harness

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Assignee
Filter by who’s assigned
Sort

Issues list

Implement the DROP evaluation feature request A feature that isn't implemented yet. good first issue Good for newcomers
#19 by StellaAthena was closed Mar 7, 2021
1 of 2 tasks
Implement the Penn Tree Bank evaluation feature request A feature that isn't implemented yet. good first issue Good for newcomers
#5 by StellaAthena was closed Mar 25, 2023
Implement the LAMBADA evaluation feature request A feature that isn't implemented yet.
#6 by StellaAthena was closed Jan 29, 2021
Implement the OpenBookQA evaluation feature request A feature that isn't implemented yet. good first issue Good for newcomers
#16 by StellaAthena was closed Feb 9, 2021
2 tasks done
Implement the WSC273 Winograd Schemas Challenge evaluation feature request A feature that isn't implemented yet. good first issue Good for newcomers
#12 by StellaAthena was closed Feb 3, 2021
2 tasks done
Implement the SQuAD evaluation feature request A feature that isn't implemented yet. good first issue Good for newcomers
#20 by StellaAthena was closed Mar 28, 2021
1 of 2 tasks
Implement the RACE evaluation feature request A feature that isn't implemented yet. good first issue Good for newcomers
#21 by StellaAthena was closed Jan 30, 2021
2 tasks done
Implement the HellaSwag evaluation feature request A feature that isn't implemented yet. good first issue Good for newcomers
#7 by StellaAthena was closed Feb 8, 2021
2 tasks done
Implement the StoryCloze evaluation feature request A feature that isn't implemented yet. good first issue Good for newcomers
#8 by StellaAthena was closed Apr 1, 2022
1 of 2 tasks
Implement the Natural Questions evaluation feature request A feature that isn't implemented yet. good first issue Good for newcomers
#9 by StellaAthena was closed Aug 21, 2023
1 of 2 tasks
Implement the CoQA evaluation feature request A feature that isn't implemented yet. good first issue Good for newcomers
#17 by StellaAthena was closed Feb 14, 2021
1 of 2 tasks
Implement the TriviaQA evaluation feature request A feature that isn't implemented yet. good first issue Good for newcomers
#11 by StellaAthena was closed Jan 30, 2021
2 tasks done
Implement the adversarially-mined Winogrande evaluation feature request A feature that isn't implemented yet. good first issue Good for newcomers
#13 by StellaAthena was closed Feb 3, 2021
2 tasks done
Implement the SAT evaluation feature request A feature that isn't implemented yet. good first issue Good for newcomers
#27 by StellaAthena was closed Jan 8, 2021
2 tasks done
Implement the Natural Language Inference (NLI) evaluation feature request A feature that isn't implemented yet. good first issue Good for newcomers
#23 by StellaAthena was closed Feb 12, 2021
1 of 2 tasks
New Evaluation: Math feature request A feature that isn't implemented yet. good first issue Good for newcomers
#77 by StellaAthena was closed Feb 25, 2022
2 tasks
RACE: nlp -> datasets bug Something isn't working.
#44 by cfoster0 was closed Oct 22, 2020
Implement WIkitext for GPT-2 replication feature request A feature that isn't implemented yet. good first issue Good for newcomers
#40 by anishthite was closed Jun 12, 2021
1 of 2 tasks
Implement the QASPER evaluation feature request A feature that isn't implemented yet.
#184 by leogao2 was closed Feb 22, 2022
Implement the ASDiv Evaluation feature request A feature that isn't implemented yet.
#190 by leogao2 was closed Jan 4, 2022
Multilingual StoryCloze feature request A feature that isn't implemented yet. good first issue Good for newcomers
#241 by StellaAthena was closed May 11, 2023
ProTip! Updated in the last three days: updated:>2024-07-04.