Issues: EleutherAI/lm-evaluation-harness

Issues list

Implement the English Grammar Correction evaluation [feature request] [good first issue]
#28 by StellaAthena was closed Nov 21, 2022
2 tasks
Implement the News Article Generation evaluation [feature request]
#29 by StellaAthena was closed Nov 21, 2022
2 tasks
Implement the Novel Word evaluation [feature request] [good first issue]
#30 by StellaAthena was closed Nov 21, 2022
2 tasks
Support richer example-packing functionality. [feature request]
#31 by zphang was closed Jan 4, 2021
Support writing out predictions [feature request]
#32 by zphang was closed Jan 4, 2021
Double check all of the zero/few-shot formats [documentation]
#34 by leogao2 was closed Jan 4, 2021
RACE: nlp -> datasets [bug]
#44 by cfoster0 was closed Oct 22, 2020
Implement Semantic Search evaluation [feature request]
#58 by cauefcr was closed Nov 21, 2022
Make the eval_harness talk to the server [feature request]
#62 by StellaAthena was closed Jan 4, 2021
Implement all GLUE evaluations
#92 by leogao2 was closed Jan 28, 2021
Implement the QA4MRE evaluation
#113 by leogao2 was closed Feb 12, 2021
Implement the BioASQ evaluation
#114 by leogao2 was closed Nov 21, 2022
Implement the BioREAD evaluation
#116 by leogao2 was closed Nov 21, 2022
Implement the BMKC evaluation
#117 by leogao2 was closed Nov 21, 2022
Implement the SciQ evaluation
#118 by leogao2 was closed Feb 6, 2021
Implement the HendrycksTest evaluation
#120 by leogao2 was closed Mar 26, 2021
Implement the GenericsKB evaluation [feature request]
#122 by leogao2 was closed Nov 21, 2022
Implement the PubMedQA Evaluation
#125 by leogao2 was closed Feb 6, 2021
Implement the HeadQA evaluation
#127 by leogao2 was closed Feb 13, 2021
Write unit tests for janitor
#148 by leogao2 was closed Nov 21, 2022
Write unit tests for GPT3 LM
#149 by leogao2 was closed Jun 12, 2021
Implement LogiQA
#160 by leogao2 was closed Mar 10, 2021
MATH
#164 by leogao2 was closed Mar 25, 2021