Skip to content

Issues: EleutherAI/lm-evaluation-harness

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Assignee
Filter by who’s assigned
Sort

Issues list

Implement GPT2 greedy_until feature request A feature that isn't implemented yet.
#95 by leogao2 was closed Nov 21, 2022
Implement the QuAC evaluation feature request A feature that isn't implemented yet. good first issue Good for newcomers
#18 by StellaAthena was closed Nov 14, 2023
1 of 2 tasks
Implement the Natural Language Inference (NLI) evaluation feature request A feature that isn't implemented yet. good first issue Good for newcomers
#23 by StellaAthena was closed Feb 12, 2021
1 of 2 tasks
Implement the English Grammar Correction evaluation feature request A feature that isn't implemented yet. good first issue Good for newcomers
#28 by StellaAthena was closed Nov 21, 2022
2 tasks
Implement the News Article Generation evaluation feature request A feature that isn't implemented yet.
#29 by StellaAthena was closed Nov 21, 2022
2 tasks
Implement the Novel Word evaluation feature request A feature that isn't implemented yet. good first issue Good for newcomers
#30 by StellaAthena was closed Nov 21, 2022
2 tasks
Support richer example-packing functionality. feature request A feature that isn't implemented yet.
#31 by zphang was closed Jan 4, 2021
Support writing out predictions feature request A feature that isn't implemented yet.
#32 by zphang was closed Jan 4, 2021
Double check all of the zero/few-shot formats documentation Improvements or additions to documentation.
#34 by leogao2 was closed Jan 4, 2021
Implement Semantic Search evaluation feature request A feature that isn't implemented yet.
#58 by cauefcr was closed Nov 21, 2022
Implement all GLUE evaluations
#92 by leogao2 was closed Jan 28, 2021
Implement GLUE STSB feature request A feature that isn't implemented yet.
#93 by leogao2 was closed Feb 8, 2021
Implement the StoryCloze evaluation feature request A feature that isn't implemented yet. good first issue Good for newcomers
#8 by StellaAthena was closed Apr 1, 2022
1 of 2 tasks
Implement GPT3 logprobs for new framework feature request A feature that isn't implemented yet.
#94 by leogao2 was closed Nov 21, 2022
Implement GPT3 greedy_until feature request A feature that isn't implemented yet.
#96 by leogao2 was closed Nov 21, 2022
Implement a shuffle option in main.py
#102 by leogao2 was closed Nov 21, 2022
Implement the BioASQ evaluation
#114 by leogao2 was closed Nov 21, 2022
Implement the BioREAD evaluation
#116 by leogao2 was closed Nov 21, 2022
Implement the BMKC evaluation
#117 by leogao2 was closed Nov 21, 2022
Implement the HendrycksTest evaluation
#120 by leogao2 was closed Mar 26, 2021
Make all of the natural language prompts
#131 by leogao2 was closed Nov 21, 2022
Implement the EpicQA Evaluation feature request A feature that isn't implemented yet.
#137 by nicholaskross was closed Nov 21, 2022
Write unit tests for GPT3 LM
#149 by leogao2 was closed Jun 12, 2021
MATH
#164 by leogao2 was closed Mar 25, 2021
vllm backend faild
#2028 by chunniunai220ml was closed Jun 27, 2024
ProTip! Follow long discussions with comments:>50.