Issues: EleutherAI/lm-evaluation-harness

Issues list

Double check all of the zero/few-shot formats (label: documentation)
#34 by leogao2 was closed Jan 4, 2021 updated Jan 4, 2021
Implement all GLUE evaluations
#92 by leogao2 was closed Jan 28, 2021 updated Jan 28, 2021
Implement the PubMedQA Evaluation
#125 by leogao2 was closed Feb 6, 2021 updated Feb 6, 2021
Implement the SciQ evaluation
#118 by leogao2 was closed Feb 6, 2021 updated Feb 6, 2021
Implement the QA4MRE evaluation
#113 by leogao2 was closed Feb 12, 2021 updated Feb 12, 2021
Implement GLUE STSB (label: feature request)
#93 by leogao2 was closed Feb 8, 2021 updated Feb 13, 2021
Implement the HeadQA evaluation
#127 by leogao2 was closed Feb 13, 2021 updated Feb 13, 2021
Implement the MathQA evaluation
#132 by leogao2 was closed Feb 13, 2021 updated Feb 13, 2021
Implement the ETHICS evaluation
#121 by leogao2 was closed Mar 7, 2021 updated Mar 7, 2021
Implement LogiQA
#160 by leogao2 was closed Mar 10, 2021 updated Mar 10, 2021
MATH
#164 by leogao2 was closed Mar 25, 2021 updated Mar 25, 2021
Implement the HendrycksTest evaluation
#120 by leogao2 was closed Mar 26, 2021 updated Mar 26, 2021
Implement the Children's Book Test evaluation
#134 by leogao2 was closed Apr 14, 2021 updated Apr 14, 2021
add versioning
#186 by leogao2 was closed Jun 5, 2021 updated Jun 5, 2021
versioning test
#189 by leogao2 was closed Jun 12, 2021 updated Jun 12, 2021
Write unit tests for GPT3 LM
#149 by leogao2 was closed Jun 12, 2021 updated Jun 12, 2021
Implement the ASDiv Evaluation (label: feature request)
#190 by leogao2 was closed Jan 4, 2022 updated Jan 4, 2022
fix descriptions thing
#220 by leogao2 was closed Feb 13, 2022 updated Feb 13, 2022
Implement the QASPER evaluation (label: feature request)
#184 by leogao2 was closed Feb 22, 2022 updated Feb 22, 2022
convert all downloads to use best_download
#173 by leogao2 was closed Apr 1, 2022 updated Apr 1, 2022
MuTual scores not matching with values in paper
#205 by leogao2 was closed Nov 21, 2022 updated Nov 21, 2022
Implement GPT3 greedy_until (label: feature request)
#96 by leogao2 was closed Nov 21, 2022 updated Nov 21, 2022
Implement GPT2 greedy_until (label: feature request)
#95 by leogao2 was closed Nov 21, 2022 updated Nov 21, 2022
Make all of the natural language prompts
#131 by leogao2 was closed Nov 21, 2022 updated Nov 21, 2022
Implement GPT3 logprobs for new framework (label: feature request)
#94 by leogao2 was closed Nov 21, 2022 updated Nov 21, 2022