Issues: EleutherAI/lm-evaluation-harness

Issues list

[FEATURE REQUEST] Can we have HumanEval+ benchmark? (labels: feature request)
#1091 by hahuyhoang411 was closed Dec 18, 2023
Tasks on code evaluation
#1282 by yifan-bao was closed Jan 18, 2024
Different score when using accelerate (labels: bug)
#1293 by lintangsutawika was closed Jan 25, 2024
Prompt Templating
#896 by sachith-surge was closed Oct 10, 2023
[big refactor] mmlu is needed
#875 by xiaol was closed Oct 10, 2023
dataset path (labels: asking questions)
#1659 by ghost was closed Apr 2, 2024
Implement the English Grammar Correction evaluation (labels: feature request, good first issue)
#28 by StellaAthena was closed Nov 21, 2022
2 tasks
Implement the News Article Generation evaluation (labels: feature request)
#29 by StellaAthena was closed Nov 21, 2022
2 tasks
Implement the QuAC evaluation (labels: feature request, good first issue)
#18 by StellaAthena was closed Nov 14, 2023
1 of 2 tasks
Implement the CoQA evaluation (labels: feature request, good first issue)
#17 by StellaAthena was closed Feb 14, 2021
1 of 2 tasks
Implement the OpenBookQA evaluation (labels: feature request, good first issue)
#16 by StellaAthena was closed Feb 9, 2021
2 tasks done
Implement the ARC Challenge evaluation (labels: feature request, good first issue)
#15 by StellaAthena was closed Feb 5, 2021
2 tasks done
Implement the WebQuestions evaluation (labels: feature request, good first issue)
#10 by StellaAthena was closed Feb 8, 2021
1 of 2 tasks
Implement the Natural Questions evaluation (labels: feature request, good first issue)
#9 by StellaAthena was closed Aug 21, 2023
1 of 2 tasks