Skip to content

Issues: EleutherAI/lm-evaluation-harness

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Assignee
Filter by who’s assigned
Sort

Issues list

Implementing lessons from OLMES
#2002 opened Jun 20, 2024 by lintangsutawika
TypeError of scrolls_narrativeqa
#1891 opened May 27, 2024 by hicleo
Duplicate sample entries
#2025 opened Jun 26, 2024 by baberabb
Add more math evaluation tasks
#1869 opened May 22, 2024 by jordane95
Empty --log_samples outputs
#2115 opened Jul 19, 2024 by IsraelAbebe
Long time testing Qwen2-72B bug Something isn't working.
#1984 opened Jun 18, 2024 by djstrong
coqa not working bug Something isn't working.
#1529 opened Mar 5, 2024 by lchu-ibm
ProTip! Follow long discussions with comments:>50.