Skip to content

Issues: EleutherAI/lm-evaluation-harness

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Assignee
Filter by who’s assigned
Sort

Issues list

Add Logits to OpenAI ChatCompletions model declined A proposed dataset or feature request that will not be implemented. feature request A feature that isn't implemented yet. help wanted Contributors and extra help welcome.
#1196 by haileyschoelkopf was closed May 23, 2024
Support wrapping prompts with a given Chat Template feature request A feature that isn't implemented yet. help wanted Contributors and extra help welcome. opinions wanted For discussing open questions.
#1098 by haileyschoelkopf was closed Jun 11, 2024 v0.4.3
pubmedqa task data fails to download
#312 by stas00 was closed May 11, 2022
Implement the Natural Questions evaluation feature request A feature that isn't implemented yet. good first issue Good for newcomers
#9 by StellaAthena was closed Aug 21, 2023
1 of 2 tasks
Support for ggml good first issue Good for newcomers help wanted Contributors and extra help welcome.
#417 by philwee was closed Nov 3, 2023
Add --predict_only mode (run without scoring outputs) feature request A feature that isn't implemented yet. help wanted Contributors and extra help welcome.
#1152 by haileyschoelkopf was closed Jan 31, 2024
Local dataset or model path support bug Something isn't working.
#1224 by ycsong1212 was closed Jan 2, 2024
Dummy perplexity on LAMBADA good first issue Good for newcomers help wanted Contributors and extra help welcome.
#350 by lostmsu was closed Nov 8, 2023
Bad results for LLaMA bug Something isn't working. good first issue Good for newcomers help wanted Contributors and extra help welcome.
#443 by juletx was closed Aug 8, 2023
GGUF Local Model bug Something isn't working.
#1254 by kolbeuk was closed Jan 8, 2024
Security features from the Hugging Face datasets library feature request A feature that isn't implemented yet. good first issue Good for newcomers help wanted Contributors and extra help welcome.
#1135 by lhoestq was closed Mar 3, 2024
Inverse Scaling Tasks? feature request A feature that isn't implemented yet. good first issue Good for newcomers help wanted Contributors and extra help welcome.
#1442 by RylanSchaeffer was closed Jul 3, 2024
Implement GPT-3 style contamination study feature request A feature that isn't implemented yet.
#231 by StellaAthena was closed Nov 1, 2023
RecursionError: maximum recursion depth exceeded bug Something isn't working.
#442 by philwee was closed Nov 8, 2023
Winogrande Performance Discrepency bug Something isn't working.
#1249 by lintangsutawika was closed Jan 8, 2024
ProTip! Type g i on any issue or pull request to go back to the issue listing page.