-
Notifications
You must be signed in to change notification settings - Fork 1.6k
Issues: EleutherAI/lm-evaluation-harness
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Author
Label
Projects
Milestones
Assignee
Sort
Issues list
Add Logits to OpenAI ChatCompletions model
declined
A proposed dataset or feature request that will not be implemented.
feature request
A feature that isn't implemented yet.
help wanted
Contributors and extra help welcome.
#1196
by haileyschoelkopf
was closed May 23, 2024
Support wrapping prompts with a given Chat Template
feature request
A feature that isn't implemented yet.
help wanted
Contributors and extra help welcome.
opinions wanted
For discussing open questions.
Implement the Natural Questions evaluation
feature request
A feature that isn't implemented yet.
good first issue
Good for newcomers
#9
by StellaAthena
was closed Aug 21, 2023
1 of 2 tasks
Support for ggml
good first issue
Good for newcomers
help wanted
Contributors and extra help welcome.
#417
by philwee
was closed Nov 3, 2023
FileNotFoundError: Couldn't find a module script at exact_match.py. Module 'exact_match' doesn't exist on the Hugging Face Hub either.
bug
Something isn't working.
#1071
by xinghuang2050
was closed Jul 1, 2024
Add A feature that isn't implemented yet.
help wanted
Contributors and extra help welcome.
--predict_only
mode (run without scoring outputs)
feature request
#1152
by haileyschoelkopf
was closed Jan 31, 2024
Local dataset or model path support
bug
Something isn't working.
#1224
by ycsong1212
was closed Jan 2, 2024
I get this error whenever I try to run an eval: ImportError: cannot import name 'HfApi' from 'huggingface_hub'
#1826
by menhguin
was closed May 26, 2024
Revert PR 497 for MMLU/hendrycksTest to be compatible with Open LLM Leaderboard
#614
by taoari
was closed Nov 8, 2023
Dummy perplexity on LAMBADA
good first issue
Good for newcomers
help wanted
Contributors and extra help welcome.
#350
by lostmsu
was closed Nov 8, 2023
KeyError: 'Cache only has 0 layers, attempted to access layer with index 0'
bug
Something isn't working.
#1250
by kirayomato
was closed Jan 31, 2024
Bad results for LLaMA
bug
Something isn't working.
good first issue
Good for newcomers
help wanted
Contributors and extra help welcome.
#443
by juletx
was closed Aug 8, 2023
Security features from the Hugging Face datasets library
feature request
A feature that isn't implemented yet.
good first issue
Good for newcomers
help wanted
Contributors and extra help welcome.
#1135
by lhoestq
was closed Mar 3, 2024
Inverse Scaling Tasks?
feature request
A feature that isn't implemented yet.
good first issue
Good for newcomers
help wanted
Contributors and extra help welcome.
#1442
by RylanSchaeffer
was closed Jul 3, 2024
Implement GPT-3 style contamination study
feature request
A feature that isn't implemented yet.
#231
by StellaAthena
was closed Nov 1, 2023
RecursionError: maximum recursion depth exceeded
bug
Something isn't working.
#442
by philwee
was closed Nov 8, 2023
Winogrande Performance Discrepency
bug
Something isn't working.
#1249
by lintangsutawika
was closed Jan 8, 2024
Previous Next
ProTip!
Type g i on any issue or pull request to go back to the issue listing page.