Issues: EleutherAI/lm-evaluation-harness
Issues list
Module 'exact_match' doesn't exist on the Hugging Face Hub either.
#1697
by BBaekdabang
was closed Jul 1, 2024
not getting same accuracy as on the leaderboard when evaluating locally.
#2091
by sorobedio
was closed Jul 13, 2024
Implement arithmetic evaluations
feature request
A feature that isn't implemented yet.
good first issue
Good for newcomers
#25
by StellaAthena
was closed Jan 28, 2021
2 tasks done
New Evaluation: Math
feature request
A feature that isn't implemented yet.
good first issue
Good for newcomers
#77
by StellaAthena
was closed Feb 25, 2022
2 tasks
Failure to evaluate facebook/opt-350m model
bug
Something isn't working.
good first issue
Good for newcomers
#368
by underactuated
was closed Mar 9, 2023
Validate TriviaQA
good first issue
Good for newcomers
validation
For validation of task implementations.
#456
by StellaAthena
was closed Jun 14, 2023
AutoGPTQ not working with HuggingFace accelerate (multi GPU)
bug
Something isn't working.
#1247
by JeevanBhoot
was closed Jan 15, 2024
dataset path
asking questions
For asking for clarification / support on library usage.
#1659
by ghost
was closed Apr 2, 2024
Out-Of-Memory Error for same batch size but different dataset
#1811
by richardzhuang0412
was closed May 9, 2024
New Evaluation: Biology
feature request
A feature that isn't implemented yet.
good first issue
Good for newcomers
#78
by StellaAthena
was closed Nov 21, 2022
2 tasks
Validate MMMLU
validation
For validation of task implementations.
#475
by ollmer
was closed Jun 23, 2023
How to deal with load_in_8bit error?
bug
Something isn't working.
#607
by ryusangwon
was closed Aug 8, 2023
ConnectionError: Couldn't reach 'truthful_qa' on the Hub (ProxyError)
#762
by SefaZeng
was closed Aug 11, 2023
[refactor] squadv2 task results quite different than main branch
#938
by emilyvanark
was closed Nov 21, 2023
Same result on GPTQ 8bit and 4bit model, normal?
bug
Something isn't working.
#973
by Chrisz236
was closed Nov 22, 2023
Generator Error when evaluating GLUE and superGLUE
bug
Something isn't working.
#1240
by shiweijiezero
was closed Jan 22, 2024
[New Task] Upstream remaining Okapi multilingual tasks
feature request
A feature that isn't implemented yet.
good first issue
Good for newcomers
help wanted
Contributors and extra help welcome.
#1244
by haileyschoelkopf
was closed Feb 21, 2024