-
Notifications
You must be signed in to change notification settings - Fork 1.5k
Issues: EleutherAI/lm-evaluation-harness
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Author
Label
Projects
Milestones
Assignee
Sort
Issues list
Integrate Semantic Answer Similarity (SAS) into the evaluation metrics.
#1703
opened Apr 15, 2024 by
gonzalo-santamaria-iic
ValueError: BuilderConfig 'pile_freelaw' not found., issue on running PILE eval
#1714
opened Apr 16, 2024 by
Harryalways317
Should num_fewshot be type list?
feature request
A feature that isn't implemented yet.
#837
opened Sep 6, 2023 by
Wehzie
Allow A feature that isn't implemented yet.
good first issue
Good for newcomers
help wanted
Contributors and extra help welcome.
--include_path
to import an externally-defined LM subclass
feature request
#1457
opened Feb 22, 2024 by
haileyschoelkopf
Sanity checking the semantic meaning of "perplexity" in code
asking questions
For asking for clarification / support on library usage.
#1581
opened Mar 15, 2024 by
RylanSchaeffer
Whitespace before label in MultipleChoiceTask causes wrong label probability prediction
#1556
opened Mar 11, 2024 by
RibinMTC
Add a way to instantiate from HF.AutoModel (again)
#1978
opened Jun 17, 2024 by
dmitrii-palisaderesearch
Quac Dataset
feature request
A feature that isn't implemented yet.
good first issue
Good for newcomers
#827
opened Sep 4, 2023 by
RanchiZhao
OpenaiCompletionsLM invokes the completions API with max_tokens set to 0
#1903
opened May 29, 2024 by
chimezie
[TruthfulQA] update rouge-score version or add a way to suppress tokenizer logging
#1692
opened Apr 9, 2024 by
skramer-dev
Allow Task objects to defer dataset download
feature request
A feature that isn't implemented yet.
good first issue
Good for newcomers
help wanted
Contributors and extra help welcome.
#1558
opened Mar 11, 2024 by
haileyschoelkopf
[New Task] CommonsenseQA
feature request
A feature that isn't implemented yet.
good first issue
Good for newcomers
help wanted
Contributors and extra help welcome.
#1026
opened Nov 27, 2023 by
haileyschoelkopf
Request for files to be placed in 'path/containing/training/set/ngrams'.
#1375
opened Jan 31, 2024 by
dsdanielpark
Add tasks for performance on long context lengths
feature request
A feature that isn't implemented yet.
#1748
opened Apr 25, 2024 by
nairbv
Previous Next
ProTip!
Find all open issues with in progress development work with linked:pr.