Issues: EleutherAI/lm-evaluation-harness
Cannot have both a group list and task list
asking questions
For asking for clarification / support on library usage.
bug
Something isn't working.
#1767
opened Apr 29, 2024 by
steven-basart
"Please select a token to use as pad_token" error for alpaca-lora-7b model
#434
opened Apr 24, 2023 by
oshev
toxigen task measures toxicity classification rather than whether generations are toxic?
#974
opened Nov 8, 2023 by
laphang
How to compute the perplexity only on the answer?
asking questions
For asking for clarification / support on library usage.
#1370
opened Jan 30, 2024 by
Luobots
wikitext weird results Mistral-7B-v0.1 length=4096 // Gemma-7B bos missing
bug
Something isn't working.
#1471
opened Feb 26, 2024 by
vince62s
Quac Dataset
feature request
A feature that isn't implemented yet.
good first issue
Good for newcomers
#827
opened Sep 4, 2023 by
RanchiZhao
assert len(continuation_enc) error in _loglikelihood_tokens for certain (but not all) tasks?
#1053
opened Dec 2, 2023 by
lhl
Process hangs when using tensor_parallel_size and data_parallel_size together
bug
Something isn't working.
#1734
opened Apr 22, 2024 by
harshakokel
Implement the SuperGLUE evaluation
feature request
A feature that isn't implemented yet.
#22
opened Sep 16, 2020 by
StellaAthena
1 of 2 tasks
Acc vs acc_norm
asking questions
For asking for clarification / support on library usage.
#1396
opened Feb 5, 2024 by
sqrkl
TGI support - API evaluation of HF models
feature request
A feature that isn't implemented yet.
help wanted
Contributors and extra help welcome.
#869
opened Sep 19, 2023 by
ManuelFay
NAN value for truthfulqa_mc2 on full finetuned model TinyLlama
#1340
opened Jan 23, 2024 by
hahmad2008
Clarification on API Endpoint: /v1/completions vs /v1/chat/completions
#1637
opened Mar 26, 2024 by
gerayking
The tokenizer add_special_tokens parameter for t5 model lambada task
#1017
opened Nov 22, 2023 by
daisyden
Inverse Scaling Tasks?
feature request
A feature that isn't implemented yet.
good first issue
Good for newcomers
help wanted
Contributors and extra help welcome.
#1442
opened Feb 18, 2024 by
RylanSchaeffer
Wandb logger can't handle groups with heterogenous metrics
#1958
opened Jun 12, 2024 by
dmitrii-palisaderesearch
When using parallelize=True, raise Runtime Error: expected all tensors to be on the same device
bug
Something isn't working.
#1575
opened Mar 14, 2024 by
feiba54