Skip to content

Pull requests: EleutherAI/lm-evaluation-harness

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Sort

Pull requests list

fix cache
#2037 by baberabb was merged Jun 28, 2024 Loading…
Fix strip whitespace filter
#2036 by NathanHB was closed Jun 28, 2024 Loading…
Add chat template to vllm
#2034 by baberabb was merged Jun 28, 2024 Loading…
Fix trust_remote_code-related test failures
#2024 by haileyschoelkopf was merged Jun 26, 2024 Loading…
adds leaderboard tasks
#2023 by NathanHB was closed Jun 26, 2024 Loading…
Add MMLU-ru based on MERA
#2019 by SpirinEgor was closed Jun 25, 2024 Loading…
Hotfix breaking import
#2015 by StellaAthena was merged Jun 24, 2024 Loading…
Remove LM dependency from build_all_requests
#2011 by baberabb was merged Jun 25, 2024 Loading…
Remove LM dependency from build_all_requests
#2009 by baberabb was closed Jun 22, 2024 Loading…
Fixes scrolls task bug with few_shot examples
#2003 by xksteven was merged Jun 28, 2024 Loading…
Fix Datasets --trust_remote_code
#1998 by haileyschoelkopf was merged Jun 19, 2024 Loading…
Log fewshot_as_multiturn in results files
#1995 by haileyschoelkopf was merged Jun 19, 2024 Loading…
Fix Paloma Template yaml
#1993 by haileyschoelkopf was merged Jun 19, 2024 Loading…
added yaml and util file
#1991 by satyamshukl was closed Jun 25, 2024 Loading…
Fix self assignment in neuron_optimum.py
#1990 by LSinev was merged Jun 18, 2024 Loading…
Added ArabicMMLU
#1987 by Yazeed7 was merged Jun 19, 2024 Loading…
Added ArabicMMLU
#1986 by Yazeed7 was closed Jun 18, 2024 Loading…
add trust_remote_code for piqa
#1983 by changwangss was merged Jun 18, 2024 Loading…
Update interface.md
#1982 by johnwee1 was merged Jun 25, 2024 Loading…
Add Task: CBT
#1981 by ookkeeeee was closed Jun 25, 2024 Loading…
added bias and stereotype classification tasks
#1974 by aditya20t was closed Jun 17, 2024 Loading…
Add GigaChat API
#1973 by seldereyy was closed Jun 19, 2024 Draft
ProTip! Exclude everything labeled bug with -label:bug.