Skip to content

Pull requests: EleutherAI/lm-evaluation-harness

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Sort

Pull requests list

fix wandb logger module import in example
#2041 opened Jun 30, 2024 by ToluClassics Loading…
[Draft] Exploring multimodality
#2039 opened Jun 28, 2024 by haileyschoelkopf Loading…
add mmlusr tasks
#2032 opened Jun 28, 2024 by SkySuperCat Loading…
swahili_ARC_Challenge
#2031 opened Jun 27, 2024 by msamwelmollel Loading…
Use shell=False in subprocess Function Calls
#2030 opened Jun 27, 2024 by pixeeai Loading…
Update trust_remote_code for Hellaswag
#2029 opened Jun 27, 2024 by haileyschoelkopf Loading…
Add Redlite tasks for safety benchmarking
#2020 opened Jun 25, 2024 by inno-simon Loading…
[Not For Merge] Enable chat-template for vLLM
#2017 opened Jun 25, 2024 by akjindal53244 Loading…
Fix regexp parsing for bbh_cot_fewshot
#2013 opened Jun 24, 2024 by arkapal3 Loading…
Added MedConceptsQA Benchmark
#2010 opened Jun 22, 2024 by Ofir408 Loading…
Refactor API models
#2008 opened Jun 22, 2024 by baberabb Loading…
make pytorch an optional dependency
#2004 opened Jun 20, 2024 by dlwh Loading…
Handle Empty openai response
#1999 opened Jun 19, 2024 by ciaranby Loading…
Fix partial caching of openai models
#1997 opened Jun 19, 2024 by ciaranby Loading…
Add Gigachat model
#1996 opened Jun 19, 2024 by seldereyy Draft
Add HumanEval
#1992 opened Jun 19, 2024 by hjlee1371 Loading…
main
#1988 opened Jun 18, 2024 by msamwelmollel Loading…
Fix local completion huggingface tokenizer
#1975 opened Jun 17, 2024 by okdshin Loading…
mela
#1970 opened Jun 16, 2024 by Geralt-Targaryen Loading…
Fix OpenAI API discrepancies
#1969 opened Jun 14, 2024 by chimezie Loading…
Mmlu Pro
#1961 opened Jun 13, 2024 by ysjprojects Loading…
ProTip! no:milestone will show everything without a milestone.