Skip to content

Pull requests: EleutherAI/lm-evaluation-harness

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Sort

Pull requests list

Add KoCommonGEN v2 benchmark
#2208 opened Aug 12, 2024 by metterian Loading…
Logging
#2203 opened Aug 9, 2024 by lintangsutawika Loading…
Add new benchmark: Basque bench
#2153 opened Jul 30, 2024 by zxcvuser Loading…
Chat template fix
#2058 opened Jul 2, 2024 by NathanHB Loading…
swahili_ARC_Challenge
#2031 opened Jun 27, 2024 by msamwelmollel Loading…
Add Redlite tasks for safety benchmarking
#2020 opened Jun 25, 2024 by inno-simon Loading…
Addition of BedrockChatModel
#1708 opened Apr 16, 2024 by jacquelinegarrahan Loading…
Klokan-qa task
#1657 opened Apr 1, 2024 by hynky1999 Loading…
Physics GRE task added
#1655 opened Apr 1, 2024 by ShayekhBinIslam Loading…
Adding new task: Boxes
#1557 opened Mar 11, 2024 by irafayabdul Loading…
add all vlsp
#1123 opened Dec 14, 2023 by qnguyen3 Draft
ProTip! Type g i on any issue or pull request to go back to the issue listing page.