Skip to content

Pull requests: EleutherAI/lm-evaluation-harness

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Sort

Pull requests list

mlx Model (loglikelihood & generate_until)
#1902 opened May 29, 2024 by chimezie Loading…
Vllm get tokenizer
#1794 opened May 6, 2024 by AguirreNicolas Loading…
[API] Add octoai back-end
#936 opened Oct 19, 2023 by vvchernov Loading…
Update scorer for gsm8k task
#943 opened Oct 24, 2023 by vvchernov Draft
Added no-softmax entries to MODEL_REGISTRY
#1052 opened Dec 2, 2023 by denizyuret Loading…
Add Selfcheckgpt evaluation to tasks
#1080 opened Dec 7, 2023 by PingNie1 Loading…
add all vlsp
#1123 opened Dec 14, 2023 by qnguyen3 Draft
Standardize metrics
#1167 opened Dec 19, 2023 by lintangsutawika Draft
Add various social bias tasks
#1185 opened Dec 21, 2023 by oskarvanderwal Loading…
1 task
Add task table
#1219 opened Dec 28, 2023 by baberabb Draft
Add Cohere API as available language model
#395 opened Mar 10, 2023 by rdnfn Loading…
Add Group-Config
#1373 opened Jan 31, 2024 by lintangsutawika Draft
fix wandb logger module import in example
#2041 opened Jun 30, 2024 by ToluClassics Loading…
Add parallel processing for OpenAI completion models
#1460 opened Feb 22, 2024 by pbevan1 Loading…
Adding new task: Boxes
#1557 opened Mar 11, 2024 by irafayabdul Loading…
add context-based requests processing
#1571 opened Mar 13, 2024 by artemorloff Loading…
Physics GRE task added
#1655 opened Apr 1, 2024 by ShayekhBinIslam Loading…
Klokan-qa task
#1657 opened Apr 1, 2024 by hynky1999 Loading…
ProTip! Follow long discussions with comments:>50.