Skip to content

Actions: EleutherAI/lm-evaluation-harness

All workflows

Actions

Loading...
Loading

Showing runs from all workflows
6,202 workflow runs
6,202 workflow runs

Filter by Event

Filter by Status

Filter by Branch

Filter by Actor

squad v2: load metric with evaluate
Unit Tests #3392: Pull request #2351 opened by baberabb
September 25, 2024 12:36 6m 5s dsev
September 25, 2024 12:36 6m 5s
fix writeout script
Tasks Modified #3419: Pull request #2350 opened by baberabb
September 25, 2024 12:25 16s writeout
September 25, 2024 12:25 16s
fix writeout script
Unit Tests #3391: Pull request #2350 opened by baberabb
September 25, 2024 12:25 5m 59s writeout
September 25, 2024 12:25 5m 59s
Support pipeline parallel with OpenVINO models
Tasks Modified #3418: Pull request #2349 opened by sstrehlk
September 25, 2024 11:30 Action required sstrehlk:sstrehlk-ov-parallelizm
September 25, 2024 11:30 Action required
Support pipeline parallel with OpenVINO models
Unit Tests #3390: Pull request #2349 opened by sstrehlk
September 25, 2024 11:30 Action required sstrehlk:sstrehlk-ov-parallelizm
September 25, 2024 11:30 Action required
Fix float limit override
Unit Tests #3389: Pull request #2325 synchronize by cjluo-omniml
September 24, 2024 23:15 5m 34s cjluo-omniml:patch-1
September 24, 2024 23:15 5m 34s
Fix float limit override
Tasks Modified #3417: Pull request #2325 synchronize by cjluo-omniml
September 24, 2024 23:15 14s cjluo-omniml:patch-1
September 24, 2024 23:15 14s
Merge New Tasks
Unit Tests #3388: Pull request #2341 opened by ToluClassics
September 24, 2024 15:16 7m 33s ToluClassics:main
September 24, 2024 15:16 7m 33s
Merge New Tasks
Tasks Modified #3416: Pull request #2341 opened by ToluClassics
September 24, 2024 15:16 4m 22s ToluClassics:main
September 24, 2024 15:16 4m 22s
add a note for missing dependencies (#2336)
Tasks Modified #3415: Commit bc50a9a pushed by baberabb
September 24, 2024 14:13 4m 4s main
September 24, 2024 14:13 4m 4s
add a note for missing dependencies (#2336)
Unit Tests #3387: Commit bc50a9a pushed by baberabb
September 24, 2024 14:13 6m 55s main
September 24, 2024 14:13 6m 55s
Mathvista
Tasks Modified #3414: Pull request #2321 synchronize by baberabb
September 24, 2024 13:29 1m 38s mathvista
September 24, 2024 13:29 1m 38s
Mathvista
Unit Tests #3386: Pull request #2321 synchronize by baberabb
September 24, 2024 13:29 5m 45s mathvista
September 24, 2024 13:29 5m 45s
Mathvista
Tasks Modified #3413: Pull request #2321 synchronize by baberabb
September 24, 2024 13:15 1m 43s mathvista
September 24, 2024 13:15 1m 43s
Mathvista
Unit Tests #3385: Pull request #2321 synchronize by baberabb
September 24, 2024 13:15 6m 5s mathvista
September 24, 2024 13:15 6m 5s
Added metric aggregation for leaderboard tasks.
Unit Tests #3384: Pull request #2340 opened by Am1n3e
September 24, 2024 12:34 5m 43s Am1n3e:add-leaderboard-aggregation
September 24, 2024 12:34 5m 43s
Added metric aggregation for leaderboard tasks.
Tasks Modified #3412: Pull request #2340 opened by Am1n3e
September 24, 2024 12:34 1m 47s Am1n3e:add-leaderboard-aggregation
September 24, 2024 12:34 1m 47s
Fixed dummy model (#2339)
Unit Tests #3383: Commit d7734d1 pushed by baberabb
September 24, 2024 12:08 5m 23s main
September 24, 2024 12:08 5m 23s
Fixed dummy model (#2339)
Tasks Modified #3411: Commit d7734d1 pushed by baberabb
September 24, 2024 12:08 13s main
September 24, 2024 12:08 13s
Fixed dummy model
Tasks Modified #3410: Pull request #2339 opened by Am1n3e
September 24, 2024 11:58 16s Am1n3e:fix-dummy-model
September 24, 2024 11:58 16s
Fixed dummy model
Unit Tests #3382: Pull request #2339 opened by Am1n3e
September 24, 2024 11:58 5m 23s Am1n3e:fix-dummy-model
September 24, 2024 11:58 5m 23s
Add a note for missing dependencies
Tasks Modified #3407: Pull request #2336 opened by eldarkurtic
September 24, 2024 05:14 4m 1s eldarkurtic:fix-leaderboard-docs
September 24, 2024 05:14 4m 1s
Add a note for missing dependencies
Unit Tests #3379: Pull request #2336 opened by eldarkurtic
September 24, 2024 05:14 6m 13s eldarkurtic:fix-leaderboard-docs
September 24, 2024 05:14 6m 13s
mmlu-pro: add newlines to task descriptions (not leaderboard)
Unit Tests #3378: Pull request #2334 synchronize by baberabb
September 23, 2024 19:46 5m 48s mmlupro_
September 23, 2024 19:46 5m 48s
mmlu-pro: add newlines to task descriptions (not leaderboard)
Tasks Modified #3406: Pull request #2334 synchronize by baberabb
September 23, 2024 19:46 5m 20s mmlupro_
September 23, 2024 19:46 5m 20s