Skip to content

Actions: huggingface/lighteval

Actions

All workflows

Actions

Loading...
Loading

Showing runs from all workflows
1,648 workflow runs
1,648 workflow runs

Filter by Event

Loading

Filter by Status

Loading

Filter by Branch

Loading

Filter by Actor

Loading
Expose samples via the CLI
Quality #882: Pull request #228 synchronize by clefourrier
July 25, 2024 17:14 2m 17s clem_details
July 25, 2024 17:14 2m 17s
Expose samples via the CLI
Tests #882: Pull request #228 synchronize by clefourrier
July 25, 2024 17:14 38m 18s clem_details
July 25, 2024 17:14 38m 18s
adds llm as judge using transformers
Tests #879: Pull request #223 synchronize by clefourrier
July 25, 2024 13:09 39m 26s nathan-add-judge-transformers
July 25, 2024 13:09 39m 26s
adds llm as judge using transformers
Quality #879: Pull request #223 synchronize by clefourrier
July 25, 2024 13:09 9m 44s nathan-add-judge-transformers
July 25, 2024 13:09 9m 44s
adds llm as judge using transformers
Quality #878: Pull request #223 synchronize by NathanHB
July 25, 2024 12:52 2m 21s nathan-add-judge-transformers
July 25, 2024 12:52 2m 21s
adds llm as judge using transformers
Tests #878: Pull request #223 synchronize by NathanHB
July 25, 2024 12:52 43m 4s nathan-add-judge-transformers
July 25, 2024 12:52 43m 4s
udpated piqa (#222)
Tests #877: Commit 506abda pushed by clefourrier
July 25, 2024 09:43 39m 5s main
July 25, 2024 09:43 39m 5s
udpated piqa (#222)
Quality #877: Commit 506abda pushed by clefourrier
July 25, 2024 09:43 2m 18s main
July 25, 2024 09:43 2m 18s
Use inference endpoints as judge
Tests #876: Pull request #237 synchronize by clefourrier
July 25, 2024 08:39 37m 13s inference_endpoints
July 25, 2024 08:39 37m 13s
Use inference endpoints as judge
Quality #876: Pull request #237 synchronize by clefourrier
July 25, 2024 08:39 2m 16s inference_endpoints
July 25, 2024 08:39 2m 16s
Use inference endpoints as judge
Quality #875: Pull request #237 opened by clefourrier
July 25, 2024 08:23 2m 19s inference_endpoints
July 25, 2024 08:23 2m 19s
Use inference endpoints as judge
Tests #875: Pull request #237 opened by clefourrier
July 25, 2024 08:23 36m 14s inference_endpoints
July 25, 2024 08:23 36m 14s
Fixing PIQA implementation
Tests #874: Pull request #222 synchronize by clefourrier
July 25, 2024 07:26 37m 16s piqa_edits
July 25, 2024 07:26 37m 16s
Fixing PIQA implementation
Quality #874: Pull request #222 synchronize by clefourrier
July 25, 2024 07:26 2m 11s piqa_edits
July 25, 2024 07:26 2m 11s
fix (#233)
Tests #873: Commit db502dd pushed by NathanHB
July 24, 2024 18:23 37m 54s main
July 24, 2024 18:23 37m 54s
fix (#233)
Quality #873: Commit db502dd pushed by NathanHB
July 24, 2024 18:23 3m 2s main
July 24, 2024 18:23 3m 2s
Fixing PIQA implementation
Tests #872: Pull request #222 synchronize by NathanHB
July 24, 2024 18:23 38m 49s piqa_edits
July 24, 2024 18:23 38m 49s
Fixing PIQA implementation
Quality #872: Pull request #222 synchronize by NathanHB
July 24, 2024 18:23 2m 18s piqa_edits
July 24, 2024 18:23 2m 18s
Fixes #165 - adds a check with generation length is not set by the user
Tests #870: Pull request #233 synchronize by NathanHB
July 24, 2024 13:00 38m 1s #165
July 24, 2024 13:00 38m 1s