Skip to content

Actions: huggingface/lighteval

Actions

Quality

Actions

Loading...
Loading

Show workflow options

Create status badge

Loading
187 workflow run results
187 workflow run results

Filter by Event

Loading

Filter by Status

Loading

Filter by Branch

Loading

Filter by Actor

Loading
udpated piqa (#222)
Quality #877: Commit 506abda pushed by clefourrier
July 25, 2024 09:43 2m 18s main
July 25, 2024 09:43 2m 18s
fix (#233)
Quality #873: Commit db502dd pushed by NathanHB
July 24, 2024 18:23 3m 2s main
July 24, 2024 18:23 3m 2s
Removes default bert scorer init (#234)
Quality #867: Commit 2b4b637 pushed by NathanHB
July 24, 2024 12:43 2m 17s main
July 24, 2024 12:43 2m 17s
remove latex writer since we don't use it (#231)
Quality #859: Commit 86fbe64 pushed by clefourrier
July 23, 2024 12:33 2m 31s main
July 23, 2024 12:33 2m 31s
Update issue templates (#235)
Quality #854: Commit 003f05e pushed by NathanHB
July 23, 2024 10:19 2m 15s main
July 23, 2024 10:19 2m 15s
Fix a tiny bug in DROP metric (#229)
Quality #848: Commit 66ed7a2 pushed by clefourrier
July 18, 2024 10:36 2m 11s main
July 18, 2024 10:36 2m 11s
Quantization related issues (#224)
Quality #842: Commit 44f9a46 pushed by clefourrier
July 17, 2024 14:36 2m 17s main
July 17, 2024 14:36 2m 17s
Make evaluator invariant of input request type order (#215)
Quality #838: Commit 951cd5b pushed by clefourrier
July 17, 2024 13:48 2m 19s main
July 17, 2024 13:48 2m 19s
Fix _init_max_length in base_model.py (#185)
Quality #832: Commit d43c9a3 pushed by clefourrier
July 17, 2024 12:34 2m 20s main
July 17, 2024 12:34 2m 20s
launch lighteval using lighteval --args (#152)
Quality #826: Commit 4550cb7 pushed by clefourrier
July 17, 2024 09:16 2m 13s main
July 17, 2024 09:16 2m 13s
Add metrics as functions (#214)
Quality #821: Commit aaf7e8a pushed by clefourrier
July 17, 2024 08:07 2m 15s main
July 17, 2024 08:07 2m 15s
should fix most inference endpoints issues of version config (#226)
Quality #816: Commit 733257f pushed by NathanHB
July 16, 2024 13:46 2m 24s main
July 16, 2024 13:46 2m 24s
Transformers model as Judge
Quality #803: Pull request #174 synchronize by NathanHB
July 11, 2024 11:45 2m 13s anilaltuner:main
July 11, 2024 11:45 2m 13s
Data split depending on eval params (#169)
Quality #802: Commit 66e6aae pushed by NathanHB
July 11, 2024 11:16 2m 21s main
July 11, 2024 11:16 2m 21s
Now only uses functions for prompt definition (#213)
Quality #792: Commit 4651531 pushed by clefourrier
July 9, 2024 13:29 2m 19s main
July 9, 2024 13:29 2m 19s
Use only dataclasses for task init (#212)
Quality #789: Commit 3aaec22 pushed by clefourrier
July 9, 2024 12:42 3m 36s main
July 9, 2024 12:42 3m 36s
Fix a few typos in metrics.py (#218)
Quality #787: Commit 3f90950 pushed by clefourrier
July 9, 2024 11:42 3m 8s main
July 9, 2024 11:42 3m 8s
Homogeneize logging system (#150)
Quality #784: Commit ac57b78 pushed by clefourrier
July 9, 2024 10:13 2m 19s main
July 9, 2024 10:13 2m 19s
Adds a dummy/random model for baseline init (#220)
Quality #778: Commit 70f7fc6 pushed by clefourrier
July 9, 2024 07:41 2m 12s main
July 9, 2024 07:41 2m 12s
Fix the bug (#216)
Quality #763: Commit 0528f29 pushed by clefourrier
July 8, 2024 09:04 2m 22s main
July 8, 2024 09:04 2m 22s
July 8, 2024 06:38 2m 22s
Fix a few typos and do a tiny refactor (#187)
Quality #746: Commit 843a0f8 pushed by clefourrier
July 5, 2024 06:57 2m 36s main
July 5, 2024 06:57 2m 36s
Fix a few typos and do a tiny refactor
Quality #745: Pull request #187 synchronize by sadra-barikbin
July 4, 2024 19:42 2m 45s sadra-barikbin:main
July 4, 2024 19:42 2m 45s
ADD GPT-4 as Judge (#206)
Quality #743: Commit 0bceaee pushed by clefourrier
July 4, 2024 14:38 2m 12s main
July 4, 2024 14:38 2m 12s
fix llm as judge warnings (#173)
Quality #737: Commit 3a80833 pushed by clefourrier
July 4, 2024 10:48 2m 16s main
July 4, 2024 10:48 2m 16s