Actions: openai/evals

Run unit tests

1,102 workflow runs
Added Quran Eval & Simple Fact Model-Graded Definition
Run unit tests #1758: Pull request #1511 synchronize by sakher
June 24, 2024 18:51 Action required sakher:quran-eval
Added Quran Eval & Simple Fact Model-Graded Definition
Run unit tests #1757: Pull request #1511 synchronize by sakher
June 24, 2024 18:47 Action required sakher:quran-eval
Added Quran Eval & Simple Fact Model-Graded Definition
Run unit tests #1756: Pull request #1511 synchronize by sakher
June 24, 2024 09:13 Action required sakher:quran-eval
Added Quran Eval & Simple Fact Model-Graded Definition
Run unit tests #1754: Pull request #1511 synchronize by sakher
June 20, 2024 14:13 3m 43s sakher:quran-eval
Fix problematic sample in Schelling Point
Run unit tests #1752: Pull request #1534 opened by JunShern
May 22, 2024 23:04 8m 5s jun/schellingpoint-fix
eval pattern-concat-logic
Run unit tests #1735: Pull request #1508 synchronize by natanaelwf
May 9, 2024 13:18 3m 55s natanaelwf:pattern-concat-logic
Release 3.0.1 (#1525)
Run unit tests #1733: Commit d3dc890 pushed by etr2460
May 1, 2024 00:50 4m 10s main
Release 3.0.1
Run unit tests #1732: Pull request #1525 opened by etr2460
May 1, 2024 00:24 3m 59s release/3.0.1
Make the torch dep optional (#1524)
Run unit tests #1731: Commit 1d3f11c pushed by etr2460
May 1, 2024 00:14 10m 41s main
Make the torch dep optional
Run unit tests #1730: Pull request #1524 synchronize by etr2460
April 30, 2024 23:56 4m 1s erik/torch-optional
Make the torch dep optional
Run unit tests #1729: Pull request #1524 opened by etr2460
April 30, 2024 23:52 2m 38s erik/torch-optional
Release 3.0.0 (#1520)
Run unit tests #1723: Commit 778caa6 pushed by etr2460
April 17, 2024 22:27 4m 5s main
Release 3.0.0
Run unit tests #1722: Pull request #1520 opened by etr2460
April 17, 2024 22:24 3m 54s release/3.0.0
Unpin dependencies (#1519)
Run unit tests #1721: Commit 518a9a8 pushed by etr2460
April 17, 2024 14:45 3m 58s main
Unpin dependencies
Run unit tests #1720: Pull request #1519 opened by hauntsaninja
April 17, 2024 08:01 3m 58s unpin
Remove citation prediction eval (#1512)
Run unit tests #1718: Commit c124f98 pushed by JunShern
April 5, 2024 04:07 6m 28s main
Allow for evals with no args (#1517)
Run unit tests #1717: Commit 4ed2f6f pushed by JunShern
April 5, 2024 04:06 3m 51s main
Allow for evals with no args
Run unit tests #1716: Pull request #1517 opened by thesofakillers
April 4, 2024 16:50 3m 54s thesofakillers:optional-args
Relax version constraint for playwright module (#1516)
Run unit tests #1715: Commit 20de8c5 pushed by JunShern
April 4, 2024 05:18 13m 23s main