Add EQ-Bench #1459

djstrong · 2024-02-22T17:28:58Z

EQ-Bench (https://github.com/EQ-bench/EQ-Bench) is more and more popular. I think it should be possible to implement, do you agree?

haileyschoelkopf · 2024-02-23T15:52:28Z

We'd welcome a contribution for EQ-Bench!

pbevan1 · 2024-02-25T15:18:42Z

I can have a look at doing this if no one is already

djstrong · 2024-02-27T19:47:35Z

I will be happy to test it :)

* Start adding eq-bench * Start adding to yaml and utils * Get metric working * Add README * Handle cases where answer is not parseable * Deal with unparseable answers and add percent_parseable metric * Update README

haileyschoelkopf added help wanted Contributors and extra help welcome. feature request A feature that isn't implemented yet. good first issue Good for newcomers labels Feb 23, 2024

haileyschoelkopf assigned djstrong and pbevan1 and unassigned djstrong Mar 1, 2024

pbevan1 mentioned this issue Mar 2, 2024

Add EQ-Bench as per #1459 #1511

Merged

haileyschoelkopf closed this as completed Mar 14, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add EQ-Bench #1459

Add EQ-Bench #1459

djstrong commented Feb 22, 2024

haileyschoelkopf commented Feb 23, 2024

pbevan1 commented Feb 25, 2024

djstrong commented Feb 27, 2024

Add EQ-Bench #1459

Add EQ-Bench #1459

Comments

djstrong commented Feb 22, 2024

haileyschoelkopf commented Feb 23, 2024

pbevan1 commented Feb 25, 2024

djstrong commented Feb 27, 2024