Add task variants replicating Llama 1 / 2 evaluation numbers #1078

haileyschoelkopf · 2023-12-07T15:27:16Z

In some cases, our implementations cannot replicate the results reported in the Llama 1 paper (or the Llama 2 paper, whose eval setups sometimes differ), because Meta used custom, undisclosed prompts or prepended task descriptions.

However, for some tasks, such as TriviaQA, we have successfully found matching setups or reverse-engineered the prompts. Where we have done this, we should add documentation and task variants for ease of use.

afcruzs · 2024-03-16T20:19:57Z

@haileyschoelkopf Is this documented anywhere yet, even informally (e.g., a branch, a Google Doc, etc.)?

haileyschoelkopf added the "feature request" label Dec 7, 2023

haileyschoelkopf self-assigned this Dec 7, 2023
