
Fix triviaqa task #525

Merged
merged 4 commits into EleutherAI:master from seopbo:fix-triviaqa on Jun 14, 2023
Conversation

seopbo
Contributor

@seopbo seopbo commented May 26, 2023

  • Fix triviaqa task
  • Test the task code using gpt-neox-20b
  • The command below is my test command
python main.py \
--model hf-causal-experimental \
--model_args use_accelerate=True,pretrained=/mount/lm_storage/checkpoints/gpt-neox-20b \
--tasks triviaqa \
--num_fewshot 5 \
--batch_size 16 \
--limit 100
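For context on what this test exercises, here is a minimal sketch of the alias-based exact-match scoring that TriviaQA-style tasks typically use (SQuAD-style normalization of the greedy completion, then comparison against the gold answer aliases). The function names and the usage example are illustrative assumptions, not code from this PR.

import re
import string

def normalize(text):
    # SQuAD-style normalization: lowercase, drop punctuation and
    # articles, and collapse whitespace.
    text = text.lower()
    text = "".join(ch for ch in text if ch not in set(string.punctuation))
    text = re.sub(r"\b(a|an|the)\b", " ", text)
    return " ".join(text.split())

def exact_match(prediction, aliases):
    # 1.0 if the normalized prediction equals any normalized gold alias.
    pred = normalize(prediction)
    return float(any(pred == normalize(alias) for alias in aliases))

# Illustrative usage:
print(exact_match("The Eiffel Tower.", ["Eiffel Tower", "Tour Eiffel"]))  # -> 1.0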


to: @StellaAthena
resolved: #456

@CLAassistant

CLAassistant commented May 26, 2023

CLA assistant check
All committers have signed the CLA.

@StellaAthena
Member

@seopbo Thanks for the contribution! Can you explain how this fixes TriviaQA?

@seopbo
Contributor Author

seopbo commented Jun 1, 2023

@seopbo Thanks for the contribution! Can you explain how this fixes TriviaQA?

I implemented this following our previous discussion (#456 (comment)).
With this code, gpt-neox-20b scores:

  • 0 shot: 0.2705
  • 5 shot: 0.3818

In the gpt-neox-20b paper, the scores are:

  • 0 shot: 0.259
  • 5 shot: 0.347

to: @StellaAthena

@haileyschoelkopf
Contributor

Thanks very much for this contribution, @seopbo! It is very appreciated :)

Will merge this shortly. With this change, LLaMA-7B achieves 40.5% instead of the 50% reported in the paper, and I'd like to confirm first that we can't get any closer to their setup or other published setups. (The paper seems to imply the prompt is "Q: {question}\nA:".)
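For reference, a minimal sketch of that prompt format written as harness-style doc_to_text / doc_to_target hooks; the hook names and document fields here are assumptions for illustration, not the exact code in this PR.

def doc_to_text(doc):
    # Few-shot examples use the same template, so the prompt ends with
    # "A:" and the model is asked to complete the answer.
    return "Q: " + doc["question"] + "\nA:"

def doc_to_target(doc):
    # Gold answer; the leading space lets it concatenate cleanly after "A:".
    return " " + doc["answer"]["value"]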

@haileyschoelkopf
Contributor

Thanks again! Changed this so it should use the filtered dev set, following LLaMA.
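For anyone reproducing this, a hedged sketch of pulling the TriviaQA dev split from the Hugging Face Hub; treating the "rc.nocontext" config as the filtered set LLaMA evaluated on is an assumption here, not something stated in this thread.

from datasets import load_dataset

# "rc" is the filtered reading-comprehension subset of TriviaQA;
# "rc.nocontext" drops the evidence documents, which closed-book
# evaluation does not need (assumption: this is the config the task uses).
dev = load_dataset("trivia_qa", "rc.nocontext", split="validation")
print(len(dev), dev[0]["question"])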

@haileyschoelkopf haileyschoelkopf merged commit b018a7d into EleutherAI:master Jun 14, 2023
2 checks passed
@wwngh1233

Can you update the new performance of LLaMA-7B on TriviaQA?

@seopbo seopbo deleted the fix-triviaqa branch June 29, 2023 01:15
qmdnls pushed a commit to qmdnls/lm-evaluation-harness that referenced this pull request Aug 17, 2023
LZY-the-boys pushed a commit to LZY-the-boys/lm-evaluation-harness-fast that referenced this pull request Sep 12, 2023