Can't reproduce the results for GLUE and hyperparameter misalignment #149

nbasyl · 2023-11-22T08:20:01Z

Hi,
Thanks for the great work.

I am trying to reproduce the result of Roberta-large on the NLU tasks, however, I got a CoLA score = 0 and MNLI = 31.3 using the provided finetuning scripts, and then I found out that there are misalignments between the hyperparameters in the provided training scripts and those on the paper. For example, in roberta_large_cola.sh the lr is set to 3e-4, but in the paper, it is set to 2e-4. Which settings should I follow to reproduce the reported result?

looking forward to your reply!

Best,
Sean

nbasyl · 2023-11-22T09:26:07Z

I changed the lr in the CoLA training script to 2e-4 and solved the CoLA constant 0 eval correlation value problem, but still couldn't reproduce the MNLI result :(

nbasyl · 2023-11-22T09:28:42Z

But I am still only getting 62.82 CoLA score, anyone encountered similar problem when trying to reproduce the result

zxchasing · 2024-03-17T02:10:28Z

But I am still only getting 62.82 CoLA score, anyone encountered similar problem when trying to reproduce the result

Hi，Did you solve this problem?

Car-pe · 2024-04-14T11:12:30Z

I changed the lr in the CoLA training script to 2e-4 and solved the CoLA constant 0 eval correlation value problem, but still couldn't reproduce the MNLI result :(

My result in CoLA is 63.48 which matches the paper. And the random seeds used are (1 3 13 37 71), but I can not reproduce other task, only CoLA can match the paper.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Can't reproduce the results for GLUE and hyperparameter misalignment #149

Can't reproduce the results for GLUE and hyperparameter misalignment #149

nbasyl commented Nov 22, 2023

nbasyl commented Nov 22, 2023

nbasyl commented Nov 22, 2023

zxchasing commented Mar 17, 2024

Car-pe commented Apr 14, 2024

Can't reproduce the results for GLUE and hyperparameter misalignment #149

Can't reproduce the results for GLUE and hyperparameter misalignment #149

Comments

nbasyl commented Nov 22, 2023

nbasyl commented Nov 22, 2023

nbasyl commented Nov 22, 2023

zxchasing commented Mar 17, 2024

Car-pe commented Apr 14, 2024