Get a 0 score in Evaluation of qwen chat in ScienceQA #61

YongLD · 2024-01-22T07:45:19Z

As the title says, I get a score of 0 when evaluating qwen chat on ScienceQA.

Why? I do have some results in qwen_chat_ScienceQA_TEST_prefetch.xlsx.

However, I have no answers from OpenAI. Do I have to use the OpenAI API to evaluate this dataset?

kennymckormick · 2024-01-23T11:52:38Z

Hi @YongLD,
We are trying to reproduce this problem and will provide feedback as soon as possible.

kennymckormick · 2024-01-24T04:02:55Z

Hi @YongLD,
In my experiment, this error does not occur on the latest main branch. Could you please try again with the latest main branch?

YongLD · 2024-01-24T06:32:53Z

Fantastic!! I didn't notice that you updated it two days ago. It works! Thanks for the amazing job.

kennymckormick · 2024-01-24T07:07:00Z

😄 Please help star this project if you appreciate the efforts~

YongLD closed this as completed Jan 24, 2024
