Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Get a 0 score in Evaluation of qwen chat in ScienceQA #61

Closed
YongLD opened this issue Jan 22, 2024 · 4 comments
Closed

Get a 0 score in Evaluation of qwen chat in ScienceQA #61

YongLD opened this issue Jan 22, 2024 · 4 comments

Comments

@YongLD
Copy link

YongLD commented Jan 22, 2024

As I said in the title,

image

Why? I have some result in qwen_chat_ScienceQA_TEST_prefetch.xlsx:

image

But I have no answer with openai, should I must use openai api to evaluate the dataset?

@kennymckormick
Copy link
Member

Hi, @YongLD ,
we are trying to reproduce this problem and will provide feedback asap.

@kennymckormick
Copy link
Member

Hi, @YongLD ,
According to my experiment, this error will not occur on latest main branch. Would you please try again with our latest main branch?
image

@YongLD
Copy link
Author

YongLD commented Jan 24, 2024

Fantastic!! I didn't notice that you updated it two days ago. It works! Thanks for the amazing job.

@YongLD YongLD closed this as completed Jan 24, 2024
@kennymckormick
Copy link
Member

Fantastic!! I didn't notice that you updated it two days ago. It works! Thanks for the amazing job.

😄 Please help star this project if you appreciate the efforts~

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants