Error when evaluating MMBench_TEST_EN #57

KosumosuL · 2024-01-19T03:18:43Z

Looks like the annotation in MMBench_TEST_EN does not contains the key "answer"? Yet the prefetch_acc has to access this key, as shown in

VLMEvalKit/vlmeval/inference.py

Line 156 in 7f099c1

if matched == item['answer']:

YongLD · 2024-01-20T09:10:28Z

Because The test_file have no key "answer", I think they do not add the answer of the choise in the file MMBENCH_TEST_EN.tsv.

kennymckormick · 2024-01-20T09:11:46Z

Hi, @KosumosuL ,
The label of testing samples is confidential, to obtain the evaluation accuracy of MMBench test split, please upload it to the evaluation service of send it to [email protected]

KosumosuL · 2024-01-20T13:51:31Z

Hi, @KosumosuL , The label of testing samples is confidential, to obtain the evaluation accuracy of MMBench test split, please upload it to the evaluation service of send it to [email protected]

I can understand that. My point is, however, the code should consider this situation for avoiding this error, or the readme should point this out.

kennymckormick · 2024-01-20T15:43:20Z

Hi, @KosumosuL , The label of testing samples is confidential, to obtain the evaluation accuracy of MMBench test split, please upload it to the evaluation service of send it to [email protected]

I can understand that. My point is, however, the code should consider this situation for avoiding this error, or the readme should point this out.

Hi, @KosumosuL , that's a good point, I have fixed the problem in this commit: e992046, the evaluation skip for MMBench_TEST will be skipped on any non-official servers. You can try again on the latest main branch.

KosumosuL closed this as completed Jan 21, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Error when evaluating MMBench_TEST_EN #57

Error when evaluating MMBench_TEST_EN #57

KosumosuL commented Jan 19, 2024

YongLD commented Jan 20, 2024 •

edited

kennymckormick commented Jan 20, 2024

KosumosuL commented Jan 20, 2024

kennymckormick commented Jan 20, 2024

Error when evaluating MMBench_TEST_EN #57

Error when evaluating MMBench_TEST_EN #57

Comments

KosumosuL commented Jan 19, 2024

YongLD commented Jan 20, 2024 • edited

kennymckormick commented Jan 20, 2024

KosumosuL commented Jan 20, 2024

kennymckormick commented Jan 20, 2024

YongLD commented Jan 20, 2024 •

edited