Some confusion about eval #22

Open
willer-lu opened this issue Mar 20, 2024 · 1 comment

Comments

@willer-lu

For example, suppose the ground truth is 'University Yale' but the LLM outputs 'Yale University', or the two strings are '01.01' vs. '1st January'.
Should such instances be counted as right or wrong when calculating EM?

@GasolSun36
Collaborator

Hi,
It's definitely counted as wrong, which is a shortcoming of EM. However, you can use turbo to judge whether the generated result is correct or incorrect given the ground truth.
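
To illustrate the point, here is a minimal sketch of a normalized exact-match check, assuming a common normalization (lowercasing, stripping punctuation and English articles). It is not necessarily the exact code this repository uses. The helper `build_judge_prompt` is hypothetical: it only formats the text you might send to a judge model and makes no API call.

```python
import re
import string


def normalize(text: str) -> str:
    """Lowercase, drop ASCII punctuation and English articles, collapse whitespace."""
    text = text.lower()
    text = "".join(ch for ch in text if ch not in string.punctuation)
    text = re.sub(r"\b(a|an|the)\b", " ", text)
    return " ".join(text.split())


def exact_match(prediction: str, ground_truth: str) -> bool:
    """Return True only if the normalized strings are identical."""
    return normalize(prediction) == normalize(ground_truth)


def build_judge_prompt(prediction: str, ground_truth: str) -> str:
    """Hypothetical prompt for an LLM judge; the wording is an assumption."""
    return (
        "Decide whether the prediction means the same thing as the ground truth.\n"
        f"Ground truth: {ground_truth}\n"
        f"Prediction: {prediction}\n"
        "Answer with 'correct' or 'incorrect'."
    )


# Both examples from the question fail EM even after normalization,
# because EM compares strings and not meaning.
print(exact_match("Yale University", "University Yale"))
print(exact_match("1st January", "01.01"))
```

Word order and date formats are not covered by this kind of normalization, so pairs like these are scored as misses unless a judge model (or a task-specific rule) is used instead.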
