We should change the inputs of the `AnswerExactMatchEvaluator` added in #7050 from `questions: List[str], ground_truth_answers: List[List[str]], predicted_answers: List[List[str]]` to `ground_truth_answers: List[str], predicted_answers: List[str]`.
This change will make all metrics in Haystack core and in integrations, statistical and model-based alike, consistent in the inputs they expect: answers are always `List[str]`, queries are always `List[str]`, and documents (contexts) are always `List[List[str]]`. It also simplifies the implementation of the new metrics and will allow us to move faster.
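For illustration, a minimal sketch of what the proposed flat-input interface could look like, assuming the Haystack 2.x `@component` decorator; the class body is illustrative, not the actual implementation:

```python
from typing import Dict, List

from haystack import component


@component
class AnswerExactMatchEvaluator:
    """Sketch of the proposed flat-input interface (illustrative only)."""

    @component.output_types(result=float)
    def run(self, ground_truth_answers: List[str], predicted_answers: List[str]) -> Dict[str, float]:
        # One ground-truth and one predicted answer per query, so the two
        # lists must be aligned and of equal length.
        if len(ground_truth_answers) != len(predicted_answers):
            raise ValueError("ground_truth_answers and predicted_answers must have the same length.")
        # Fraction of predictions that exactly match their ground truth.
        matches = sum(gt == pred for gt, pred in zip(ground_truth_answers, predicted_answers))
        return {"result": matches / len(ground_truth_answers)}
```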
Describe alternatives you've considered
Keeping the inputs as they are would leave them inconsistent with the model-based metrics and the integrations of evaluation frameworks. However, the behavior would match the exact match metric in Haystack 1.x and be more flexible for datasets with multiple ground-truth answers, such as SQuAD 2.0, and multiple predicted answers, such as our Reader's output; a sketch of how such datasets could still be adapted to the flat inputs follows.
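As a hedged sketch of that adaptation, multi-ground-truth data could be flattened in a small preprocessing step before the evaluator runs. The helper below (`flatten_ground_truths` is a hypothetical name, not part of Haystack) makes exact match on the flat lists behave like "correct if the prediction matches any ground truth":

```python
from typing import List, Tuple


def flatten_ground_truths(
    multi_ground_truths: List[List[str]], predicted_answers: List[str]
) -> Tuple[List[str], List[str]]:
    """Hypothetical adapter: reduce List[List[str]] ground truths to List[str].

    For each prediction, keep the matching ground truth if one exists,
    otherwise fall back to the first ground truth, so exact match on the
    returned flat lists counts a prediction correct if it matches any
    of its ground truths.
    """
    flat_ground_truths = []
    for ground_truths, prediction in zip(multi_ground_truths, predicted_answers):
        if prediction in ground_truths:
            flat_ground_truths.append(prediction)
        else:
            # "" covers empty answer lists, e.g. SQuAD 2.0's unanswerable questions.
            flat_ground_truths.append(ground_truths[0] if ground_truths else "")
    return flat_ground_truths, predicted_answers


# Example with two acceptable answers for the first question:
gts, preds = flatten_ground_truths(
    [["Paris", "the city of Paris"], ["1969"]],
    ["the city of Paris", "1970"],
)
# gts == ["the city of Paris", "1969"]; only the first pair is an exact match.
```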