MAP and MRR wrong for multiple gold documents #7758

ju-gu · 2024-05-29T10:11:56Z

Describe the bug
Both DocumentMAPEvaluator and DocumentMRREvaluator return wrong values when a query has more than one gold document. It seems the score is calculated for each gold document separately and overwritten on every iteration, instead of being calculated once over the whole set of gold documents.
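
To illustrate the suspected pattern, here is a minimal plain-Python sketch. It is an assumption about the shape of the bug, not Haystack's actual code, and `mrr_overriding` is a hypothetical name. Because the score is reassigned for every gold document, only the last gold document ("seven", at rank 7) determines the result, even though the first relevant document sits at rank 1.

```python
def mrr_overriding(gold, retrieved):
    """Hypothetical buggy pattern: the score is reassigned per gold document."""
    score = 0.0
    for gold_doc in gold:
        score = 0.0  # the result for the previous gold document is discarded
        for rank, doc in enumerate(retrieved, start=1):
            if doc == gold_doc:
                score = 1 / rank
                break
    return score


retrieved = ["one", "two", "three", "four", "five",
             "six", "seven", "eight", "nine", "ten"]
gold = ["one", "two", "three", "four", "seven"]

# Only the last gold document counts: 1/7, about 0.143.
print(mrr_overriding(gold, retrieved))
```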

Expected behavior
Both evaluators return the correct MAP and MRR values, calculated over all gold documents of a query.

To Reproduce

from haystack import Document
from haystack.components.evaluators import DocumentMAPEvaluator, DocumentMRREvaluator

retrieved_docs = ["one", "two", "three", "four", "five", "six", "seven", "eight", "nine", "ten"]
gold_docs = ["one", "two", "three", "four", "seven"]

mapevaluator = DocumentMAPEvaluator()
mrrevaluator = DocumentMRREvaluator()

# One query only: each argument is a list containing a single list of documents.
mapresult = mapevaluator.run(
    ground_truth_documents=[[Document(content=content) for content in gold_docs]],
    retrieved_documents=[[Document(content=content) for content in retrieved_docs]],
)

mrrresult = mrrevaluator.run(
    ground_truth_documents=[[Document(content=content) for content in gold_docs]],
    retrieved_documents=[[Document(content=content) for content in retrieved_docs]],
)

print(mapresult["individual_scores"])
print(mrrresult["individual_scores"])
print(mapresult["score"])
print(mrrresult["score"])
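
For comparison, the values one would expect for this input can be worked out by hand. The sketch below is a plain-Python reference calculation on the content strings, not Haystack's implementation. The helper names `average_precision` and `reciprocal_rank` are hypothetical, and it assumes that average precision is normalized by the number of relevant documents actually retrieved (some definitions divide by the total number of gold documents; both give the same result here because all five gold documents are retrieved).

```python
def average_precision(gold, retrieved):
    """Mean of precision@k over the ranks k where a gold document appears."""
    gold_set = set(gold)
    hits = 0
    precision_sum = 0.0
    for rank, doc in enumerate(retrieved, start=1):
        if doc in gold_set:
            hits += 1
            precision_sum += hits / rank
    # Assumption: normalize by the number of relevant documents retrieved.
    return precision_sum / hits if hits else 0.0


def reciprocal_rank(gold, retrieved):
    """1 / rank of the first retrieved document that is in the gold set."""
    gold_set = set(gold)
    for rank, doc in enumerate(retrieved, start=1):
        if doc in gold_set:
            return 1 / rank
    return 0.0


retrieved_docs = ["one", "two", "three", "four", "five",
                  "six", "seven", "eight", "nine", "ten"]
gold_docs = ["one", "two", "three", "four", "seven"]

# Gold documents appear at ranks 1, 2, 3, 4 and 7:
# AP = (1/1 + 2/2 + 3/3 + 4/4 + 5/7) / 5 = 33/35, about 0.943
print(average_precision(gold_docs, retrieved_docs))
# The first gold document is at rank 1, so the reciprocal rank is 1.0
print(reciprocal_rank(gold_docs, retrieved_docs))
```

With a single query, the aggregate "score" equals the one individual score, so under these assumptions a correct evaluator should report roughly 0.943 for MAP and 1.0 for MRR.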


mrm1001 · 2024-06-11T10:09:53Z

Hi @ju-gu, I'm trying to understand the example above. You are providing 10 retrieved documents but only 5 ground truth documents? In other words, you have:

Actual retrieved	True doc
one	one
two	two
three	three
four	four
five	seven
six	?
seven	?
eight	?
nine	?
ten	?

I don't think this is a typical use case. Are you sure this is what you're trying to do?

Or are you trying to do this instead:

Actual retrieved	True doc
["one", "two", "three", "four", "five", "six", "seven", "eight", "nine", "ten"]	["one", "two", "three","four","seven"]

ju-gu · 2024-06-11T10:27:53Z

I am doing the latter. The two lists, retrieved_docs and gold_docs, each become a list of lists in these lines:

ground_truth_documents=[[Document(content=content) for content in gold_docs]]
retrieved_documents=[[Document(content=content) for content in retrieved_docs]]

So it is just one set of retrieved documents and one set of gold documents.

ju-gu added the type:bug (Something isn't working) and 2.x (Related to Haystack v2.0) labels May 29, 2024

ju-gu changed the title ~~MAP and MRR wrong~~ MAP and MRR wrong for multiple gold documents May 29, 2024

shadeMe added the P1 (High priority, add to the next sprint) label May 29, 2024

julian-risch assigned Amnah199 Jun 3, 2024

Amnah199 mentioned this issue Jun 11, 2024

bug: fix MRR and MAP calculations #7841

Merged

Amnah199 closed this as completed in #7841 Jun 25, 2024
