
Add switch to QA pred head for ranking by confidence scores #836

Merged
merged 3 commits into master on Aug 18, 2021

Conversation

@julian-risch (Member) commented on Aug 17, 2021

QACandidates contain both a score and a confidence field, but so far only score was used to rank QACandidates.
This PR keeps ranking QACandidates by score as the default but also allows setting use_confidence_scores_for_ranking in QAPredictionHead, which activates ranking QACandidates by confidence.

I think it's better to keep both fields, score and confidence, in QACandidates instead of dropping one of them entirely. That way we stay flexible and maintain backwards compatibility.

A new test case checks that results are ranked as expected with and without setting use_confidence_scores_for_ranking.

closes #808
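
Roughly, the new switch amounts to choosing the sort key for the candidate list. A minimal sketch (the field names follow QACandidate, but the sorting helper and its signature are illustrative assumptions, not FARM's actual implementation):

```python
from dataclasses import dataclass

@dataclass
class QACandidate:
    answer: str
    score: float       # unbounded, logit-based
    confidence: float  # normalized to [0, 1]

def rank_candidates(candidates, use_confidence_scores_for_ranking=False):
    """Sort candidates descending by score (default) or by confidence."""
    key = (lambda c: c.confidence) if use_confidence_scores_for_ranking \
          else (lambda c: c.score)
    return sorted(candidates, key=key, reverse=True)
```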

@julian-risch changed the title from "WIP: Add switch to QA pred head for ranking by confidence scores" to "Add switch to QA pred head for ranking by confidence scores" on Aug 18, 2021
@julian-risch marked this pull request as ready for review on August 18, 2021 12:56

@tholor (Member) left a comment:

LGTM.
Small thing: not directly related to this PR, but I think it's a great opportunity to add a bit more explanation of the different scores we have (see comment below).

@@ -963,6 +964,8 @@ def __init__(self, layer_dims=[768,2],
:type duplicate_filtering: int
:param temperature_for_confidence: The divisor that is used to scale logits to calibrate confidence scores
:type temperature_for_confidence: float
:param use_confidence_scores_for_ranking: Whether to sort answers by confidence score (normalized between 0 and 1) or by standard score (unbounded)

@tholor (Member):


Can we document somewhere here a bit more clearly what these different scores exactly are and how they are calculated? If I remember correctly, we have these three cases (illustrated in the sketch below):
a) score = start logit + end logit (unbounded)
b) confidence (default) = logits scaled to 0-1, incorporating no_answer
c) confidence (calibrated) = same as b), but the logits are first scaled by a learned temperature parameter
I am sure I will forget about this in a couple of weeks, and it would be helpful to have it documented for others. Probably it's best to do that in the general prediction head docstring (+ Haystack's FARMReader).
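
To make the three cases concrete, here is a hedged numerical sketch (the function name, shapes, and normalization details are assumptions for illustration, not FARM's actual implementation):

```python
import numpy as np

def qa_scores(span_logits, span_idx, temperature=1.0):
    """Compute the three score variants for one candidate answer span.

    span_logits: (start_logit + end_logit) for every candidate span,
    including the no_answer span; span_idx selects the span of interest.
    """
    # a) score: raw summed start/end logits, unbounded
    score = span_logits[span_idx]

    # b) confidence (default): softmax over all spans squashes logits
    #    to [0, 1]; the no_answer span takes part in the normalization
    probs = np.exp(span_logits - span_logits.max())
    probs /= probs.sum()
    confidence = probs[span_idx]

    # c) confidence (calibrated): divide the logits by the learned
    #    temperature_for_confidence before the softmax (temperature scaling)
    scaled = span_logits / temperature
    probs_cal = np.exp(scaled - scaled.max())
    probs_cal /= probs_cal.sum()
    confidence_calibrated = probs_cal[span_idx]

    return score, confidence, confidence_calibrated
```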


@julian-risch (Member, Author):


Good point. I added an explanation of the three kinds of scores to the doc string of the QAPredictionHead.

@julian-risch merged commit 2baa409 into master on Aug 18, 2021

Successfully merging this pull request may close these issues:

- allow switching between confidence scores in reader (#808)