Explain why a subject was matched #19

Open
osma opened this issue Oct 5, 2017 · 4 comments


osma commented Oct 5, 2017

When Annif returns bad subjects, it can be difficult to understand why they were suggested. An explain parameter for the analyze functionality could enable this: for each suggested subject, it would return the text of all the blocks in the document that contributed to the subject assignment, sorted by score (highest first). This would give at least some idea of which parts of the document caused the match.
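
A rough sketch of what such a response could look like; the explain parameter and the blocks field are hypothetical, nothing like this exists in Annif yet:

# Hypothetical shape of one suggestion when explanation is enabled.
# Neither the "explain" parameter nor the "blocks" field is real Annif
# API; this only illustrates the idea described above.
explained_suggestion = {
    "uri": "http://www.yso.fi/onto/yso/p19378",
    "label": "cat",
    "score": 0.41,
    # document blocks that contributed to the match, highest score first
    "blocks": [
        {"text": "the cat sat on the mat", "score": 0.41},
    ],
}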

osma added this to the Long term milestone on Oct 5, 2017

osma commented May 19, 2018

LIME could be useful for this: https://github.com/marcotcr/lime/
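
For reference, a minimal sketch of wiring LIME's text explainer to a classifier function. LimeTextExplainer, explain_instance and as_list are real LIME API; the annif_predict_proba wrapper is a hypothetical stand-in (here a toy keyword scorer) for code that would call Annif and return per-subject scores:

import numpy as np
from lime.lime_text import LimeTextExplainer

subject_labels = ["cat", "place mats"]  # toy subject vocabulary

def annif_predict_proba(texts):
    # Hypothetical wrapper: a real version would ask an Annif project
    # for suggestions on each text and return an array of per-subject
    # scores with shape (n_texts, n_subjects), as LIME expects.
    scores = np.array([[1.0 if "cat" in t else 0.0,
                        1.0 if "mat" in t else 0.0] for t in texts])
    return scores / np.maximum(scores.sum(axis=1, keepdims=True), 1e-9)

explainer = LimeTextExplainer(class_names=subject_labels)
explanation = explainer.explain_instance(
    "the cat sat on the mat",
    annif_predict_proba,
    num_features=6,  # max number of words in the explanation
    labels=(0,),     # explain the first subject ("cat")
)
print(explanation.as_list(label=0))  # (word, weight) pairs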

annakasprzik commented

In general, I would like an option both for 'suggest' and for 'eval' that returns the confidence scores for each descriptor and for each document, for evaluation purposes. Not sure if Annif already produces such an output anywhere?


osma commented Sep 3, 2019

@annakasprzik This is what the suggest command does - it will give you the confidence scores in the output. Like this:

$ echo "the cat sat on the mat" | annif suggest tfidf-en
<http://www.yso.fi/onto/yso/p26645>	place mats	0.5739196571753897
<http://www.yso.fi/onto/yso/p19378>	cat	0.412109991386263
<http://www.yso.fi/onto/yso/p864>	Felidae	0.4004559418090339
<http://www.yso.fi/onto/yso/p24992>	stray cats	0.31746311805949967
<http://www.yso.fi/onto/yso/p24619>	exotic (cat)	0.27605877849495275
<http://www.yso.fi/onto/yso/p24278>	Norwegian forest cat	0.2735824095480068
<http://www.yso.fi/onto/yso/p24186>	Siberian cat	0.2712520343571323
<http://www.yso.fi/onto/yso/p20058>	wildcat	0.2446630680506471
<http://www.yso.fi/onto/yso/p21172>	street musicians	0.23004085661703863
<http://www.yso.fi/onto/yso/p29087>	cat breeders	0.2211696167751634

The third column is the confidence score (between 0.0 and 1.0). Its interpretation varies a bit between the models.

For the eval command I don't think returning such scores makes sense, as it operates on a higher level: you give it a set of manually indexed documents, it compares the algorithm-suggested subjects (taking the predicted scores into account) with the manual ones, and it calculates overall similarity measures such as F1 and NDCG.
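
To illustrate the kind of aggregate measure eval reports, here is a toy set-based F1 calculation for a single document. This is not Annif's actual implementation (which also uses score-aware metrics such as NDCG); it only shows the basic precision/recall/F1 idea:

# Toy per-document F1 against gold-standard (manually assigned) subjects.
suggested = {"cat", "place mats", "Felidae"}
gold = {"cat", "mats"}

true_pos = len(suggested & gold)               # 1
precision = true_pos / len(suggested)          # 1/3
recall = true_pos / len(gold)                  # 1/2
f1 = 2 * precision * recall / (precision + recall)
print(f"P={precision:.2f} R={recall:.2f} F1={f1:.2f}")  # F1=0.40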


osma commented Sep 3, 2019

BTW there's a great blog post on the ideas behind LIME, by the authors.
