Raise warning to advice to retrain project after vocabulary update #485

juhoinkinen · 2021-04-26T15:01:45Z

In the previous Finto AI model update-round the the same mistake was made twice: a (base) project training was interrupted but not immediately noticed as there existed an old model with the same project id. Noticing the mistake was not easy from the suggestion or evaluation results either, because the old model produced sensible suggestions coming from the new vocabulary. The vocabulary had of course been loaded before the training (updating the vocabulary was introduced in #274/#383).

Annif could emit a warning when suggesting with a model, whose vocabulary has been modified since the model has been trained.

Implementation could rely on comparing the timestamps of the model/vocabulary files in the project/vocabulary directories, which would be straightforward. However, the timestamps of the model files could be greater than (after) the timestamps of the vocabulary files even when the model has not been retrained at least in two cases, and these could lead the warning to be missing:

in case of learn of a learning backend has been used
if (re)training has been interrupted, but the backend have created some temporary files in the project directory (like fasttext-train9ic43tsy.txt) that remain

The text was updated successfully, but these errors were encountered:

juhoinkinen added the enhancement label Apr 26, 2021

juhoinkinen mentioned this issue May 3, 2021

YAKE backend #461

Merged

osma added this to the Long term milestone Feb 14, 2022

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Raise warning to advice to retrain project after vocabulary update #485

Raise warning to advice to retrain project after vocabulary update #485

juhoinkinen commented Apr 26, 2021

Raise warning to advice to retrain project after vocabulary update #485

Raise warning to advice to retrain project after vocabulary update #485

Comments

juhoinkinen commented Apr 26, 2021