Caching analysis results with memcached #241
Might be easier if we use containers and/or VMs for testing and development, I think. The docs are already great for development; I could set up Annif quite quickly. But nothing beats being able to run one command (and having quick access to all the commands used for the installation).
When testing different settings and training ensembles, the same documents are often analyzed over and over with the same backend. This is needlessly slow. It would help a lot if the analysis results could be cached.
Ideally the cache should be persistent across separate invocations of Annif, sharable by multiple Annif processes working in parallel, and automatically expire cached results after some TTL. memcached seems ideal for this purpose.
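The desired behaviour (get/set with automatic expiry after a TTL) could be sketched as follows. This is just an in-memory stand-in for illustration; in practice a memcached client would provide the same semantics, plus persistence across processes. The `TTLCache` class and its injectable `clock` parameter are hypothetical, not part of Annif:

```python
import time


class TTLCache:
    """Minimal in-memory stand-in for a memcached-style cache:
    entries expire automatically after a fixed time-to-live (TTL)."""

    def __init__(self, ttl_seconds, clock=time.monotonic):
        self.ttl = ttl_seconds
        self.clock = clock  # injectable for testing
        self.store = {}

    def set(self, key, value):
        # Record the value together with its expiry deadline.
        self.store[key] = (value, self.clock() + self.ttl)

    def get(self, key):
        entry = self.store.get(key)
        if entry is None:
            return None
        value, expires_at = entry
        if self.clock() >= expires_at:
            # Stale entry: drop it and report a cache miss.
            del self.store[key]
            return None
        return value
```

With memcached itself, the TTL would instead be passed per entry (e.g. an `expire` argument on the client's `set` call), and the store would be shared by all Annif processes pointing at the same server.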
The cache keys have to be carefully chosen, based e.g. on the project configuration and the timestamps of trained model files, so that stale entries are never retrieved from the cache after a project is reconfigured or retrained.
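One way to derive such keys is to hash a fingerprint of everything the result depends on. The sketch below is only illustrative; the function name and its parameters (`project_id`, `project_config`, `model_mtime`) are assumptions, not Annif's actual API:

```python
import hashlib
import json


def analysis_cache_key(project_id, project_config, model_mtime, text):
    """Derive a cache key that changes whenever the project
    configuration, the trained model, or the input text changes.

    All names here are hypothetical, for illustration only.
    """
    # Serialize the configuration deterministically (sorted keys),
    # together with the model file's modification time.
    fingerprint = json.dumps(
        {
            "project": project_id,
            "config": project_config,
            "model_mtime": model_mtime,
        },
        sort_keys=True,
    )
    config_digest = hashlib.sha256(fingerprint.encode("utf-8")).hexdigest()
    text_digest = hashlib.sha256(text.encode("utf-8")).hexdigest()
    # Truncated digests keep the key within memcached's 250-byte limit.
    return f"annif:{project_id}:{config_digest[:16]}:{text_digest[:16]}"
```

Because the model file's mtime is part of the fingerprint, retraining a model invalidates all of its cached results without any explicit flush, and the TTL handles eventual cleanup of the orphaned entries.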