analyzedir command #94

Closed
osma opened this issue Apr 6, 2018 · 0 comments

osma commented Apr 6, 2018

The analyze command only analyzes one document per invocation. For some backends the startup overhead is very significant, so analyzing many documents one at a time is inefficient.

We could have an analyzedir CLI command that works similarly to evaldir (and MauiTopicIndexer), i.e. it runs analyze on a directory of .txt files and writes the results into TSV files. The file extension / suffix could default to .annif, but it should be possible to set it on the command line. Something like:

annif analyzedir [--suffix .annif] [--limit 10] [--threshold 0.00] [--verbosity INFO] <backend> <directory>

(--backend-param should also be supported)
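
A rough sketch of the directory loop such a command might run, not the actual implementation. The names `analyze_dir` and `analyze`, and the TSV column layout (URI, label, score), are assumptions made for illustration; `analyze` stands in for whatever callable the already-loaded backend provides, so the startup cost is paid once for the whole directory.

```python
from pathlib import Path
from typing import Callable, Iterable, List, Tuple

# Hypothetical result shape: (subject URI, label, score).
Hit = Tuple[str, str, float]


def analyze_dir(
    directory: str,
    analyze: Callable[[str], Iterable[Hit]],
    suffix: str = ".annif",
    limit: int = 10,
    threshold: float = 0.0,
) -> List[Path]:
    """Run `analyze` on every .txt file in `directory`.

    For each input file, write the top `limit` hits scoring at least
    `threshold` to a TSV file with the same base name and `suffix`.
    Returns the list of files written.
    """
    written = []
    for txt_path in sorted(Path(directory).glob("*.txt")):
        text = txt_path.read_text(encoding="utf-8")
        # Drop low-scoring hits, then order the rest best-first.
        hits = [hit for hit in analyze(text) if hit[2] >= threshold]
        hits.sort(key=lambda hit: hit[2], reverse=True)
        # "doc.txt" becomes "doc.annif" with the default suffix.
        out_path = txt_path.parent / (txt_path.stem + suffix)
        with out_path.open("w", encoding="utf-8") as out:
            for uri, label, score in hits[:limit]:
                # Assumed column layout: <URI>, label, score.
                out.write(f"<{uri}>\t{label}\t{score}\n")
        written.append(out_path)
    return written
```

Passing the backend in as a callable keeps the loop independent of any particular backend, and options such as --backend-param could be applied when that callable is constructed.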

osma added this to the Short term milestone Apr 6, 2018
osma modified the milestones: Short term, v0.19 Apr 6, 2018
osma closed this as completed in c481c8a Apr 6, 2018