pip
installable versions of tesseract-ocr data
usually you'll want to pick a particular package for installation.
currently the following are provided (send a PR to add more!):
- tessdata.deu
- tessdata.eng
- tessdata.jpn
- tessdata.fast-deu
- tessdata.fast-eng
- tessdata.fast-jpn
these will install to {prefix}/share/tessdata
-- you can access this
directory path using the tessdata
api:
import tessdata
print(tessdata.data_path()) # /path/to/venv/share/tessdata