A tool for interactively selecting text regions in PDFs and images. Use the selections with PDFQuery to extract PDF areas in Python, or with tesseract and UZN (OCR zone) files for image-to-text.
Find the interactive version at https://jsoma.github.io/kull/.
If you'd like to simplify the tesseract/.uzn process, try tesseract-uzn.
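For orientation, here is a minimal sketch of what a zone file looks like when written by hand. It assumes the commonly used UZN layout of one zone per line as `left top width height label` (pixel units, label without spaces), saved next to the image under the same base name. The helper names `format_uzn` and `write_uzn` and the sample coordinates are hypothetical, and this is not how this tool or tesseract-uzn produces its output.

```python
from pathlib import Path


def format_uzn(zones):
    """Build UZN text: one 'left top width height label' line per zone."""
    lines = []
    for left, top, width, height, label in zones:
        lines.append(f"{left} {top} {width} {height} {label}")
    return "\n".join(lines) + "\n"


def write_uzn(image_path, zones):
    """Write the zones to a .uzn file that shares the image's base name."""
    uzn_path = Path(image_path).with_suffix(".uzn")
    uzn_path.write_text(format_uzn(zones))
    return uzn_path


# Hypothetical zones: (left, top, width, height, label)
zones = [
    (40, 60, 500, 80, "Title"),
    (40, 160, 500, 600, "Body"),
]

print(format_uzn(zones), end="")
```

Calling `write_uzn("page.png", zones)` would create `page.uzn` beside the image. Tesseract is then typically run with a page segmentation mode that honors zone files, for example `tesseract page.png out --psm 4`; check this against your tesseract version.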