ocr-runner is a command-line application that recursively walks the input path and submits every image file it finds for optical character recognition (OCR). It uses the Google Cloud Vision API or, if a prediction endpoint is provided, a Google Cloud Document AI processor. The application then writes the image information and annotations to a single newline-delimited JSON file.
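As an illustration of the newline-delimited JSON output format, here is a minimal Go sketch that writes one JSON object per line. The `record` type and its `path` and `text` fields are hypothetical placeholders; the actual fields ocr-runner emits are not documented here.

```go
package main

import (
	"bytes"
	"encoding/json"
	"fmt"
	"io"
	"os"
)

// record is a hypothetical output shape used only for illustration.
type record struct {
	Path string `json:"path"`
	Text string `json:"text"`
}

// writeNDJSON writes each record as a single JSON object on its own line.
func writeNDJSON(w io.Writer, recs []record) error {
	enc := json.NewEncoder(w) // Encode appends a newline after every value
	for _, r := range recs {
		if err := enc.Encode(r); err != nil {
			return err
		}
	}
	return nil
}

// encodeLines is a small helper that returns the NDJSON output as a string.
func encodeLines(recs []record) (string, error) {
	var buf bytes.Buffer
	if err := writeNDJSON(&buf, recs); err != nil {
		return "", err
	}
	return buf.String(), nil
}

func main() {
	recs := []record{
		{Path: "a.png", Text: "hello"},
		{Path: "b.jpg", Text: "world"},
	}
	if err := writeNDJSON(os.Stdout, recs); err != nil {
		fmt.Fprintln(os.Stderr, err)
		os.Exit(1)
	}
}
```

Because every line is a complete JSON object, the output file can be processed one line at a time without loading it all into memory.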
USAGE:
ocr-runner -i PATH -o FILE
ARGS:
-endpoint string
Document AI Prediction Endpoint (Optional)
-full
Output full details to JSON
-i string
Input Path (Required)
-o string
Output File (Required)
-verbose
Display verbose or debug detail
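The arguments above could be parsed with Go's standard `flag` package roughly as follows. This is a sketch, not the tool's actual source: the `options` struct, the `parseArgs` function, and the check that `-i` and `-o` are both present are assumptions based only on the argument list.

```go
package main

import (
	"errors"
	"flag"
	"fmt"
	"io"
	"os"
)

// options is a hypothetical container for the documented arguments.
type options struct {
	Endpoint string
	Full     bool
	Input    string
	Output   string
	Verbose  bool
}

// parseArgs parses the documented flags and rejects a missing -i or -o.
func parseArgs(args []string) (options, error) {
	var o options
	fs := flag.NewFlagSet("ocr-runner", flag.ContinueOnError)
	fs.SetOutput(io.Discard) // let the caller decide how to report errors
	fs.StringVar(&o.Endpoint, "endpoint", "", "Document AI Prediction Endpoint (Optional)")
	fs.BoolVar(&o.Full, "full", false, "Output full details to JSON")
	fs.StringVar(&o.Input, "i", "", "Input Path (Required)")
	fs.StringVar(&o.Output, "o", "", "Output File (Required)")
	fs.BoolVar(&o.Verbose, "verbose", false, "Display verbose or debug detail")
	if err := fs.Parse(args); err != nil {
		return o, err
	}
	// Assumed validation: both required arguments must be non-empty.
	if o.Input == "" || o.Output == "" {
		return o, errors.New("both -i and -o are required")
	}
	return o, nil
}

func main() {
	opts, err := parseArgs(os.Args[1:])
	if err != nil {
		fmt.Fprintln(os.Stderr, "error:", err)
		os.Exit(2)
	}
	fmt.Printf("%+v\n", opts)
}
```

With this sketch, an invocation such as `-i ./scans -o out.json -full` yields options with `Full` set and an empty `Endpoint`, which would correspond to the Vision API path described above.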
The application will automatically filter out all files that do not have one of the following extensions:
.bmp
.gif
.jpg
.jpeg
.pdf
.png
.tif
.tiff
.webp
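The recursive walk and extension filter described above can be sketched in Go as shown below. The names `supportedExts`, `isSupported`, and `collectFiles` are hypothetical, and matching extensions case-insensitively is an assumption; the actual tool may behave differently.

```go
package main

import (
	"fmt"
	"io/fs"
	"os"
	"path/filepath"
	"strings"
)

// supportedExts mirrors the extension list in this README.
var supportedExts = map[string]bool{
	".bmp": true, ".gif": true, ".jpg": true, ".jpeg": true, ".pdf": true,
	".png": true, ".tif": true, ".tiff": true, ".webp": true,
}

// isSupported reports whether path ends in one of the listed extensions.
// Lowercasing the extension first is an assumption, not documented behavior.
func isSupported(path string) bool {
	return supportedExts[strings.ToLower(filepath.Ext(path))]
}

// collectFiles walks root recursively and returns every supported file.
func collectFiles(root string) ([]string, error) {
	var files []string
	err := filepath.WalkDir(root, func(path string, d fs.DirEntry, err error) error {
		if err != nil {
			return err // stop on the first unreadable entry
		}
		if !d.IsDir() && isSupported(path) {
			files = append(files, path)
		}
		return nil
	})
	return files, err
}

func main() {
	root := "."
	if len(os.Args) > 1 {
		root = os.Args[1]
	}
	files, err := collectFiles(root)
	if err != nil {
		fmt.Fprintln(os.Stderr, err)
		os.Exit(1)
	}
	for _, f := range files {
		fmt.Println(f)
	}
}
```

Keeping the extensions in a map makes the filter a single lookup per file and keeps the list easy to extend.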
ocr-runner is released under the Apache License 2.0, unless a file's header explicitly states otherwise.