Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

PyTesseract cannot read my number #538

Closed
matheustecchio opened this issue Feb 28, 2024 · 2 comments
Closed

PyTesseract cannot read my number #538

matheustecchio opened this issue Feb 28, 2024 · 2 comments

Comments

@matheustecchio
Copy link

PROCESSED-2024-02-27

I'm trying to read this number with pyTesseract, but it can't read the number 1 in 351.

How can I fix it? All the value that I want are numbers.

@stefan6419846
Copy link
Contributor

Please provide the corresponding pytesseract code you are using. You might want to play with tessedit_char_whitelist and the PSMs (page segmentation modes).

Nevertheless, pytesseract will never have any direct influence on this, as it is just a wrapper around the Tesseract CLI itself. If Tesseract performs bad, pytesseract cannot do much about it. For this reason, your request seems to be out of scope here. You might want to try further examples or some different preprocessing which might perform better (or re-train a Tesseract model yourself).

@matheustecchio
Copy link
Author

matheustecchio commented Feb 28, 2024

Hi Stefan, alright, thank you for your reply.
The whitelist and PSMs didn't work so I find out that I probably will need to re-train my Tesseract model as the problem is the model itself.
I'm closing this issue as is out of scope.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants