image_to_data default output type is string #510

Closed
karladler opened this issue Aug 30, 2023 · 2 comments

Comments

@karladler

Not sure if this can be changed without breaking backwards compatibility, but I was really confused that the output type of the image_to_data() function is actually a string. Given the function name, IMHO it should be a data.frame.

@stefan6419846
Contributor

This is documented explicitly:

"""
Returns string containing box boundaries, confidences,
and other information. Requires Tesseract 3.05+
"""

You can still choose between the different representations. I assume that data.frame is not the default because pandas is an optional dependency, so defaulting to it might fail.

@bozhodimitrov
Collaborator

bozhodimitrov commented Aug 30, 2023

Are you talking about image_to_data? Its output_type parameter exists for exactly that. Please check the docs for the pytesseract.Output class.
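
A minimal sketch of what that looks like in practice, assuming the default string output is Tesseract's tab-separated (TSV) table. SAMPLE_TSV and parse_tsv are hypothetical names, and the sample values are made up so the snippet runs without Tesseract installed. The commented lines at the end show the output_type route with pytesseract.Output.

```python
import csv
import io

# Hypothetical sample that mimics the tab-separated text image_to_data()
# returns by default. The values are invented for illustration only.
SAMPLE_TSV = (
    "level\tpage_num\tblock_num\tpar_num\tline_num\tword_num\t"
    "left\ttop\twidth\theight\tconf\ttext\n"
    "5\t1\t1\t1\t1\t1\t10\t12\t40\t15\t96\tHello\n"
    "5\t1\t1\t1\t1\t2\t55\t12\t42\t15\t91\tworld\n"
)


def parse_tsv(data: str) -> list[dict[str, str]]:
    """Turn the string output into one dict per row (all values stay strings)."""
    reader = csv.DictReader(
        io.StringIO(data), delimiter="\t", quoting=csv.QUOTE_NONE
    )
    return list(reader)


rows = parse_tsv(SAMPLE_TSV)
words = [row["text"] for row in rows]
print(words)

# With pytesseract installed (plus pandas for DATAFRAME), the library can
# return a structured result directly via output_type:
#   import pytesseract
#   from pytesseract import Output
#   df = pytesseract.image_to_data(image, output_type=Output.DATAFRAME)
#   d = pytesseract.image_to_data(image, output_type=Output.DICT)
```

Output.DICT needs no extra dependency, so it is a reasonable middle ground when pandas is not available.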
