OCR Engine Support: Pororo (kakaobrain/pororo, potentially used together with EasyOCR) #2

Closed
junhoyeo opened this issue Oct 29, 2023 · 1 comment · Fixed by #8

black7375 commented Oct 29, 2023

To summarize the tweet again (X currently does not show the list of replies unless you are logged in):

  1. EasyOCR is excellent in text detection.
  2. Pororo is superior to EasyOCR in text recognition.
  3. Preprocessing is another way to raise the recognition rate, but how it is applied has to vary with the characteristics of the OCR engine.
    • For example, Tesseract v3.05 and earlier does better with dark backgrounds, whereas v4.0 and later does better with bright backgrounds.
    • Techniques such as normalization, binarization, and skeletonization can work well on document images, but they are not suitable for photographic images. (Small letters are very likely to clump together and become indistinguishable.)
    • One of the few preprocessing steps that works well with most OCR engines is grayscale conversion.
    • If the ROI (region of interest) is too small, it is better to scale it up.
    • I am convinced that Tesseract's OSD (orientation and script detection), perspective transformation estimation, or dewarping will improve performance.
      However, I expect it to be difficult to detect when each one is appropriate and apply it only in those cases.
      The easy way is to apply them to each ROI.
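
The points above could be combined roughly as follows. This is only a minimal sketch, not how either library actually works: `detect` and `recognize` are hypothetical placeholder callables (for example, a wrapper around EasyOCR's detector and one around Pororo's recognizer), images are plain nested lists so the example has no dependencies, and the `min_height=32` threshold is an arbitrary assumption. It applies the two preprocessing steps that are described as broadly safe, grayscale conversion and scaling up small ROIs, to each detected region before recognition.

```python
from typing import Callable, List, Tuple

Pixel = Tuple[int, int, int]        # (R, G, B), each 0-255
RGBImage = List[List[Pixel]]        # rows of pixels
GrayImage = List[List[int]]         # rows of 0-255 luminance values
Box = Tuple[int, int, int, int]     # (x_min, y_min, x_max, y_max), max exclusive


def to_grayscale(image: RGBImage) -> GrayImage:
    """Convert RGB to luminance using the ITU-R BT.601 weights."""
    return [
        [round(0.299 * r + 0.587 * g + 0.114 * b) for r, g, b in row]
        for row in image
    ]


def crop(image: GrayImage, box: Box) -> GrayImage:
    """Cut the ROI described by box out of the image."""
    x0, y0, x1, y1 = box
    return [row[x0:x1] for row in image[y0:y1]]


def upscale_if_small(roi: GrayImage, min_height: int = 32) -> GrayImage:
    """Nearest-neighbour upscale by an integer factor if the ROI is too short.

    min_height is an assumed threshold, not a value from any OCR engine.
    """
    height = len(roi)
    if height == 0 or height >= min_height:
        return roi
    factor = -(-min_height // height)  # ceiling division
    return [
        [value for value in row for _ in range(factor)]
        for row in roi
        for _ in range(factor)
    ]


def run_ocr(
    image: RGBImage,
    detect: Callable[[RGBImage], List[Box]],    # hypothetical: e.g. EasyOCR detection
    recognize: Callable[[GrayImage], str],      # hypothetical: e.g. Pororo recognition
    min_height: int = 32,
) -> List[Tuple[Box, str]]:
    """Detect text boxes with one engine, preprocess each ROI, recognize with another."""
    gray = to_grayscale(image)
    results = []
    for box in detect(image):
        roi = upscale_if_small(crop(gray, box), min_height)
        results.append((box, recognize(roi)))
    return results
```

Because preprocessing happens per ROI, engine-specific steps (inverting the background for a particular Tesseract version, OSD, dewarping) could be added inside the loop without touching detection.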
