-
Notifications
You must be signed in to change notification settings - Fork 11
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Do you have any plan that release the code for dataset preparation? #19
Comments
Hi, thanks for your interest! We'll have a code release in the next few weeks, including the dataset-related code. |
@StevenyzZhang follow up on this. Specifically, I'm interested in the code that merges the OCR texts from PaddleOCR to paragraphs based on the geometric relationships mentioned in Section3 of paper. Thanks in advance. |
For the code merging OCR blocks, I used this. |
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Hey, thanks for your great work!
I want to make this type of dataset on other image-text datasets.
Do you have any plan that release the related code?
Thanks.
The text was updated successfully, but these errors were encountered: