How to train using the original lmdb dataset from this repo but also using a custom dataset? #422

CharlyJazz · 2024-05-19T23:54:13Z

I need to merge the lmdb dataset that this repo use in the instructions with my dataset? Because when I use only my dataset the OCR is not as good as the OCR trained with the dataset created for this repo. Thanks!

AdamKheire · 2024-09-17T10:28:57Z

@CharlyJazz did you get the solution

CharlyJazz · 2024-09-18T00:44:55Z

There is a class ConcatenatedDataset, you can put your dataset side by side of the others datasets but keep in mind that the the are sorted by folder name, and it gonna take a long time training on the originals lmdb, so your dataset should be first, to make sure it gonna be use in the training session first. I start my dataset name with "a"

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

How to train using the original lmdb dataset from this repo but also using a custom dataset? #422

How to train using the original lmdb dataset from this repo but also using a custom dataset? #422

CharlyJazz commented May 19, 2024

AdamKheire commented Sep 17, 2024

CharlyJazz commented Sep 18, 2024

How to train using the original lmdb dataset from this repo but also using a custom dataset? #422

How to train using the original lmdb dataset from this repo but also using a custom dataset? #422

Comments

CharlyJazz commented May 19, 2024

AdamKheire commented Sep 17, 2024

CharlyJazz commented Sep 18, 2024