Here are 1,971 public repositories matching this topic...
Awesome multilingual OCR toolkits based on PaddlePaddle (a practical, ultra-lightweight OCR system that supports recognition of 80+ languages, provides data annotation and synthesis tools, and supports training and deployment on server, mobile, embedded, and IoT devices)
Updated
Jun 3, 2024
Python
Ready-to-use OCR with 80+ supported languages and all popular writing scripts, including Latin, Chinese, Arabic, Devanagari, Cyrillic, etc.
Updated
May 30, 2024
Python
A community-supported supercharged version of paperless: scan, index and archive all your physical documents
Updated
Jun 3, 2024
Python
OCRmyPDF adds an OCR text layer to scanned PDF files, allowing them to be searched
Updated
Jun 1, 2024
Python
pix2tex: Using a ViT to convert images of equations into LaTeX code.
Updated
May 29, 2024
Python
RAGFlow is an open-source RAG (Retrieval-Augmented Generation) engine based on deep document understanding.
Updated
Jun 3, 2024
Python
Scan, index, and archive all of your paper documents
Updated
Apr 6, 2021
Python
Official Implementation of OCR-free Document Understanding Transformer (Donut) and Synthetic Document Generator (SynthDoG), ECCV 2022
Updated
May 30, 2024
Python
A supercharged version of paperless: scan, index and archive all your physical documents
Updated
Feb 14, 2023
Python
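Extracts hard-coded subtitles from videos and generates srt files. No third-party API is required; text recognition runs locally. A deep-learning-based video subtitle extraction framework that includes subtitle region detection and subtitle content extraction. A GUI tool for extracting hard-coded subtitles (hardsub) from videos and generating srt files.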
Updated
Feb 21, 2024
Python
A Unified Toolkit for Deep Learning Based Document Image Analysis
Updated
Mar 7, 2024
Python
PyMuPDF is a high performance Python library for data extraction, analysis, conversion & manipulation of PDF (and other) documents.
Updated
Jun 1, 2024
Python
OpenMMLab Text Detection, Recognition and Understanding Toolbox
Updated
Jun 2, 2024
Python
Text detection based mainly on the CTPN (Connectionist Text Proposal Network) model in TensorFlow, including ID card detection
Updated
Oct 3, 2023
Python
AdelaiDet is an open source toolbox for multiple instance-level detection and recognition tasks.
Updated
Mar 5, 2024
Python
docTR (Document Text Recognition) - a seamless, high-performing & accessible library for OCR-related tasks powered by Deep Learning.
Updated
Jun 2, 2024
Python
A synthetic data generator for text recognition
Updated
May 22, 2024
Python