Stars
CDLA: A Chinese document layout analysis (CDLA) dataset
Baichuan-Omni: Towards Capable Open-source Omni-modal LLM 🌊
【ArXiv】PDF-Wukong: A Large Multimodal Model for Efficient Long PDF Reading with End-to-End Sparse Sampling
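"A Guide to Using Open-Source Large Models": a tutorial for quickly deploying open-source large models in a Linux environment, written with Chinese beginners in mind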
A Curated List of Awesome Table Structure Recognition (TSR) Research. Including models, papers, datasets and codes. Continuously updating.
OCR/OCSR on handwriting ⏣/chemical-structural-formulas with YOLO & CRNN models.
PDF GPT allows you to chat with the contents of your PDF file by using GPT capabilities. The most effective open source solution to turn your PDF files into a chatbot!
MTL-TabNet: Multi-task Learning based Model for Image-based Table Recognition
A lightweight Python library for simulating Chinese handwriting
Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities
Sanster / FasterTransformer
Forked from NVIDIA/FasterTransformer
Transformer related optimization, including BERT, GPT
OCR document image deformation correction. A reproduction of Alibaba's OCR deformation correction for crumpled document images
NanoDet-Plus⚡Super fast and lightweight anchor-free object detection model. 🔥Only 980 KB(int8) / 1.8MB (fp16) and run 97FPS on cellphone🔥
OpenVINO™ is an open-source toolkit for optimizing and deploying AI inference
Code for the paper "MASTER: Multi-Aspect Non-local Network for Scene Text Recognition" (Pattern Recognition 2021)
End-to-end layout analysis based on seq2seq
Bourne-M / PytorchOCR
Forked from WenmuZhou/PytorchOCR
A PyTorch-based OCR toolkit supporting common text detection and recognition algorithms
cnn-selfattention-ctc ocr tensorflow1.x
Image Polygonal Annotation with Python (polygon, rectangle, circle, line, point and image-level flag annotation).
Label Studio is a multi-type data labeling and annotation tool with standardized output format
Implementation of popular deep learning networks with TensorRT network definition API
deep learning for image processing including classification and object-detection etc.
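Chinese and English sensitive words, language detection, location and carrier lookup for Chinese and international mobile/landline numbers, gender inference from names, phone number extraction, ID card number extraction, email extraction, Chinese and Japanese personal name databases, Chinese abbreviation database, character decomposition dictionary, word sentiment values, stop words, reactionary word list, terrorism-related word list, Traditional/Simplified Chinese conversion, English imitation of Chinese pronunciation, Wang Feng lyrics generator, occupation name lexicon, synonym lexicon, antonym lexicon, negation word lexicon, car brand lexicon, car parts lexicon, segmentation of continuous English text, various Chinese word vectors, company name collection, classical poetry database, IT lexicon, finance lexicon, idiom lexicon, place name lexicon, …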