Starred repositories

The paper collections for the autoregressive models in vision.

199 9 Updated Nov 16, 2024

HART: Efficient Visual Generation with Hybrid Autoregressive Transformer

Python 336 14 Updated Oct 16, 2024

NDL Kotenseki (classical Japanese books) OCR training dataset (processed data from Minna de Honkoku)

Python 12 2 Updated Feb 7, 2024

Official pytorch implementation of the paper: "An Edit Friendly DDPM Noise Space: Inversion and Manipulations". CVPR 2024.

Python 280 13 Updated Jul 11, 2024

NDL Kotenseki (classical Japanese books) OCR application (including source code)

Python 39 13 Updated Oct 31, 2024

Generating handwritten Chinese characters using CycleGAN

Python 38 9 Updated Nov 13, 2019
Python 2 Updated Nov 5, 2024

High-Resolution Image Synthesis with Latent Diffusion Models

Python 39,167 5,050 Updated Oct 10, 2024

[ECCV 2024] PowerPaint, a versatile image inpainting model that supports text-guided object inpainting, object removal, image outpainting and shape-guided object inpainting with only a single model…

Python 658 41 Updated Sep 8, 2024

Code for generating synthetic text images as described in "Synthetic Data for Text Localisation in Natural Images", Ankush Gupta, Andrea Vedaldi, Andrew Zisserman, CVPR 2016.

Python 2,027 622 Updated Aug 9, 2023

IMGUR5K handwriting set. It is a handwritten in-the-wild dataset that contains challenging real-world handwritten samples from different writers. The dataset is shared as a set of image urls with …

Python 285 55 Updated Mar 12, 2024

UC3M License Plate detection and recognition dataset

Python 8 Updated Nov 16, 2024

Collecting awesome papers of RAG for AIGC. We propose a taxonomy of RAG foundations, enhancements, and applications in paper "Retrieval-Augmented Generation for AI-Generated Content: A Survey".

1,254 86 Updated Aug 20, 2024

Qwen2.5 is the large language model series developed by Qwen team, Alibaba Cloud.

Shell 9,646 594 Updated Nov 11, 2024

A two-stage, lightweight, high-performance license plate recognition pipeline using MTCNN and LPRNet

Jupyter Notebook 656 171 Updated Jan 22, 2024

MiniCPM-V 2.6: A GPT-4V Level MLLM for Single Image, Multi Image and Video on Your Phone

Python 12,595 886 Updated Oct 22, 2024

License Plate Detection and Recognition in Unconstrained Scenarios

C 1,722 607 Updated Jul 1, 2022

PyTorch code of my ICDAR 2021 paper Vision Transformer for Fast and Efficient Scene Text Recognition (ViTSTR)

Jupyter Notebook 292 59 Updated Apr 9, 2024

Source for NomNaTong-regular Vietnamese chữ Nôm font.

Python 71 6 Updated Nov 12, 2024

A synthetic data generator for text recognition

Python 8 Updated May 22, 2023

TAO Toolkit deep learning networks with PyTorch backend

Python 87 18 Updated Nov 7, 2024

A synthetic data generator for text recognition

Python 3 Updated Oct 10, 2023

Leverage Deep Learning to digitize old Vietnamese handwriting for historical document archiving (Made with national pride in every single line of code): https://www.kaggle.com/datasets/quandang/nom…

Jupyter Notebook 116 22 Updated Jun 11, 2024

CORD: A Consolidated Receipt Dataset for Post-OCR Parsing

398 36 Updated Jul 20, 2022

docTR (Document Text Recognition) - a seamless, high-performing & accessible library for OCR-related tasks powered by Deep Learning.

Python 3,859 443 Updated Nov 13, 2024

A collection of the best currently open-source table recognition models, with improved pre-processing and post-processing, and with the models converted to ONNX.

Python 295 34 Updated Nov 16, 2024

A one-stop, open-source, high-quality data extraction tool that supports PDF, webpage, and multi-format e-book extraction.

Python 16,511 1,194 Updated Nov 15, 2024

A toolbox of OCR models and algorithms based on MindSpore

Python 219 56 Updated Nov 15, 2024