Holmes2002

Follow

Itsuki Holmes2002

Follow

2 followers · 45 following

Achievements

Achievements

Starred repositories

ChaofanTao / Autoregressive-Models-in-Vision-Survey

The paper collections for the autoregressive models in vision.

199 9 Updated Nov 16, 2024

mit-han-lab / hart

HART: Efficient Visual Generation with Hybrid Autoregressive Transformer

Python 336 14 Updated Oct 16, 2024

ndl-lab / ndl-minhon-ocrdataset

NDL古典籍OCR学習用データセット（みんなで翻刻加工データ）

Python 12 2 Updated Feb 7, 2024

inbarhub / DDPM_inversion

Official pytorch implementation of the paper: "An Edit Friendly DDPM Noise Space: Inversion and Manipulations". CVPR 2024.

Python 280 13 Updated Jul 11, 2024

ndl-lab / ndlkotenocr_cli

NDL古典籍OCRのアプリケーション（ソースコードを含む）

Python 39 13 Updated Oct 31, 2024

ZC119 / Handwritten-CycleGAN

Generating handwritten Chinese characters using CycleGAN

Python 38 9 Updated Nov 13, 2019

Holmes2002 / Awesome-Table-Recognition

Python 4 Updated Nov 5, 2024

Holmes2002 / STN-Retina-Face

Python 2 Updated Nov 5, 2024

Stability-AI / stablediffusion

High-Resolution Image Synthesis with Latent Diffusion Models

Python 39,167 5,050 Updated Oct 10, 2024

open-mmlab / PowerPaint

[ECCV 2024] PowerPaint, a versatile image inpainting model that supports text-guided object inpainting, object removal, image outpainting and shape-guided object inpainting with only a single model…

Python 658 41 Updated Sep 8, 2024

ankush-me / SynthText

Code for generating synthetic text images as described in "Synthetic Data for Text Localisation in Natural Images", Ankush Gupta, Andrea Vedaldi, Andrew Zisserman, CVPR 2016.

Python 2,027 622 Updated Aug 9, 2023

facebookresearch / IMGUR5K-Handwriting-Dataset

IMGUR5K handwriting set. It is a handwritten in-the-wild dataset, which contains challenging real world handwritten samples from different writers.The dataset is shared as a set of image urls with …

Python 285 55 Updated Mar 12, 2024

ramajoballester / UC3M-LP

UC3M License Plate detection and recognition dataset

Python 8 Updated Nov 16, 2024

hymie122 / RAG-Survey

Collecting awesome papers of RAG for AIGC. We propose a taxonomy of RAG foundations, enhancements, and applications in paper "Retrieval-Augmented Generation for AI-Generated Content: A Survey".

1,254 86 Updated Aug 20, 2024

QwenLM / Qwen2.5

Qwen2.5 is the large language model series developed by Qwen team, Alibaba Cloud.

Shell 9,646 594 Updated Nov 11, 2024

xuexingyu24 / License_Plate_Detection_Pytorch

A two stage lightweight and high performance license plate recognition in MTCNN and LPRNet

Jupyter Notebook 656 171 Updated Jan 22, 2024

OpenBMB / MiniCPM-V

MiniCPM-V 2.6: A GPT-4V Level MLLM for Single Image, Multi Image and Video on Your Phone

Python 12,595 886 Updated Oct 22, 2024

sergiomsilva / alpr-unconstrained

License Plate Detection and Recognition in Unconstrained Scenarios

C 1,722 607 Updated Jul 1, 2022

roatienza / deep-text-recognition-benchmark

PyTorch code of my ICDAR 2021 paper Vision Transformer for Fast and Efficient Scene Text Recognition (ViTSTR)

Jupyter Notebook 292 59 Updated Apr 9, 2024

nomfoundation / font

Source for NomNaTong-regular Vietnamese chữ Nôm font.

Python 71 6 Updated Nov 12, 2024

docongminh / VietNamese-OCR-DataGenerator

Forked from Belval/TextRecognitionDataGenerator

A synthetic data generator for text recognition

Python 8 Updated May 22, 2023

NVIDIA / tao_pytorch_backend

TAO Toolkit deep learning networks with PyTorch backend

Python 87 18 Updated Nov 7, 2024

trinhtuanvubk / OCR-Vietnamese-Text-Generator

Forked from Belval/TextRecognitionDataGenerator

A synthetic data generator for text recognition

Python 3 Updated Oct 10, 2023

ds4v / NomNaOCR

Leverage Deep Learning to digitize old Vietnamese handwritten for historical document archiving (Made with national pride in every single line of code): https://www.kaggle.com/datasets/quandang/nom…

Jupyter Notebook 116 22 Updated Jun 11, 2024

clovaai / cord

CORD: A Consolidated Receipt Dataset for Post-OCR Parsing

398 36 Updated Jul 20, 2022

mindee / doctr

docTR (Document Text Recognition) - a seamless, high-performing & accessible library for OCR-related tasks powered by Deep Learning.

Python 3,859 443 Updated Nov 13, 2024

HCIILAB / Scene-Text-Detection

541 129 Updated Sep 7, 2023

RapidAI / TableStructureRec

整理目前开源的最优表格识别模型，完善前后处理，模型转换为ONNX Organize the currently open-source optimal table recognition models, improve pre-processing and post-processing, and convert the models to ONNX.

Python 295 34 Updated Nov 16, 2024

opendatalab / MinerU

A one-stop, open-source, high-quality data extraction tool, supports PDF/webpage/e-book extraction.一站式开源高质量数据提取工具，支持PDF/网页/多格式电子书提取。

Python 16,511 1,194 Updated Nov 15, 2024

mindspore-lab / mindocr

A toolbox of ocr models and algorithms based on MindSpore

Python 219 56 Updated Nov 15, 2024

Starred topics

optical-mark-recognition

handwriting-generation

handwriting-synthesis