Skip to content
View Topdu's full-sized avatar

Block or report Topdu

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

End-to-End Object Detection with Transformers

Python 13,432 2,425 Updated Mar 12, 2024

A Comprehensive Toolkit for High-Quality PDF Content Extraction

Python 5,003 336 Updated Oct 7, 2024

A one-stop, open-source, high-quality data extraction tool, supports PDF/webpage/e-book extraction.一站式开源高质量数据提取工具,支持PDF/网页/多格式电子书提取。

Python 12,604 939 Updated Oct 8, 2024

[ICCV 2023] Code base for Revisiting Scene Text Recognition: A Data Perspective

Python 165 8 Updated Nov 1, 2023

Official code implementation of General OCR Theory: Towards OCR-2.0 via a Unified End-to-end Model

Python 4,999 407 Updated Oct 2, 2024

A collection of original, innovative ideas and algorithms towards Advanced Literate Machinery. This project is maintained by the OCR Team in the Language Technology Lab, Tongyi Lab, Alibaba Group.

C++ 1,377 166 Updated Sep 30, 2024

Offical implementation of "Integer-Valued Training and Spike-Driven Inference Spiking Neural Network for High-performance and Energy-efficient Object Detection" (ECCV2024 Best Paper Candidate / Oral)

Python 78 3 Updated Oct 7, 2024

A research project for text detection and recognition using PyTorch 1.2.

Python 349 67 Updated Dec 24, 2019

Official PyTorch Implementation of MambaVision: A Hybrid Mamba-Transformer Vision Backbone

Python 737 41 Updated Oct 5, 2024

[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4o. 接近GPT-4o表现的开源多模态对话模型

Python 5,668 440 Updated Sep 19, 2024

Official implementation code of the paper <AnyText: Multilingual Visual Text Generation And Editing>

Python 4,248 281 Updated Jun 21, 2024

[NeurIPS 2024] The official code of "U-DiTs: Downsample Tokens in U-Shaped Diffusion Transformers"

Python 88 5 Updated Sep 30, 2024
Python 170 18 Updated Sep 28, 2024

Masked Diffusion Transformer is the SOTA for image synthesis. (ICCV 2023)

Python 514 38 Updated Apr 23, 2024

The official code of CornerTransformer (ECCV 2022, Oral) on top of MMOCR.

Python 137 15 Updated Mar 6, 2023

Deeper Depth Prediction with Fully Convolutional Residual Networks (FCRN)

Python 1,112 313 Updated Aug 26, 2019

Code for the paper STN-OCR: A single Neural Network for Text Detection and Text Recognition

Python 498 139 Updated Jan 2, 2018

Code for the AAAI 2018 publication "SEE: Towards Semi-Supervised End-to-End Scene Text Recognition"

Python 575 147 Updated Apr 26, 2019

A minimal GPU design in Verilog to learn how GPUs work from the ground up

SystemVerilog 6,979 522 Updated Aug 18, 2024

Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities

Python 19,644 2,507 Updated Oct 7, 2024

LLM inference in C/C++

C++ 65,942 9,475 Updated Oct 7, 2024

Implementation of popular deep learning networks with TensorRT network definition API

C++ 6,921 1,765 Updated Sep 23, 2024

[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.

Python 19,623 2,160 Updated Aug 12, 2024

Scene Text Recognition with Permuted Autoregressive Sequence Models (ECCV 2022)

Python 565 125 Updated May 29, 2024

Machine learning, in numpy

Python 15,308 3,713 Updated Oct 29, 2023

基于Pytorch的OCR工具库,支持常用的文字检测和识别算法

Python 1,366 304 Updated Sep 2, 2024

Combining MMOCR with Segment Anything & Stable Diffusion. Automatically detect, recognize and segment text instances, with serval downstream tasks, e.g., Text Removal and Text Inpainting

Python 519 37 Updated Jan 30, 2024

Painter & SegGPT Series: Vision Foundation Models from BAAI

Python 2,503 167 Updated Oct 31, 2023

A quickstart and benchmark for pytorch distributed training.

Python 1,623 296 Updated Jul 25, 2024

Grounded SAM: Marrying Grounding DINO with Segment Anything & Stable Diffusion & Recognize Anything - Automatically Detect , Segment and Generate Anything

Jupyter Notebook 14,889 1,379 Updated Sep 5, 2024
Next