Awesome multilingual OCR toolkits based on PaddlePaddle (practical ultra lightweight OCR system, support 80+ languages recognition, provide data annotation and synthesis tools, support training and…

Python 44,158 7,814 Updated Nov 12, 2024

ThisisGame / cpp-game-engine-book

A tutorial on writing a game engine from scratch

C++ 3,000 375 Updated Apr 19, 2024

NVIDIA-Omniverse / PhysX

NVIDIA PhysX SDK

C++ 2,588 374 Updated Oct 15, 2024

IDEA-Research / Grounded-SAM-2

Grounded SAM 2: Ground and Track Anything in Videos with Grounding DINO, Florence-2 and SAM 2

Jupyter Notebook 1,066 99 Updated Nov 3, 2024

labuladong / fucking-algorithm

Practicing algorithms is all about patterns, and labuladong is all you need! English version supported! Crack LeetCode, not only how, but also why.

Markdown 125,819 23,221 Updated Sep 22, 2024

facebookresearch / sam2

The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained model checkpoints, and example notebooks that show how to use th…

Jupyter Notebook 12,262 1,124 Updated Oct 14, 2024

longzw1997 / Open-GroundingDino

This is a third-party implementation of the paper Grounding DINO: Marrying DINO with Grounded Pre-Training for Open-Set Object Detection.

Python 424 67 Updated Jun 25, 2024

wkentaro / yolo-world-onnx

ONNX models of YOLO-World (an open-vocabulary object detector).

Python 14 2 Updated Jun 29, 2024

NVIDIA / warp

A Python framework for high performance GPU simulation and graphics

Python 4,239 242 Updated Nov 12, 2024

NVlabs / VILA

VILA - a multi-image visual language model with training, inference and evaluation recipe, deployable from cloud to edge (Jetson Orin and laptops)

Python 1,984 158 Updated Oct 31, 2024

AILab-CVC / YOLO-World

[CVPR 2024] Real-Time Open-Vocabulary Object Detection

Python 4,644 450 Updated Nov 5, 2024

FoundationVision / GLEE

[CVPR 2024 Highlight] GLEE: General Object Foundation Model for Images and Videos at Scale

Python 1,077 84 Updated Oct 21, 2024

AILab-CVC / M2PT

[CVPR'24] Multimodal Pathway: Improve Transformers with Irrelevant Data from Other Modalities

Python 93 4 Updated Mar 13, 2024

vchoutas / smplx

SMPL-X

Python 1,859 310 Updated Aug 12, 2024

RockeyCoss / Prompt-Segment-Anything

This is an implementation of zero-shot instance segmentation using Segment Anything.

Python 299 15 Updated Apr 14, 2023

IDEA-Research / X-Pose

[ECCV 2024] Official implementation of the paper "X-Pose: Detecting Any Keypoints"

Python 502 20 Updated Aug 16, 2024

roboflow / inference

A fast, easy-to-use, production-ready inference server for computer vision supporting deployment of many popular model architectures and fine-tuned models.

Python 1,357 125 Updated Nov 12, 2024