Stars
[ICCV 2023] Tracking Anything with Decoupled Video Segmentation
Propagate Yourself: Exploring Pixel-Level Consistency for Unsupervised Visual Representation Learning, CVPR 2021
[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.
[CVPR 2023] Revisiting Weak-to-Strong Consistency in Semi-Supervised Semantic Segmentation
[CVPR 2024 🔥] Grounding Large Multimodal Model (GLaMM), the first-of-its-kind model capable of generating natural language responses that are seamlessly integrated with object segmentation masks.
A personal investigative project to track the latest progress in the field of multi-modal object tracking.
The official GitHub page for the survey paper "A Survey of Large Language Models".
(TPAMI 2024) A Survey on Open Vocabulary Learning
A collection of deep learning based RGB-T-Fusion methods, code, and datasets. The main directions involved are Multispectral Pedestrian Detection, RGB-T Aerial Object Detection, RGB-T Semantic Seg…
VFM-Det: Towards High-Performance Vehicle Detection via Large Foundation Models
The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained model checkpoints, and example notebooks that show how to use th…
The repository provides code for running inference with the Segment Anything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.
AI Native Data App Development framework with AWEL (Agentic Workflow Expression Language) and Agents
Unified Efficient Fine-Tuning of 100+ LLMs (ACL 2024)
Automatically Update arXiv Papers Daily using GitHub Actions (Update Every 8 Hours)
TrackGPT: Track What You Need in Videos via Text Prompts
[ECCV 2024] The official code of the paper "Open-Vocabulary SAM".
A repository curated by the MLNLP community to help everyone avoid small mistakes in paper submissions. Paper Writing Tips
Visual Object Tracking
Open-Sora: Democratizing Efficient Video Production for All
An Open-source Toolkit for LLM Development
A simple and efficient Mamba implementation in pure PyTorch and MLX.
OVTrack: Open-Vocabulary Multiple Object Tracking [CVPR 2023]
A curated collection of papers, tutorials, videos, and other valuable resources related to Mamba.