Stars
The official implementation of 'FakeShield: Explainable Image Forgery Detection and Localization via Multi-modal Large Language Models'
Official Implementation of paper "MonST3R: A Simple Approach for Estimating Geometry in the Presence of Motion"
Lagrangian formulation of Doob's h-transform allowing for efficient rare event sampling
A one-stop, open-source, high-quality data extraction tool, supports PDF/webpage/e-book extraction.一站式开源高质量数据提取工具,支持PDF/网页/多格式电子书提取。
Mamba in Vision: A Comprehensive Survey of Techniques and Applications
[CVPR2023] A Hierarchical Representation Network for Accurate and Detailed Face Reconstruction from In-The-Wild Images.
[CVPR2024 Highlight]GLEE: General Object Foundation Model for Images and Videos at Scale
【IJCAI 2023】RaSa: Relation and Sensitivity Aware Representation Learning for Text-based Person Search
Fast Diffusion-Based Counterfactuals for Shortcut Removal and Generation (ECCV 2024 ORAL)
The official code of "PLIP: Language-Image Pre-training for Person Representation Learning"
(TPAMI2024) Official implementation of Paper ''A Versatile Framework for Multi-scene Person Re-identification''
CLIP (Contrastive Language-Image Pretraining), Predict the most relevant text snippet given an image
3D Slicer extension for Segment Anything Model (SAM) developed by Meta
[CVPR2024] UFineBench: Towards Text-based Person Retrieval with Ultra-fine Granularity
Large Gaussian Reconstruction Model for Efficient 3D Reconstruction and Generation
🪐 Objaverse-XL is a Universe of 10M+ 3D Objects. Contains API Scripts for Downloading and Processing!
[ECCV 2024 Oral] LGM: Large Multi-View Gaussian Model for High-Resolution 3D Content Creation.
[MICCAI'23] Foundation Model for Endoscopy Video Analysis via Large-scale Self-supervised Pre-train
Super Resolution models for Gastric Endoscopic Images (Medical Imaging)
FD-Vision Mamba for Endoscopic Exposure Correction
[MICCAI 2024] EndoSparse: Real-Time Sparse View Synthesis of Endoscopic Scenes using Gaussian Splatting
[MICCAI'2024] EndoDAC: Efficient Adapting Foundation Model for Self-Supervised Depth Estimation from Any Endoscopic Camera
[IPCAI'2024 (IJCARS special issue)] Surgical-DINO: Adapter Learning of Foundation Models for Depth Estimation in Endoscopic Surgery
Deformable visual odometry for stereo endoscopic videos
MICCAI 2024: Endo-4DGS: Endoscopic Monocular Scene Reconstruction with 4D Gaussian Splatting