-
Tsinghua University
Starred repositories
DepthCrafter: Generating Consistent Long Depth Sequences for Open-world Videos
[ICCV2023] 🧊FrozenRecon: Pose-free 3D Scene Reconstruction with Frozen Depth Models
[MICCAI'2024] EndoDAC: Efficient Adapting Foundation Model for Self-Supervised Depth Estimation from Any Endoscopic Camera
Neural Light Simulator for Camera-Light Calibration (a part of DarkGS project, see link below)
[ECCV 2024] Leveraging Synthetic Data for Real-Domain High-Resolution Monocular Metric Depth Estimation
Project Page for Paper "Deep Learning-Based Object Pose Estimation: A Comprehensive Survey"
Official implementation for HybridDepth Model
Python code to fuse multiple RGB-D images into a TSDF voxel volume.
Depth Anything V2. A More Capable Foundation Model for Monocular Depth Estimation
The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained model checkpoints, and example notebooks that show how to use th…
Universal Monocular Metric Depth Estimation
[CVPR'24] MorpheuS: Neural Dynamic 360° Surface Reconstruction from Monocular RGB-D Video
ImageBind One Embedding Space to Bind Them All
This is a sample C# project that extracts Depth and Color information from videos shot in iPhone's Cinematic mode and outputs each as separate videos, along with a sample Unity project for 3D playb…
Official Pytorch Implementation for “DINO-Tracker: Taming DINO for Self-Supervised Point Tracking in a Single Video”
Official PyTorch implementation of the CVPR 2023 paper "Solving 3D Inverse Problems using Pre-trained 2D Diffusion Models (https://arxiv.org/abs/2211.10655)"
Official implementation of the paper "Unsupervised CT Metal Artifact Reduction by Plugging Diffusion Priors in Dual Domains"
A vision-language foundation model for computational pathology - Nature Medicine
Prov-GigaPath: A whole-slide foundation model for digital pathology from real-world data
Learning Better Video Query with SAM for Video Instance Segmentation (TCSVT 2024)
Colonoscopy 3D Video Dataset (C3VD) acquired with a high definition clinical colonoscope and high-fidelity colon models for benchmarking computer vision methods in colonoscopy.
[MICCAI 2024] TeethDreamer: 3D Teeth Reconstruction from Five Intra-oral Photographs
unified multi-threading inferencing nodes for monocular 3D object detection, depth prediction and semantic segmentation
The repo for "Metric3D: Towards Zero-shot Metric 3D Prediction from A Single Image" and "Metric3Dv2: A Versatile Monocular Geometric Foundation Model..."
Gaussian-SLAM: Photo-realistic Dense SLAM with Gaussian Splatting
code for "PGSR: Planar-based Gaussian Splatting for Efficient and High-Fidelity Surface Reconstruction"
[EARTH@MICCAI'2024] EndoGS: Deformable Endoscopic Tissues Reconstruction with Gaussian Splatting