Stars
PyTorch reimplementation of the Vision Transformer (An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale)
Code and Model for VAST: A Vision-Audio-Subtitle-Text Omni-Modality Foundation Model and Dataset
Open-Sora: Democratizing Efficient Video Production for All
[ECCV 2024 Oral] LGM: Large Multi-View Gaussian Model for High-Resolution 3D Content Creation.
[ECCV 2024] HiFi-123: Towards High-fidelity One Image to 3D Content Generation
This project aims to reproduce Sora (OpenAI's T2V model); we hope the open-source community will contribute to it.
Official PyTorch Implementation of "Scalable Diffusion Models with Transformers"
Generate dense depth maps for the monocular depth estimation task on the KITTI dataset.
A General NeRF Acceleration Toolbox in PyTorch.
A unified framework for 3D content generation.
Mobile ALOHA: Learning Bimanual Mobile Manipulation with Low-Cost Whole-Body Teleoperation
Implementation of 3D Gaussian Splatting
Code for GaussianShader: 3D Gaussian Splatting with Shading Functions for Reflective Surfaces
[arXiv 2023] DreamGaussian4D: Generative 4D Gaussian Splatting
[ICLR 2024] Official implementation of DreamCraft3D: Hierarchical 3D Generation with Bootstrapped Diffusion Prior
This repository implements a KD-tree on CUDA with interface bindings in both C++ and TensorFlow
[ICCV2023] NeRF-LOAM: Neural Implicit Representation for Large-Scale Incremental LiDAR Odometry and Mapping
A collaboration-friendly studio for NeRFs
Road Damage Detection Based on Unsupervised Disparity Map Segmentation (T-ITS)
Rethinking Road Surface 3D Reconstruction and Pothole Detection: From Perspective Transformation to Disparity Map Segmentation (T-CYB)
SDA-SNE: Spatial Discontinuity-Aware Surface Normal Estimation via Multi-Directional Dynamic Programming
Three-Filters-to-Normal: An Accurate and Ultrafast Surface Normal Estimator (RAL+ICRA'21)
Official PyTorch implementation for a conditional diffusion probability model in BEV perception
[ICCV 2023] OccFormer: Dual-path Transformer for Vision-based 3D Semantic Occupancy Prediction