Stars
Bayesian optimisation & Reinforcement Learning library developed by Huawei Noah's Ark Lab
PyTorch code for Vision Transformers training with the Self-Supervised learning method DINO
Cambrian-1 is a family of multimodal LLMs with a vision-centric design.
The implementation and dataset of CVPR 2024 paper: Implicit Event Neural SLAM.
A curated list of recent diffusion models for video generation, editing, restoration, understanding, etc.
[CVPR'24 Highlight & Best Demo Award] Gaussian Splatting SLAM
🔮 ChatGPT Desktop Application (Mac, Windows and Linux)
Code for robust monocular depth estimation described in "Ranftl et al., Towards Robust Monocular Depth Estimation: Mixing Datasets for Zero-shot Cross-dataset Transfer, TPAMI 2022"
Code for the paper "Low Latency Automotive Vision with Event Cameras", published in Nature
[ICCV 2019] Monocular depth estimation from a single image
Unsupervised single image depth prediction with CNNs
MonoNav: MAV Navigation via Monocular Depth Estimation and Reconstruction
Depth estimation from stereo images using deep-learning-based architectures for disparity measurement. The architectures used for disparity estimation are BgNet, CreStereo, Raft-Stereo, HitNet, GwcNet…
SOS IROS 2018 GOOGLE; StereoNet ECCV2018 GOOGLE; ActiveStereoNet ECCV2018 Oral GOOGLE; HITNET CVPR2021 GOOGLE; PLUME Uber ATG
PyTorch code and models for the DINOv2 self-supervised learning method.
Official implementation of AISY 2022 paper MC-EMVS (Multi-Camera Event-based Multi-View Stereo)
[CVPR 2024] Depth Anything: Unleashing the Power of Large-Scale Unlabeled Data. Foundation Model for Monocular Depth Estimation
Pyramid Stereo Matching Network (CVPR2018)
High Quality Monocular Depth Estimation via Transfer Learning
Awesome-LLM-3D: a curated list of resources on multi-modal large language models in the 3D world
Octo is a transformer-based robot policy trained on a diverse mix of 800k robot trajectories.