Stars
[CVPR 2024] Depth Anything: Unleashing the Power of Large-Scale Unlabeled Data. Foundation Model for Monocular Depth Estimation
[NeurIPS 2024] Depth Anything V2. A More Capable Foundation Model for Monocular Depth Estimation
Awesome papers about Multi-Camera 3D Object Detection and Segmentation in Bird's-Eye-View, such as DETR3D, BEVDet, BEVFormer, BEVDepth, UniAD
Pure Python from-scratch zero-dependency implementation of Bitcoin for educational purposes
A tiny scalar-valued autograd engine and a neural net library on top of it with PyTorch-like API
The simplest, fastest repository for training/finetuning medium-sized GPTs.
3D Bounding Box Annotation Tool (3D-BAT) Point cloud and Image Labeling
3D Point Cloud Annotation Platform for Autonomous Driving
Neural Network Compression Framework for enhanced OpenVINO™ inference
Implementation of paper - YOLOv9: Learning What You Want to Learn Using Programmable Gradient Information
Detects and interactively deactivates duplicate Apt source entries and deletes sources list files without valid enabled source entries (as requested in https://askubuntu.com/a/762815/175814).
[ICCV 2023] Code for NeRF-Det: Learning Geometry-Aware Volumetric Representation for Multi-View 3D Object Detection
[TPAMI'23] Unifying Flow, Stereo and Depth Estimation
[ICCV 2019] Monocular depth estimation from a single image
VFDepth Self-supervised surround-view depth estimation with volumetric feature fusion
Flexible and powerful tensor operations for readable and reliable code (for pytorch, jax, TF and others)
EA-LSS: Edge-aware Lift-splat-shot Framework for 3D BEV Object Detection
[ICCV 2023] SparseBEV: High-Performance Sparse 3D Object Detection from Multi-Camera Videos
OpenPCDet Toolbox for LiDAR-based 3D Object Detection.
[ICCV 2023] StreamPETR: Exploring Object-Centric Temporal Modeling for Efficient Multi-View 3D Object Detection
Open-source simulator for autonomous driving research.
Visualize Camera's Pose Using Extrinsic Parameter by Plotting Pyramid Model on 3D Space
[WACV2022] ImVoxelNet: Image to Voxels Projection for Monocular and Multi-View General-Purpose 3D Object Detection
[ICCV 2023] Temporal Enhanced Training of Multi-view 3D Object Detector via Historical Object Prediction
Cross-view Transformers for real-time Map-view Semantic Segmentation (CVPR 2022 Oral)
Official code base of the BEVDet series .