Stars
The project is for openai clip learning.
Official PyTorch codes for "Open Vocabulary 3D Scene Understanding via Geometry Guided Self-Distillation", ECCV2024
Code of "OccNeRF: Advancing 3D Occupancy Prediction in LiDAR-Free Environments".
[ECCV'24] OpenIns3D: Snap and Lookup for 3D Open-vocabulary Instance Segmentation
[ECCV 2024] Scene as Gaussians for Vision-Based 3D Semantic Occupancy Prediction
[CVPR 2024] SAI3D: Segment Any Instance in 3D Scenes
Code for 3D-LLM: Injecting the 3D World into Large Language Models
officical code for ECCV 2024 paper "Global-Local Collaborative Inference with LLM for Lidar-Based Open-Vocabulary Detection"
Four Ways to Improve Verbo-visual Fusion for Dense 3D Visual Grounding, ECCV 2024
[AAAI 2024] Mono3DVG: 3D Visual Grounding in Monocular Images, AAAI, 2024
[ACM MM 2024] Hierarchical Multimodal Fine-grained Modulation for Visual Grounding.
This is a Chinese translation of the CUDA programming guide
Official repository for Find n' Propagate: Open-Vocabulary 3D Object Detection in Urban Environments
Official code of DMA: Dense Multimodal Alignment for Open-Vocabulary 3D Scene Understanding, ECCV 2024
LLMs interview notes and answers:该仓库主要记录大模型(LLMs)算法工程师相关的面试题和参考答案
Our OpenYOLO3D model achieves state-of-the-art performance in Open Vocabulary 3D Instance Segmentation on ScanNet200 and Replica datasets with up ∼16x speedup compared to the best existing method …
[CVPR 24] MaskClustering: View Consensus based Mask Graph Clustering for Open-Vocabulary 3D Instance Segmentation
AAAI2024 - Sunshine to Rainstorm: Cross-Weather Knowledge Distillation for Robust 3D Object Detection
CLIP-Mamba: CLIP Pretrained Mamba Models with OOD and Hessian Evaluation
OccSora: 4D Occupancy Generation Models as World Simulators for Autonomous Driving
[CVPR2024] Official Repository of Paper "Panacea: Panoramic and Controllable Video Generation for Autonomous Driving"