-
National University of Singapore
- Singapore
- https://ldkong.com
- in/ldkong
- @ldkong1205
Highlights
Block or Report
Block or report ldkong1205
Contact GitHub support about this user’s behavior. Learn more about reporting abuse.
Report abuseLists (4)
Sort Name ascending (A-Z)
Stars
Language
Sort by: Recently starred
Multi-Space Alignments Towards Universal LiDAR Segmentation
[ECCV 2024] Empowering 3D Visual Grounding with Reasoning Capabilities
A Unified Pixel-level Vision LLM for Understanding, Generating, Segmenting, Editing
Code&Data for Grounded 3D-LLM with Referent Tokens
🦦 Otter, a multi-modal model based on OpenFlamingo (open-sourced version of DeepMind's Flamingo), trained on MIMIC-IT and showcasing improved instruction-following and in-context learning ability.
Project Page for "LISA: Reasoning Segmentation via Large Language Model"
An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.
LLaVA-Plus: Large Language and Vision Assistants that Plug and Learn to Use Skills
[CVPR2024] Multiagent Multitraversal Multimodal Self-Driving: Open MARS Dataset
VILA - a multi-image visual language model with training, inference and evaluation recipe, deployable from cloud to edge (Jetson Orin and laptops)
[ECCV 2024] 4D Contrastive Superflows are Dense 3D Representation Learners
Layout-Guided multi-view driving scene video generation with latent diffusion model
Official repository for paper MG-LLaVA: Towards Multi-Granularity Visual Instruction Tuning(https://arxiv.org/abs/2406.17770).
[ECCV 2022] SimpleRecon: 3D Reconstruction Without 3D Convolutions
[ECCV 2024] 3D World Model for Autonomous Driving
Official implementation of Unique3D: High-Quality and Efficient 3D Mesh Generation from a Single Image
[IROS23] InsMOS: Instance-Aware Moving Object Segmentation in LiDAR Data
Reason3D: Searching and Reasoning 3D Segmentation via Large Language Model
A High-fidelity 128-channel LiDAR Dataset with Panoramic Ambient and Reflectivity Imagery for Multi-modal Autonomous Driving Applications
GLENet: Boosting 3D Object Detectors with Generative Label Uncertainty Estimation [IJCV2023]
Fine-grained Image-to-LiDAR Contrastive Distillation with Visual Foundation Models
[CVPR 2024 Highlight] XCube: Large-Scale 3D Generative Modeling using Sparse Voxel Hierarchies
Is Your HD Map Constructor Reliable under Sensor Corruptions?
Bridging lidar and text through image intermediaries