View ldkong1205's full-sized avatar
🌳

Organizations

@Pointcept

Multi-Space Alignments Towards Universal LiDAR Segmentation

Jupyter Notebook 23 2 Updated Jul 2, 2024

[ECCV 2024] Empowering 3D Visual Grounding with Reasoning Capabilities

14 Updated Jul 2, 2024

A Unified Pixel-level Vision LLM for Understanding, Generating, Segmenting, Editing

Python 239 15 Updated Jun 26, 2024

Code&Data for Grounded 3D-LLM with Referent Tokens

Python 56 Updated Jul 1, 2024

Inference code for Llama models

Python 54,166 9,322 Updated May 15, 2024

🦦 Otter, a multi-modal model based on OpenFlamingo (open-sourced version of DeepMind's Flamingo), trained on MIMIC-IT and showcasing improved instruction-following and in-context learning ability.

Python 3,509 240 Updated Mar 5, 2024

Project Page for "LISA: Reasoning Segmentation via Large Language Model"

Python 1,625 109 Updated Jul 2, 2024

An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.

Python 35,621 4,379 Updated Jul 7, 2024
Python 296 11 Updated Jan 22, 2024

LLaVA-Interactive-Demo

Python 332 25 Updated Jun 10, 2024

LLaVA-Plus: Large Language and Vision Assistants that Plug and Learn to Use Skills

Python 661 51 Updated Feb 1, 2024

A Unified Framework for 3D Scene Understanding

34 1 Updated Jul 5, 2024

Survey and Benchmark of VIALM

7 1 Updated Jan 26, 2024

[CVPR2024] Multiagent Multitraversal Multimodal Self-Driving: Open MARS Dataset

Python 23 Updated Jun 25, 2024

VILA - a multi-image visual language model with training, inference and evaluation recipe, deployable from cloud to edge (Jetson Orin and laptops)

Python 905 61 Updated Jul 7, 2024

[ECCV 2024] 4D Contrastive Superflows are Dense 3D Representation Learners

5 Updated Jul 2, 2024

Layout-Guided multi-view driving scene video generation with latent diffusion model

Python 514 11 Updated Dec 15, 2023

Official repository for the paper MG-LLaVA: Towards Multi-Granularity Visual Instruction Tuning (https://arxiv.org/abs/2406.17770).

Python 81 2 Updated Jun 27, 2024

[ECCV 2022] SimpleRecon: 3D Reconstruction Without 3D Convolutions

Python 1,270 118 Updated May 25, 2023

[ECCV 2024] 3D World Model for Autonomous Driving

Python 296 17 Updated Apr 12, 2024

Official implementation of Unique3D: High-Quality and Efficient 3D Mesh Generation from a Single Image

Python 2,196 146 Updated Jul 3, 2024

[IROS23] InsMOS: Instance-Aware Moving Object Segmentation in LiDAR Data

Python 108 5 Updated Jun 5, 2024

Reason3D: Searching and Reasoning 3D Segmentation via Large Language Model

38 1 Updated May 28, 2024

A High-fidelity 128-channel LiDAR Dataset with Panoramic Ambient and Reflectivity Imagery for Multi-modal Autonomous Driving Applications

28 1 Updated Jun 30, 2024

GLENet: Boosting 3D Object Detectors with Generative Label Uncertainty Estimation [IJCV2023]

Python 175 8 Updated Jun 4, 2024

Fine-grained Image-to-LiDAR Contrastive Distillation with Visual Foundation Models

Python 13 1 Updated Jul 6, 2024

BRAVO Challenge Toolkit and Evaluation Code

Python 7 Updated Jul 1, 2024

[CVPR 2024 Highlight] XCube: Large-Scale 3D Generative Modeling using Sparse Voxel Hierarchies

Python 147 7 Updated Jun 26, 2024

Is Your HD Map Constructor Reliable under Sensor Corruptions?

Python 29 1 Updated Jun 30, 2024

Bridging lidar and text through image intermediaries

Jupyter Notebook 69 8 Updated Jan 31, 2024