-
AIRS
- shenzhen China
-
07:53
(UTC -12:00) - https://[email protected]
- @matrixMingzai
Block or Report
Block or report matrixgame2018
Contact GitHub support about this user’s behavior. Learn more about reporting abuse.
Report abuseLists (3)
Sort Name ascending (A-Z)
Stars
Language
Sort by: Recently starred
Learn fundamental knowledge in robotics
[CVPR 2024] Physical Property Understanding from Language-Embedded Feature Fields
An efficient, flexible and full-featured toolkit for fine-tuning LLM (InternLM2, Llama3, Phi3, Qwen, Mistral, ...)
A growing curation of Text-to-3D, Diffusion-to-3D works.
Implementation of paper - YOLOv9: Learning What You Want to Learn Using Programmable Gradient Information
MiniSora: A community aims to explore the implementation path and future development direction of Sora.
[CVPR 2023] Prompt, Generate, then Cache: Cascade of Foundation Models makes Strong Few-shot Learners
Official code implementation of Vary-toy (Small Language Model Meets with Reinforced Vision Vocabulary)
[ECCV 2024] DriveDreamer: Towards Real-world-driven World Models for Autonomous Driving
A lightweight framework for building LLM-based agents
Awesome-LLM-3D: a curated list of Multi-modal Large Language Model in 3D world Resources
[Incl. GenAD, CVPR 2024 Highlight] Embracing Foundation Models into Autonomous Agent and System
AppAgent: Multimodal Agents as Smartphone Users, an LLM-based multimodal agent framework designed to operate smartphone apps.
【ICLR 2024🔥】 Extending Video-Language Pretraining to N-modality by Language-based Semantic Alignment
My homepage.
An open source framework for research in Embodied-AI from AI2.
[CVPR 2024] A world model for autonomous driving.
This repo contains the code for our paper Towards Open-Ended Visual Recognition with Large Language Model
About official Pytorch implementation of "Lifelong-MonoDepth: Lifelong Learning for Multi-Domain Monocular Metric Depth Estimation
DistillBEV: Boosting Multi-Camera 3D Object Detection with Cross-Modal Knowledge Distillation (ICCV 2023)
GPT4RoI: Instruction Tuning Large Language Model on Region-of-Interest
[ECCV 2024] Official implementation of the paper "Grounding DINO: Marrying DINO with Grounded Pre-Training for Open-Set Object Detection"
Code for Human cognition-inspired active room segmentation
NeurIPS 2023 - Challenge / NeurIPS 2024 Dataset Track