View matrixgame2018's full-sized avatar
Shell 31 2 Updated Jun 10, 2024

Learn fundamental knowledge in robotics

TeX 161 7 Updated Jun 16, 2024

[CVPR 2024] Physical Property Understanding from Language-Embedded Feature Fields

Python 40 2 Updated Apr 8, 2024

Repository of TRUMANS

Python 39 1 Updated Jun 28, 2024

An efficient, flexible and full-featured toolkit for fine-tuning LLM (InternLM2, Llama3, Phi3, Qwen, Mistral, ...)

Python 3,173 255 Updated Jul 2, 2024

A growing curation of Text-to-3D, Diffusion-to-3D works.

TeX 429 21 Updated Jun 30, 2024

Implementation of paper - YOLOv9: Learning What You Want to Learn Using Programmable Gradient Information

Python 8,523 1,286 Updated Jun 28, 2024

MiniSora: A community that aims to explore the implementation path and future development direction of Sora.

Python 1,096 144 Updated Jun 1, 2024

[CVPR 2023] Prompt, Generate, then Cache: Cascade of Foundation Models makes Strong Few-shot Learners

Python 332 18 Updated Jun 1, 2023

OMG-LLaVA and OMG-Seg codebase

Python 890 43 Updated Jun 28, 2024

Official code implementation of Vary-toy (Small Language Model Meets with Reinforced Vision Vocabulary)

Python 565 41 Updated Jul 2, 2024

[ECCV 2024] DriveDreamer: Towards Real-world-driven World Models for Autonomous Driving

253 6 Updated Jan 18, 2024

A lightweight framework for building LLM-based agents

Python 984 98 Updated Jun 13, 2024
Python 225 37 Updated Jul 2, 2024

Awesome-LLM-3D: a curated list of Multi-modal Large Language Model in 3D world Resources

827 54 Updated Jun 25, 2024
Python 43 3 Updated Nov 7, 2023

[Incl. GenAD, CVPR 2024 Highlight] Embracing Foundation Models into Autonomous Agent and System

Python 458 16 Updated May 28, 2024

AppAgent: Multimodal Agents as Smartphone Users, an LLM-based multimodal agent framework designed to operate smartphone apps.

Python 4,588 488 Updated May 26, 2024

【ICLR 2024🔥】 Extending Video-Language Pretraining to N-modality by Language-based Semantic Alignment

Python 610 48 Updated Mar 25, 2024

My homepage.

HTML 2 1 Updated Jun 19, 2024

An open source framework for research in Embodied-AI from AI2.

Python 298 46 Updated Jul 1, 2024

[CVPR 2024] A world model for autonomous driving.

Python 237 2 Updated Dec 7, 2023

This repo contains the code for our paper Towards Open-Ended Visual Recognition with Large Language Model

Jupyter Notebook 88 5 Updated Dec 4, 2023

Official PyTorch implementation of "Lifelong-MonoDepth: Lifelong Learning for Multi-Domain Monocular Metric Depth Estimation"

Python 10 Updated Dec 8, 2023

DistillBEV: Boosting Multi-Camera 3D Object Detection with Cross-Modal Knowledge Distillation (ICCV 2023)

Python 77 5 Updated Nov 24, 2023

GPT4RoI: Instruction Tuning Large Language Model on Region-of-Interest

Python 475 24 Updated Jun 11, 2024

[ECCV 2024] Official implementation of the paper "Grounding DINO: Marrying DINO with Grounded Pre-Training for Open-Set Object Detection"

Python 5,565 585 Updated Jun 28, 2024

Code for Human cognition-inspired active room segmentation

Python 7 Updated Oct 28, 2023

NeurIPS 2023 - Challenge / NeurIPS 2024 Dataset Track

Python 1 Updated Dec 4, 2023