-
Southeast -> Tsinghua
- Shenzhen
-
00:43
(UTC +08:00) - guanxinglu.github.io
Highlights
- Pro
Lists (1)
Sort Name ascending (A-Z)
Stars
Qwen2-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.
An open-source impl. of Large Reconstruction Models
Grounded SAM 2: Ground and Track Anything in Videos with Grounding DINO, Florence-2 and SAM 2
Code for "Unleashing Large-Scale Video Generative Pre-training for Visual Robot Manipulation"
A suite of image and video neural tokenizers
A data generator based on sap2000 api
Implementation of π₀, the robotic foundation model architecture proposed by Physical Intelligence
PhyRecon: Physically Plausible Neural Scene Reconstruction
A flexible and efficient codebase for training visually-conditioned language models (VLMs)
This repository compiles a list of papers related to the application of video technology in the field of robotics! Star⭐ the repo and follow me if you like what you see🤩.
Digit 360 is a modular platform that unlocks new capabilities, and enables future research on the nature of touch.
A Paper List for Humanoid Robot Learning.
A simple llm-Agent for learning and get to know how agent works
Evaluating and reproducing real-world robot manipulation policies (e.g., RT-1, RT-1-X, Octo) in simulation under common setups (e.g., Google Robot, WidowX+Bridge) (CoRL 2024)
An example RLDS dataset builder for X-embodiment dataset conversion.
An example RLDS dataset builder for X-embodiment dataset conversion.
Universal Monocular Metric Depth Estimation
HaMeR: Reconstructing Hands in 3D with Transformers
Heterogeneous Pre-trained Transformer (HPT) as Scalable Policy Learner.
Code for "Dynamic 3D Gaussian Tracking for Graph-Based Neural Dynamics Modeling" (CoRL 2024)
The hardware design for AgiBot X1.
This is the official repository for "EgoLifter Open-world 3D Segmentation for Egocentric Perception, ECCV 2024"
An Easy-to-use, Scalable and High-performance RLHF Framework (70B+ PPO Full Tuning & Iterative DPO & LoRA & RingAttention)
Implementation of Deepmind's RoboCat: "Self-Improving Foundation Agent for Robotic Manipulation" An next generation robot LLM
Code for "SlotLifter: Slot-guided Feature Lifting for Learning Object-centric Radiance Fields" (ECCV 2024)
Code for CoRL 2022 paper: https://arxiv.org/abs/2211.09006 (ToolFlowNet, for simulation envs)