Stars
This is the official repository for "EgoLifter Open-world 3D Segmentation for Egocentric Perception, ECCV 2024"
HART: Efficient Visual Generation with Hybrid Autoregressive Transformer
Agent-to-Sim Learning Interactive Behavior from Casual Videos.
Official implementation of "ViewCrafter: Taming Video Diffusion Models for High-fidelity Novel View Synthesis"
RDT-1B: a Diffusion Foundation Model for Bimanual Manipulation
[arXiv 2024] Generalizable Humanoid Manipulation with Improved 3D Diffusion Policies. Part 1: Train & Deploy of iDP3
Official Pytorch Implementation of Representation Alignment for Generation: Training Diffusion Transformers Is Easier Than You Think
Splat-MOVER: Multi-Stage, Open-Vocabulary Robotic Manipulation via Editable Gaussian Splatting
Official repository for Splatt3R: Zero-shot Gaussian Splatting from Uncalibrated Image Pairs
DIAMOND (DIffusion As a Model Of eNvironment Dreams) is a reinforcement learning agent trained in a diffusion world model. NeurIPS 2024 Spotlight.
SPEAR: A Simulator for Photorealistic Embodied AI Research
New API (RM_API2) developed for Realman Robot (https://www.realman-robotics.com/)
MichalZawalski / embodied-CoT
Forked from openvla/openvlaEmbodied Chain of Thought: A robotic policy that reason to solve the task.
[NeurIPS 2024 D&B] Point Cloud Matters: Rethinking the Impact of Different Observation Spaces on Robot Learning
[ECCV 2024] BlenderAlchemy: Editing 3D Graphics with Vision-Language Models
tiktoken is a fast BPE tokeniser for use with OpenAI's models.
Official Implementation of paper "MonST3R: A Simple Approach for Estimating Geometry in the Presence of Motion"
ROS Agents is a fully-loaded framework for creating interactive embodied agents that can understand, remember, and act upon contextual information from their environment.
[IROS 2024] Learning Human-to-Humanoid Real-Time Whole-Body Teleoperation. [CoRL 2024] OmniH2O: Universal and Dexterous Human-to-Humanoid Whole-Body Teleoperation and Learning
Official implementation of "MIMO: Controllable Character Video Synthesis with Spatial Decomposed Modeling"
Code for "SlotLifter: Slot-guided Feature Lifting for Learning Object-centric Radiance Fields" (ECCV 2024)
autogenhub / autogen
Forked from microsoft/autogenA programming framework for agentic AI. Discord: https://discord.gg/pAbnFJrkgZ
Robot Utility Models are trained on a diverse set of environments and objects, and then can be deployed in novel environments with novel objects without any further data or training.
A curated list of resources for using LLMs to develop more competitive grant applications.
PR2 is a humanoid robot testbed designed for both entry-level students and professional users with supports in bipedal locomotion, multi-modal manipulation, and interaction with vision and language…