- UC San Diego
- La Jolla, San Diego
- RchalYang.github.io
Stars
This repository compiles a list of papers related to the application of video technology in the field of robotics! Star⭐ the repo and follow me if you like what you see🤩.
The repository provides code associated with the paper VLFM: Vision-Language Frontier Maps for Zero-Shot Semantic Navigation (ICRA 2024)
HOT3D: A dataset for egocentric 3D hand and object tracking
A flexible and efficient codebase for training visually-conditioned language models (VLMs)
Depth Anything V2. A More Capable Foundation Model for Monocular Depth Estimation
CleanDiffuser: An Easy-to-use Modularized Library for Diffusion Models in Decision Making
openvla / openvla
Forked from TRI-ML/prismatic-vlms
OpenVLA: An open-source vision-language-action model for robotic manipulation.
A list of resources on Human-Object Interaction learning.
Official repository of "TACO: Benchmarking Generalizable Bimanual Tool-ACtion-Object Understanding".
Official Implementation of the ICLR 2023 spotlight paper: Universal Humanoid Motion Representations for Physics-Based Control
Scripts for fine-tuning Meta Llama3 with composable FSDP & PEFT methods to cover single/multi-node GPUs. Supports default & custom datasets for applications such as summarization and Q&A. Supportin…
HumanPlus: Humanoid Shadowing and Imitation from Humans
This repo contains the code of the paper "Learned Inertial Odometry for Autonomous Drone Racing", RA-L 2023.
Code release and project site for "CCIL: Continuity-based Data Augmentation for Corrective Imitation Learning"
Video+code lecture on building nanoGPT from scratch
This code corresponds to simulation environments used as part of the MimicGen project.
Parameterizing Everyday Home Activities Towards 3D Generative Modeling of Human-Object Interactions
RoboCasa: Large-Scale Simulation of Everyday Tasks for Generalist Robots
Accessible large language models via k-bit quantization for PyTorch.
Load and visualize io-data with python scripts.
[CVPR 2024 Highlight] FoundationPose: Unified 6D Pose Estimation and Tracking of Novel Objects
[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4V. A commercially usable open-source multimodal dialogue model approaching GPT-4V performance
Perplexica is an AI-powered search engine. It is an open-source alternative to Perplexity AI
[CVPR 2023] Official Implementation of X-Decoder for generalized decoding for pixel, image and language
🔥🔥 LLaVA++: Extending LLaVA with Phi-3 and LLaMA-3 (LLaVA LLaMA-3, LLaVA Phi-3)
Open-TeleVision: Teleoperation with Immersive Active Visual Feedback