Block or Report
Block or report zhoubin-me
Contact GitHub support about this user’s behavior. Learn more about reporting abuse.
Report abuseStars
Language
Sort by: Recently starred
Real-Time 3D Semantic Reconstruction from 2D data
Evaluating and reproducing real-world robot manipulation policies (e.g., RT-1, RT-1-X, Octo) in simulation under common setups (e.g., Google Robot, WidowX+Bridge)
An open-source project dedicated to tracking and segmenting any objects in videos, either automatically or interactively. The primary algorithms utilized include the Segment Anything Model (SAM) fo…
Unofficial reverse-engineered ChatGPT API in Python
Unreal plugin for robot visualisation using ROS connecting with WebSockets.
A Unreal Engine 5 (UE5) based plugin aiming to provide real-time visulization, management, editing, and scalable hybrid rendering of Guassian Splatting model.
The docs of MoonBit programming language
VGGSfM: Visual Geometry Grounded Deep Structure From Motion
RAGFlow is an open-source RAG (Retrieval-Augmented Generation) engine based on deep document understanding.
[CVPR 2024 Highlight] Official PyTorch implementation of SpatialTracker: Tracking Any 2D Pixels in 3D Space
Get up and running with Llama 3.1, Mistral, Gemma 2, and other large language models.
VINS-Fusion, VINS-Fisheye, OpenVINS, EnVIO, ROVIO, S-MSCKF, ORB-SLAM2, NVIDIA Elbrus application of different sets of cameras and imu on different board including desktop and Jetson boards
Action Chunking Transformer implementation for low cost robot
DORA (Dataflow-Oriented Robotic Application) is middleware designed to streamline and simplify the creation of AI-based robotic applications. It offers low latency, composable, and distributed data…
Visual-inertial-wheel fusion odometry, better performance in scenes with drastic changes in light
📚 The list of vision-based SLAM / Visual Odometry open source, blogs, and papers
Learn how to design large-scale systems. Prep for the system design interview. Includes Anki flashcards.
Minimal, clean code for the Byte Pair Encoding (BPE) algorithm commonly used in LLM tokenization.
A collection of high-quality models for the MuJoCo physics engine, curated by Google DeepMind.
A simple monocular visual odometry (part of vSLAM) by ORB keypoints with initialization, tracking, local map and bundle adjustment. (WARNING: Hi, I'm sorry that this project is tuned for course dem…
A Production-ready Reinforcement Learning AI Agent Library brought by the Applied Reinforcement Learning team at Meta.
[ECCV 2022] Map-free Visual Relocalization: Metric Pose Relative to a Single Image