Skip to content
View HongzheBi's full-sized avatar
🇦🇮
hard-working
🇦🇮
hard-working
  • BUPT
  • Beijing

Highlights

  • Pro

Block or report HongzheBi

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Beta Lists are currently in beta. Share feedback and report bugs.
Showing results

Tracking Any Point (TAP)

Jupyter Notebook 1,242 118 Updated Aug 30, 2024

Text-to-video generation: CogVideoX (2024) and CogVideo (ICLR 2023)

Python 6,486 581 Updated Aug 30, 2024

Repository for Show-o, One Single Transformer to Unify Multimodal Understanding and Generation.

Python 599 28 Updated Aug 31, 2024

A curated list of 3D Vision papers relating to Robotics domain in the era of large models i.e. LLMs/VLMs, inspired by awesome-computer-vision, including papers, codes, and related websites

440 21 Updated Aug 25, 2024

Robot bimanual manipulation / dual-arm manipulation

128 8 Updated Aug 2, 2024

[ICRA 2023] A Joint Modeling of Vision-Language-Action for Target-oriented Grasping in Clutter

Python 83 10 Updated May 19, 2024

This repository compiles a list of papers related to the application of video technology in the field of robotics! Star⭐ the repo and follow me if you like what you see🤩.

101 6 Updated Aug 12, 2024

The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained model checkpoints, and example notebooks that show how to use th…

Jupyter Notebook 10,193 773 Updated Aug 21, 2024

world modeling challenge for humanoid robots

Python 150 12 Updated Aug 23, 2024

Code for paper "Patch-Level Training for Large Language Models"

Python 56 3 Updated Jul 18, 2024

SAM with text prompt

Jupyter Notebook 1,507 167 Updated Aug 1, 2024

[RSS 2024] Code for "Multimodal Diffusion Transformer: Learning Versatile Behavior from Multimodal Goals" for CALVIN experiments with pre-trained weights

C++ 43 2 Updated Jul 10, 2024

[Embodied-AI-Survey-2024] Paper list and projects for Embodied AI

397 23 Updated Aug 27, 2024

Official implementation for paper "EquiBot: SIM(3)-Equivariant Diffusion Policy for Generalizable and Data Efficient Learning".

Python 95 13 Updated Jul 2, 2024

Latest Advances on Embodied Multimodal LLMs (or Vison-Language-Action Models).

48 2 Updated Jul 4, 2024

HaMeR: Reconstructing Hands in 3D with Transformers

Python 336 29 Updated Jul 12, 2024

CleanDiffuser: An Easy-to-use Modularized Library for Diffusion Models in Decision Making

Jupyter Notebook 274 22 Updated Aug 31, 2024

OpenVLA: An open-source vision-language-action model for robotic manipulation.

Python 908 114 Updated Aug 27, 2024

为GPT/GLM等LLM大语言模型提供实用化交互接口,特别优化论文阅读/润色/写作体验,模块化设计,支持自定义快捷按钮&函数插件,支持Python和C++等项目剖析&自译解功能,PDF/LaTex论文翻译&总结功能,支持并行问询多种LLM模型,支持chatglm3等本地模型。接入通义千问, deepseekcoder, 讯飞星火, 文心一言, llama2, rwkv, claude2, m…

Python 63,547 7,874 Updated Aug 28, 2024

Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities

Python 19,428 2,473 Updated Aug 28, 2024

🤗 LeRobot: Making AI for Robotics more accessible with end-to-end learning

Python 5,922 503 Updated Aug 30, 2024

Official repository of Learning to Act from Actionless Videos through Dense Correspondences.

Python 154 10 Updated Apr 25, 2024
Python 185 12 Updated Jul 17, 2024

PointMamba: A Simple State Space Model for Point Cloud Analysis

Python 326 22 Updated Jun 13, 2024

A curated collection of papers, tutorials, videos, and other valuable resources related to Mamba.

279 26 Updated Aug 29, 2024

Official repo for "Mini-Gemini: Mining the Potential of Multi-modality Vision Language Models"

Python 3,153 277 Updated May 4, 2024

Latte: Latent Diffusion Transformer for Video Generation.

Python 1,611 168 Updated Aug 22, 2024

MiniSora: A community aims to explore the implementation path and future development direction of Sora.

Python 1,149 148 Updated Aug 14, 2024

minisora-DiT, a DiT reproduction based on XTuner from the open source community MiniSora

Python 35 2 Updated Mar 25, 2024

Fast Diffusion Models with Transformers

Python 663 88 Updated Oct 7, 2023
Next