C++ 9 4 Updated Apr 4, 2024

Open-source VSLAM fundamentals tutorial, with exercise code for each chapter

C++ 156 18 Updated Nov 26, 2023

Open-source evaluation toolkit of large vision-language models (LVLMs), support 160+ VLMs, 50+ benchmarks

Python 1,337 188 Updated Nov 15, 2024

[CoRL 2024] Open-TeleVision: Teleoperation with Immersive Active Visual Feedback

Python 643 64 Updated Sep 27, 2024

Imitation learning algorithms with Co-training for Mobile ALOHA: ACT, Diffusion Policy, VINN

Python 3,050 564 Updated May 15, 2024

[RSS 2024]: Expressive Whole-Body Control for Humanoid Robots

Python 189 17 Updated Jul 19, 2024

A fast and flexible implementation of Rigid Body Dynamics algorithms and their analytical derivatives

C++ 1,916 395 Updated Nov 15, 2024

VILA - a multi-image visual language model with training, inference and evaluation recipe, deployable from cloud to edge (Jetson Orin and laptops)

Python 1,992 158 Updated Oct 31, 2024

Open Platform for Embodied Agents

Python 268 15 Updated Oct 13, 2024

High-Resolution Image Synthesis with Latent Diffusion Models

Jupyter Notebook 11,866 1,535 Updated Feb 29, 2024

A latent text-to-image diffusion model

Jupyter Notebook 68,369 10,167 Updated Jun 18, 2024

Text-to-image generation: CogView3-Plus and CogView3 (ECCV 2024)

Python 244 13 Updated Oct 15, 2024

official code repo for paper "CogView2: Faster and Better Text-to-Image Generation via Hierarchical Transformers"

Python 950 78 Updated Aug 3, 2022
22 Updated May 23, 2023

A high-performance runtime framework for modern robotics.

C++ 771 103 Updated Nov 15, 2024

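OCR software, free and offline. Open-source, free offline OCR software. Supports screenshots and batch image import, PDF document recognition, exclusion of watermarks and headers/footers, and scanning and generating QR codes. Multilingual language libraries are built in.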

Python 27,268 2,736 Updated Oct 18, 2024

✨✨ MME-RealWorld: Could Your Multimodal LLM Challenge High-Resolution Real-World Scenarios that are Difficult for Humans?

Python 78 5 Updated Nov 14, 2024

Official release of InternLM2.5 base and chat models. 1M context support

Python 6,465 455 Updated Oct 10, 2024

BlueLM(蓝心大模型): Open large language models developed by vivo AI Lab

Python 852 59 Updated Apr 22, 2024

LLaVA-HR: High-Resolution Large Language-Vision Assistant

Python 213 11 Updated Aug 14, 2024

Mobile ALOHA: Learning Bimanual Mobile Manipulation with Low-Cost Whole-Body Teleoperation

Jupyter Notebook 3,878 674 Updated Jun 22, 2024

The build files for the Dexhand

C++ 262 53 Updated Jul 26, 2024

This document is a Chinese-language tutorial on using Habitat-sim

Python 33 3 Updated Mar 10, 2023

Code for "Temporal Difference Learning for Model Predictive Control"

Python 358 55 Updated Nov 25, 2023

An API standard for single-agent reinforcement learning environments, with popular reference environments and related utilities (formerly Gym)

Python 7,360 826 Updated Nov 14, 2024

A gym environment for PushT

Python 54 9 Updated Jul 5, 2024

Qwen2.5 is the large language model series developed by Qwen team, Alibaba Cloud.

Shell 9,636 592 Updated Nov 11, 2024

pix2tex: Using a ViT to convert images of equations into LaTeX code.

Python 12,680 1,031 Updated Jul 5, 2024

ReKep: Spatio-Temporal Reasoning of Relational Keypoint Constraints for Robotic Manipulation

Python 515 52 Updated Aug 30, 2024

A version 1.1 of the Alexander Koch low cost robot arm with some small changes.

434 43 Updated Sep 17, 2024