Stars
RoboTwin: Dual-Arm Robot Benchmark with Generative Digital Twins
Nexa SDK is a comprehensive toolkit for supporting ONNX and GGML models. It supports text generation, image generation, vision-language models (VLM), automatic speech recognition (ASR), and text-to-spee…
Code for Paper "UniAnimate: Taming Unified Video Diffusion Models for Consistent Human Image Animation".
Accelerate your Stable Diffusion inference with the library's universal C/C++ framework design, powered by ONNXRuntime and running across platforms.
Open-source multimodal large language model that can hear and talk while thinking. Features real-time end-to-end speech input and streaming audio output for conversation.
Next-Generation Interactive Intelligent Programming Assistant
Framework for fast implementation of data processing and operating pipelines
Explainable Person Re-Identification with Attribute-guided Metric Distillation
Cocos simplifies game creation and distribution with Cocos Creator, a free, open-source, cross-platform game engine. Empowering millions of developers to create high-performance, engaging 2D/3D gam…
Bayesian optimisation & Reinforcement Learning library developed by Huawei Noah's Ark Lab
[CVPR 2024] Official code for "Text-Driven Image Editing via Learnable Regions"
[NeurIPS 2024 Datasets and Benchmarks Track] Closed-Loop E2E-AD Benchmark Enhanced by World Model RL Expert
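PyTorch Implementation of AudioLCM (ACM-MM'24): an efficient and high-quality text-to-audio generation with latent consistency model.
cube studio: an open-source, cloud-native, one-stop machine learning/deep learning/large-model AI platform. Supports SSO login, multi-tenancy, big data platform integration, online notebook development, drag-and-drop task-flow pipeline orchestration, multi-node multi-GPU distributed training, hyperparameter search, VGPU inference services, edge computing, serverless, annotation platform, automated annotation, dataset management, large-model fine-tuning, vllm large-model inference, llmops, private knowledge base, AI model app store, one-click model development/inference/fine-tuning,…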
A multi-modal vector database that supports upserts and vector queries using unified SQL (MySQL-Compatible) on structured and unstructured data, while meeting the requirements of high concurrency a…
Hallo: Hierarchical Audio-Driven Visual Synthesis for Portrait Image Animation
Official code repository of CBGBench: Fill in the Blank of Protein-Molecule Complex Binding Graph
[NeurIPS 2024] An official implementation of ShareGPT4Video: Improving Video Understanding and Generation with Better Captions
Real-time and accurate open-vocabulary end-to-end object detection
CSGHub is an open-source large model platform, much like an on-premise version of Hugging Face. You can easily manage models and datasets, deploy model applications and set up model finetune or inferenc…
Listen to Mechanical Keyboard Sounds with Every Keystroke - It's Fast
Official repo for WavCraft, an AI agent for audio creation and editing