Nexa SDK is a comprehensive toolkit for supporting ONNX and GGML models. It supports text generation, image generation, vision-language models (VLM), automatic speech recognition (ASR), and text-to-spee…

Python 1,923 284 Updated Oct 10, 2024

ali-vilab / UniAnimate

Code for Paper "UniAnimate: Taming Unified Video Diffusion Models for Consistent Human Image Animation".

Python 988 52 Updated Jul 23, 2024

freechat-fun / freechat

https://freechat.fun

Java 605 121 Updated Oct 9, 2024

Windsander / ADI-Stable-Diffusion

Accelerate your Stable Diffusion inference with the library's universal C/C++ framework design, powered by ONNXRuntime and running across platforms.

C++ 627 102 Updated Aug 16, 2024

gpt-omni / mini-omni

Open-source multimodal large language model that can hear and talk while thinking, featuring real-time end-to-end speech input and streaming audio output for conversation.

Python 2,784 254 Updated Sep 25, 2024

fufankeji / MateGen

Next-Generation Interactive Intelligent Programming Assistant

Python 1,033 170 Updated Sep 20, 2024

CubicZebra / informatics

Framework for fast implementation of data processing and operating pipelines

Python 337 17 Updated Aug 7, 2024

SheldongChen / AMD.github.io

Explainable Person Re-Identification with Attribute-guided Metric Distillation

Python 117 22 Updated Jul 18, 2022

cocos / cocos-engine

Cocos simplifies game creation and distribution with Cocos Creator, a free, open-source, cross-platform game engine. Empowering millions of developers to create high-performance, engaging 2D/3D gam…

C++ 7,196 1,831 Updated Oct 10, 2024

RookieXwc / GPICTURE

Python 59 8 Updated Oct 9, 2024

huawei-noah / HEBO

Bayesian optimisation & Reinforcement Learning library developed by Huawei Noah's Ark Lab

Jupyter Notebook 3,255 587 Updated Oct 10, 2024

yuanze-lin / Learnable_Regions

[CVPR 2024] Official code for "Text-Driven Image Editing via Learnable Regions"

Python 263 21 Updated Sep 28, 2024

Thinklab-SJTU / Bench2Drive

[NeurIPS 2024 Datasets and Benchmarks Track] Closed-Loop E2E-AD Benchmark Enhanced by World Model RL Expert

Python 1,253 79 Updated Sep 28, 2024

Text-to-Audio / AudioLCM

PyTorch Implementation of AudioLCM (ACM-MM'24): an efficient and high-quality text-to-audio generation with latent consistency model.

Python 1,118 177 Updated Jul 17, 2024

data-infra / cube-studio

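cube studio is an open-source, cloud-native, one-stop machine learning / deep learning / large-model AI platform. It supports SSO login, multi-tenancy, big data platform integration, online notebook development, drag-and-drop pipeline orchestration of task flows, multi-node multi-GPU distributed training, hyperparameter search, inference services with vGPU, edge computing, serverless, a labeling platform, automated labeling, dataset management, large-model fine-tuning, vllm large-model inference, llmops, private knowledge bases, and an AI model app store, with one-click model development/inference/fine-tuning, …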

Jupyter Notebook 2,101 75 Updated Oct 6, 2024

ServerlessOS / Aiturbo-vGPU

Forked from MaoZzzzz/Aiturbo-vGPU

Python 11 3 Updated Nov 9, 2023

dingodb / dingo

A multi-modal vector database that supports upserts and vector queries using unified SQL (MySQL-Compatible) on structured and unstructured data, while meeting the requirements of high concurrency a…

Java 1,531 259 Updated Oct 10, 2024

fudan-generative-vision / hallo

Hallo: Hierarchical Audio-Driven Visual Synthesis for Portrait Image Animation

Python 9,282 1,279 Updated Sep 14, 2024

Langboat / Mengzi3

Python 2,032 31 Updated Oct 9, 2024

EDAPINENUT / CBGBench

Official code repository of CBGBench: Fill in the Blank of Protein-Molecule Complex Binding Graph

Python 190 28 Updated Oct 10, 2024

ShareGPT4Omni / ShareGPT4Video

[NeurIPS 2024] An official implementation of ShareGPT4Video: Improving Video Understanding and Generation with Better Captions

Python 1,242 44 Updated Oct 9, 2024

om-ai-lab / OmDet

Real-time and accurate open-vocabulary end-to-end object detection

Python 1,509 142 Updated Sep 6, 2024

OpenCSGs / csghub

CSGHub is an open-source large model platform, much like an on-premise version of Hugging Face. You can easily manage models and datasets, deploy model applications and set up model finetune or inferenc…

Vue 2,894 450 Updated Oct 10, 2024

bytewiz3 / TravelGPT

Python 1,016 179 Updated Oct 9, 2024

ZacharyL2 / KeyEcho

Listen to Mechanical Keyboard Sounds with Every Keystroke - It's Fast

Rust 640 13 Updated Sep 13, 2024

JinhuaLiang / WavCraft

Official repo for WavCraft, an AI agent for audio creation and editing

Python 649 96 Updated Sep 13, 2024

FSDSS-101 mono-max

Stars

juggleim / im-server

guanchuwang / redis-bench

TianxingChen / RoboTwin

520CCC / AIGenerateCode

NexaAI / nexa-sdk