Stars
Towards Open-source GPT-4o with Vision, Speech and Duplex Capabilities。
The first open autoregressive foundational video AI model.
The official repository for paper "Tora: Trajectory-oriented Diffusion Transformer for Video Generation"
⭐ Dynamically generate stats SVG from your Github, LeetCode, Steam, and more in #Cyberpunk style :)
数字底座是一款面向大型政府、企业数字化转型,基于身份认证、组织架构、岗位职务、应用系统、资源角色等功能构建的统一且安全的管理支撑平台。数字底座基于三员管理模式,具备微服务、多租户、容器化和国产化,支持用户利用代码生成器快速构建自己的业务应用,同时可关联诸多成熟且好用的内部生态应用
A Multimodal Native Agent Framework for Smart Hardware and More
Hallo2: Long-Duration and High-Resolution Audio-driven Portrait Image Animation
TM1638 full function driver library for general MCU and Linux.
Unleashing the Power of Distributed Content Management and Transformation
Nexa SDK is a comprehensive toolkit for supporting ONNX and GGML models. It supports text generation, image generation, vision-language models (VLM), auto-speech-recognition (ASR), and text-to-spee…
Bayesian optimisation & Reinforcement Learning library developed by Huawei Noah's Ark Lab
Next-Generation Interactive Intelligent Programming Assistant
SSD1306 full function driver library for general MCU and Linux.
Cocos simplifies game creation and distribution with Cocos Creator, a free, open-source, cross-platform game engine. Empowering millions of developers to create high-performance, engaging 2D/3D gam…
MCP3421 full function driver library for general MCU and Linux.
[NeurIPS 2024 Datasets and Benchmarks Track] Closed-Loop E2E-AD Benchmark Enhanced by World Model RL Expert
A multi-modal vector database that supports upserts and vector queries using unified SQL (MySQL-Compatible) on structured and unstructured data, while meeting the requirements of high concurrency a…
PyTorch Implementation of AudioLCM (ACM-MM'24): a efficient and high-quality text-to-audio generation with latent consistency model.
CorrNet3D: Unsupervised End-to-end Learning of Dense Correspondence for 3D Point Clouds
Hallo: Hierarchical Audio-Driven Visual Synthesis for Portrait Image Animation
[NeurIPS 2024] An official implementation of ShareGPT4Video: Improving Video Understanding and Generation with Better Captions
CSGHub is an open-source large model platform just like on-premise version of Hugging Face. You can easily manage models and datasets, deploy model applications and setup model finetune or inferenc…
Real-time and accurate open-vocabulary end-to-end object detection