Lists (1)
Sort Name ascending (A-Z)
Stars
Official code for "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching"
Netflix-level subtitle cutting, translation, alignment, and even dubbing - one-click fully automated AI video subtitle team | Netflix级字幕切割、翻译、对齐、甚至加上配音,一键全自动视频搬运AI字幕组
[CVPR 2024] Real-Time Open-Vocabulary Object Detection
rockets-cn / ChatTTS
Forked from 2noise/ChatTTSChatTTS is a generative speech model for daily dialogue.
rockets-cn / MaxKB
Forked from 1Panel-dev/MaxKB🚀 基于 LLM 大语言模型的知识库问答系统。开箱即用,支持快速嵌入到第三方业务系统,1Panel 官方出品。
Fuse ChatTTS with OpenVoice, upload a 10-second audio clip, and clone your personalized ChatTTS voice.
A toolkit for controlling Euro Truck Simulator 2 with python to develop self-driving algorithms.
A Raspberry Pi operated Wireless Allsky Camera
Code and additional files for an open source cable camera robot.
Finetune Llama 3.2, Mistral, Phi & Gemma LLMs 2-5x faster with 80% less memory
Silero VAD: pre-trained enterprise-grade Voice Activity Detector
[NeurIPS 2024] Depth Anything V2. A More Capable Foundation Model for Monocular Depth Estimation
AutoGPT is the vision of accessible AI for everyone, to use and to build on. Our mission is to provide the tools, so that you can focus on what matters.
Get up and running with Llama 3.2, Mistral, Gemma 2, and other large language models.
Multilingual Voice Understanding Model
Together Mixture-Of-Agents (MoA) – 65.1% on AlpacaEval with OSS models
Fast voice assistant powered by Groq, Cartesia, and Vercel.
openvla / openvla
Forked from TRI-ML/prismatic-vlmsOpenVLA: An open-source vision-language-action model for robotic manipulation.
PartyKit simplifies developing multiplayer applications
Use Microsoft Edge's online text-to-speech service from Python WITHOUT needing Microsoft Edge or Windows or an API key