Skip to content
View rockets-cn's full-sized avatar

Block or report rockets-cn

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Beta Lists are currently in beta. Share feedback and report bugs.
Showing results

Official code for "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching"

Python 3,225 268 Updated Oct 16, 2024

Implementation of F5-TTS in MLX

Python 127 12 Updated Oct 15, 2024

Netflix-level subtitle cutting, translation, alignment, and even dubbing - one-click fully automated AI video subtitle team | Netflix级字幕切割、翻译、对齐、甚至加上配音,一键全自动视频搬运AI字幕组

Python 4,025 428 Updated Oct 16, 2024

[CVPR 2024] Real-Time Open-Vocabulary Object Detection

Python 4,503 443 Updated Jul 30, 2024

ChatTTS is a generative speech model for daily dialogue.

Jupyter Notebook 1 Updated Jun 3, 2024

🚀 基于 LLM 大语言模型的知识库问答系统。开箱即用,支持快速嵌入到第三方业务系统,1Panel 官方出品。

Python 1 Updated Jun 4, 2024

The next generation of motion control firmware

C++ 1,573 381 Updated Oct 14, 2024

Brand new TTS solution

Python 13,374 998 Updated Oct 11, 2024

Fuse ChatTTS with OpenVoice, upload a 10-second audio clip, and clone your personalized ChatTTS voice.

Python 334 42 Updated Oct 9, 2024

A toolkit for controlling Euro Truck Simulator 2 with python to develop self-driving algorithms.

Jupyter Notebook 1,511 188 Updated Oct 25, 2020

A Raspberry Pi operated Wireless Allsky Camera

JavaScript 1,182 181 Updated Oct 16, 2024

Code and additional files for an open source cable camera robot.

C++ 14 2 Updated Mar 29, 2021

Finetune Llama 3.2, Mistral, Phi & Gemma LLMs 2-5x faster with 80% less memory

Python 16,901 1,159 Updated Oct 15, 2024

Silero VAD: pre-trained enterprise-grade Voice Activity Detector

Python 4,170 409 Updated Oct 12, 2024

[NeurIPS 2024] Depth Anything V2. A More Capable Foundation Model for Monocular Depth Estimation

Python 3,539 296 Updated Aug 14, 2024

AutoGPT is the vision of accessible AI for everyone, to use and to build on. Our mission is to provide the tools, so that you can focus on what matters.

Python 167,543 44,256 Updated Oct 15, 2024

a pptx to markdown converter

Python 755 96 Updated May 3, 2024

Implementation for MatMul-free LM.

Python 2,905 182 Updated Sep 19, 2024

Get up and running with Llama 3.2, Mistral, Gemma 2, and other large language models.

Go 93,413 7,380 Updated Oct 15, 2024

FreeSWITCH ASR APP

C 159 103 Updated Jan 30, 2024

fork of chattts mac can run

Python 4 Updated Jun 3, 2024

Multilingual Voice Understanding Model

Python 3,018 279 Updated Sep 25, 2024

Together Mixture-Of-Agents (MoA) – 65.1% on AlpacaEval with OSS models

Python 2,564 354 Updated Aug 10, 2024

Fast voice assistant powered by Groq, Cartesia, and Vercel.

TypeScript 480 96 Updated Aug 2, 2024

OpenVLA: An open-source vision-language-action model for robotic manipulation.

Python 1,147 145 Updated Oct 15, 2024

Open Source AI Math Notes

Python 462 34 Updated Jun 15, 2024

PartyKit simplifies developing multiplayer applications

TypeScript 4,632 155 Updated Oct 14, 2024

Use Microsoft Edge's online text-to-speech service from Python WITHOUT needing Microsoft Edge or Windows or an API key

Python 5,492 560 Updated Oct 9, 2024

StarCraft II Learning Environment

Python 8,013 1,155 Updated Jul 23, 2024
Next