Speech-to-text, text-to-speech, speaker recognition, and VAD using next-gen Kaldi with onnxruntime without Internet connection. Support embedded systems, Android, iOS, Raspberry Pi, RISC-V, x86_64 …

C++ 3,252 380 Updated Oct 6, 2024

Tele-AI / TeleSpeech-ASR

Python 476 39 Updated Jun 7, 2024

lovemefan / telespeech-asr-python

Python 31 2 Updated Jul 17, 2024

exadel-inc / CompreFace

Leading free and open-source face recognition system

Java 5,340 734 Updated Oct 5, 2024

HumanSignal / labelImg

LabelImg is now part of the Label Studio community. The popular image annotation tool created by Tzutalin is no longer actively being developed, but you can check out Label Studio, the open source …

Python 22,561 6,272 Updated Jun 7, 2024

lenML / ChatTTS-Forge

🍦 ChatTTS-Forge is a project developed around TTS generation model, implementing an API Server and a Gradio-based WebUI.

Python 698 85 Updated Oct 4, 2024

chatchat-space / Langchain-Chatchat

Langchain-Chatchat（原Langchain-ChatGLM）基于 Langchain 与 ChatGLM, Qwen 与 Llama 等语言模型的 RAG 与 Agent 应用 | Langchain-Chatchat (formerly langchain-ChatGLM), local knowledge based LLM (like ChatGLM, Qwen and…

TypeScript 31,463 5,487 Updated Sep 30, 2024

songquanpeng / one-api

OpenAI 接口管理 & 分发系统，支持 Azure、Anthropic Claude、Google PaLM 2 & Gemini、智谱 ChatGLM、百度文心一言、讯飞星火认知、阿里通义千问、360 智脑以及腾讯混元，可用于二次分发管理 key，仅单可执行文件，已打包好 Docker 镜像，一键部署，开箱即用. OpenAI key management & redistributi…

JavaScript 18,265 4,123 Updated Sep 22, 2024

modelscope / FunASR

A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.

Python 6,190 659 Updated Sep 30, 2024

ufal / whisper_streaming

Whisper realtime streaming for long speech-to-text transcription and translation

Python 1,888 227 Updated Oct 4, 2024

SYSTRAN / faster-whisper

Faster Whisper transcription with CTranslate2

Python 11,708 975 Updated Aug 21, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

stein1988

Block or report stein1988

Lists (2)

ASR

TTS

Stars

hwchase17 / langgraph-engineer

datawhalechina / llms-from-scratch-cn

rasbt / LLMs-from-scratch

QwenLM / Qwen-Agent

phodal / prompt-patterns

modelscope / modelscope

modelscope / modelscope-classroom

modelscope / modelscope-agent

FunAudioLLM / SenseVoice

FunAudioLLM / CosyVoice

warmcat / libwebsockets

ultranationalism / GPT-SoVITS-mindspore

YoMio-Tech-Inc / GPT-SoVITS2

netease-youdao / EmotiVoice

fishaudio / fish-speech

X-T-E-R / GPT-SoVITS-Inference

RVC-Boss / GPT-SoVITS

PlayVoice / vits_chinese

langchain-ai / langgraph

k2-fsa / sherpa-onnx