Stars
🍦 Speech-AI-Forge is a project developed around TTS generation model, implementing an API Server and a Gradio-based WebUI.
cengfubo / ChatTTS
Forked from 6drf21e/ChatTTS_colab🚀 一键部署(含离线整合包)!基于 ChatTTS ,支持音色抽卡、长音频生成和分角色朗读。简单易用,无需复杂安装。
Fuse ChatTTS with OpenVoice, upload a 10-second audio clip, and clone your personalized ChatTTS voice.
一个简单的本地网页界面,使用ChatTTS将文字合成为语音,同时支持对外提供API接口。A simple native web interface that uses ChatTTS to synthesize text into speech, along with support for external API interfaces.
A generative speech model for daily dialogue.
Official code for "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching"
Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.
An Open-Sourced LLM-empowered Foundation TTS System
An embedded WebView2 browser project for Microsoft Excel.
Create and modify PDF documents in any JavaScript environment
Standalone tool to edit custom UI part of Office open document file format
the comfyui custom node for UVR5 to separate vocals and background music
make your Speaker talking as Native style with own voice!
Jan is an open source alternative to ChatGPT that runs 100% offline on your computer. Multiple engine support (llama.cpp, TensorRT-LLM)
The open source platform for AI-native application development.
🚀 阿里通义千问2.5大模型逆向API白嫖测试【特长:六边形战士】,支持高速流式输出、无水印AI绘图、长文档解读、图像解析、多轮对话,零配置部署,多路token支持,自动清理会话痕迹。
🚀 KIMI AI 长文本大模型逆向API白嫖测试【特长:长文本解读整理】,支持高速流式输出、智能体对话、联网搜索、长文档解读、图像OCR、多轮对话,零配置部署,多路token支持,自动清理会话痕迹。
Excel-DNA - Free and easy .NET for Excel. This repository contains the core Excel-DNA library.
NocoBase is a scalability-first, open-source no-code/low-code platform for building business applications and enterprise solutions.
Use Microsoft Edge's online text-to-speech service from Python WITHOUT needing Microsoft Edge or Windows or an API key
OCR software, free and offline. 开源、免费的离线OCR软件。支持截屏/批量导入图片,PDF文档识别,排除水印/页眉页脚,扫描/生成二维码。内置多国语言库。
OCR离线图片文字识别命令行windows程序,以JSON字符串形式输出结果,方便别的程序调用。基于 RapidOcrOnnx 。
Whisper.net. Speech to text made simple using Whisper Models
A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.
A nearly-live implementation of OpenAI's Whisper.