Stars
Streamer-Sales 销冠 —— 卖货主播 LLM 大模型🛒🎁,一个能够根据给定的商品特点从激发用户购买意愿角度出发进行商品解说的卖货主播大模型。🚀⭐内含详细的数据生成流程❗ 📦另外还集成了 LMDeploy 加速推理🚀、RAG检索增强生成 📚、TTS文字转语音🔊、数字人生成 🦸、 Agent 使用网络查询实时信息🌐、ASR 语音转文字🎙️、Vue 生态搭建前端🍍、FastAPI 搭…
OCR software, free and offline. 开源、免费的离线OCR软件。支持截屏/批量导入图片,PDF文档识别,排除水印/页眉页脚,扫描/生成二维码。内置多国语言库。
A visual no-code/code-free web crawler/spider易采集:一个可视化浏览器自动化测试/数据采集/爬虫软件,可以无代码图形化的设计和执行爬虫任务。别名:ServiceWrapper面向Web应用的智能化服务封装系统。
Official implementations for paper: Zero-shot Image Editing with Reference Imitation
Context-aware AI Sales Agent to automate sales outreach.
V-Express aims to generate a talking head video under the control of a reference image, an audio, and a sequence of V-Kps images.
Official implementation of the paper 'InstructAvatar: Text-Guided Emotion and Motion Control for Avatar Generation'
[CVPR 2023] SadTalker:Learning Realistic 3D Motion Coefficients for Stylized Audio-Driven Single Image Talking Face Animation
ChatLaw:A Powerful LLM Tailored for Chinese Legal. 中文法律大模型
🤖一个基于 WeChaty 结合 OpenAi ChatGPT / Kimi / 讯飞等Ai服务实现的微信机器人 ,可以用来帮助你自动回复微信消息,或者管理微信群/好友,检测僵尸粉等...
Translate the video from one language to another and add dubbing. 将视频从一种语言翻译为另一种语言,并支持api调用
一个简单的本地网页界面,使用ChatTTS将文字合成为语音,同时支持对外提供API接口。A simple native web interface that uses ChatTTS to synthesize text into speech, along with support for external API interfaces.
1 min voice data can also be used to train a good TTS model! (few shot voice cloning)
zxypro1 / GPT-SoVITS
Forked from RVC-Boss/GPT-SoVITSGPT-SoVITS project Serverless version
ComfyUI-Manager is an extension designed to enhance the usability of ComfyUI. It offers management functions to install, remove, disable, and enable various custom nodes of ComfyUI. Furthermore, th…
A generative speech model for daily dialogue.
The most powerful and modular diffusion model GUI, api and backend with a graph/nodes interface.