Block or Report
Block or report xddun
Contact GitHub support about this user’s behavior. Learn more about reporting abuse.
Report abuseStars
Language
Sort by: Recently starred
yihong0618 / ChatTTS
Forked from 2noise/ChatTTSChatTTS is a generative speech model for daily dialogue.
A generative speech model for daily dialogue.
🚀 一键部署(含离线整合包)!基于 ChatTTS ,支持流式输出、音色抽卡、长音频生成和分角色朗读。简单易用,无需复杂安装。
Simple and efficient pytorch-native transformer text generation in <1000 LOC of python.
1 min voice data can also be used to train a good TTS model! (few shot voice cloning)
VITS2: Improving Quality and Efficiency of Single-Stage Text-to-Speech with Adversarial Learning and Architecture Design
Use Microsoft Edge's online text-to-speech service from Python WITHOUT needing Microsoft Edge or Windows or an API key
vits2 backbone with multilingual-bert
🍦 ChatTTS-Forge is a project developed around TTS generation model, implementing an API Server and a Gradio-based WebUI.
🔊 Text-Prompted Generative Audio Model
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
[ECCV 2024] Official code implementation of Vary: Scaling Up the Vision Vocabulary of Large Vision Language Models.
Chat-甄嬛是利用《甄嬛传》剧本中所有关于甄嬛的台词和语句,基于ChatGLM2进行LoRA微调得到的模仿甄嬛语气的聊天语言模型。
A WebUI for Efficient Fine-Tuning of 100+ LLMs (ACL 2024)
《开源大模型食用指南》基于Linux环境快速部署开源大模型,更适合中国宝宝的部署教程
A high-throughput and memory-efficient inference and serving engine for LLMs
PyTorch code for training EfficientPS for Panoptic Segmentation
OneFormer: One Transformer to Rule Universal Image Segmentation, arxiv 2022 / CVPR 2023
Qwen2 is the large language model series developed by Qwen team, Alibaba Cloud.
[ECCV 2024] PowerPaint, a versatile image inpainting model that supports text-guided object inpainting, object removal, image outpainting and shape-guided object inpainting with only a single model…
DeepSeek-Coder-V2: Breaking the Barrier of Closed-Source Models in Code Intelligence
Real-time microphone noise suppression on Linux.
TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficie…
PixArt-Σ: Weak-to-Strong Training of Diffusion Transformer for 4K Text-to-Image Generation
小红书笔记 | 评论爬虫、抖音视频 | 评论爬虫、快手视频 | 评论爬虫、B 站视频 | 评论爬虫、微博帖子 | 评论爬虫
Karras et al. (2022) diffusion models for PyTorch
A paper list of some recent Transformer-based CV works.
Hugging StableDiffusion, Hugging Future.