Skip to content
View Shengqiang-Li's full-sized avatar
💭
I may be slow to respond.
💭
I may be slow to respond.
  • Northwestern Polytechnical University
  • Suzhou
  • 07:10 (UTC +08:00)

Block or report Shengqiang-Li

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Beta Lists are currently in beta. Share feedback and report bugs.
Showing results

Official code for "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching"

Python 176 15 Updated Oct 11, 2024

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 28,213 4,173 Updated Oct 11, 2024
Python 6,184 462 Updated Oct 11, 2024

An Open-Sourced LLM-empowered Foundation TTS System

Python 275 14 Updated Sep 25, 2024

Qwen2.5 is the large language model series developed by Qwen team, Alibaba Cloud.

Shell 8,795 550 Updated Oct 2, 2024

Finetune Llama 3.2, Mistral, Phi & Gemma LLMs 2-5x faster with 80% less memory

Python 16,737 1,149 Updated Oct 11, 2024

Firefly: 大模型训练工具,支持训练Qwen2.5、Qwen2、Yi1.5、Phi-3、Llama3、Gemma、MiniCPM、Yi、Deepseek、Orion、Xverse、Mixtral-8x7B、Zephyr、Mistral、Baichuan2、Llma2、Llama、Qwen、Baichuan、ChatGLM2、InternLM、Ziya2、Vicuna、Bloom等大模型

Python 5,718 519 Updated Sep 19, 2024

主要记录大语言大模型(LLMs) 算法(应用)工程师相关的知识及面试题

HTML 3,036 360 Updated Aug 19, 2024

Simple and efficient pytorch-native transformer text generation in <1000 LOC of python.

Python 5,591 510 Updated Oct 4, 2024

A quantization algorithm for LLM

Cuda 99 5 Updated Jun 21, 2024

ChatGLM-6B: An Open Bilingual Dialogue Language Model | 开源双语对话语言模型

Python 40,489 5,196 Updated Jun 27, 2024

The open source code for SimpleSpeech series

Python 92 6 Updated Oct 8, 2024

LLaMA-Omni is a low-latency and high-quality end-to-end speech interaction model built upon Llama-3.1-8B-Instruct, aiming to achieve speech capabilities at the GPT-4o level.

Python 2,329 144 Updated Sep 24, 2024

Reverse Engineering of Supervised Semantic Speech Tokenizer (S3Tokenizer) proposed in CosyVoice

Python 107 9 Updated Oct 11, 2024

🤯 Lobe Chat - an open-source, modern-design AI chat framework. Supports Multi AI Providers( OpenAI / Claude 3 / Gemini / Ollama / Azure / DeepSeek), Knowledge Base (file upload / knowledge manageme…

TypeScript 42,339 9,570 Updated Oct 11, 2024
Jupyter Notebook 42 3 Updated Oct 11, 2024

open-source multimodal large language model that can hear, talk while thinking. Featuring real-time end-to-end speech input and streaming audio output conversational capabilities.

Python 2,798 255 Updated Sep 25, 2024
TypeScript 18 4 Updated Aug 17, 2024

Automatically Update Text-to-speech (TTS) Papers Daily using Github Actions (Update Every 12th hours)

Python 240 20 Updated Oct 11, 2024

Denoising Diffusion Autoregressive Model for Raw Speech Waveform Generation

Python 23 2 Updated Mar 8, 2024

High-Resolution Image Synthesis with Latent Diffusion Models

Jupyter Notebook 11,650 1,518 Updated Feb 29, 2024

Official PyTorch Implementation of "Scalable Diffusion Models with Transformers"

Python 6,110 541 Updated May 31, 2024

Code for "AnyGPT: Unified Multimodal LLM with Discrete Sequence Modeling"

Python 757 60 Updated Aug 27, 2024

Code for "Diffusion Model Alignment Using Direct Preference Optimization"

Python 242 22 Updated Dec 28, 2023

The open source code for LLM-Codec

Python 110 4 Updated Aug 18, 2024

Official Pytorch Implementation of "Diff-HierVC: Diffusion-based Hierarchical Voice Conversion with Robust Pitch Generation and Masked Prior for Zero-shot Speaker Adaptation"

Python 193 18 Updated Jul 3, 2024

PitchVC: Pitch Conditioned Any-to-Many Voice Conversion

Python 34 4 Updated Jun 6, 2024

FreeVC: Towards High-Quality Text-Free One-Shot Voice Conversion

Python 592 109 Updated Mar 23, 2024

LibriTTS-P: A Corpus with Speaking Style and Speaker Identity Prompts for Text-to-Speech and Style Captioning

111 2 Updated Jun 13, 2024
Next