Automatic Speech Recognition with Speaker Diarization based on OpenAI Whisper

Jupyter Notebook 3,682 325 Updated Oct 27, 2024

A generative speech model for daily dialogue.

Python 32,271 3,508 Updated Nov 5, 2024

Brand new TTS solution

Python 14,344 1,078 Updated Nov 10, 2024

Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audi…

Jupyter Notebook 7,533 559 Updated Nov 1, 2024

Generative models for conditional audio generation

Python 2,710 258 Updated Nov 5, 2024

An advanced singing voice synthesis system with high fidelity, expressiveness, controllability and flexibility based on DiffSinger: Singing Voice Synthesis via Shallow Diffusion Mechanism

Python 2,712 286 Updated Nov 8, 2024

Stable diffusion for real-time music generation

Python 3,406 391 Updated Jul 22, 2024

CREPE: A Convolutional REpresentation for Pitch Estimation -- pre-trained model (ICASSP 2018)

Python 1,119 159 Updated Aug 19, 2024

Audio Captioning datasets for PyTorch.

Python 106 6 Updated Nov 4, 2024

Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor / tokenizer, along with MusicGen, a simple and controllable…

Python 20,949 2,142 Updated Nov 11, 2024

Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding

Jupyter Notebook 6,306 778 Updated Nov 11, 2024

Unofficial implementation of Megatts2

Python 264 35 Updated Mar 23, 2024

Instant voice cloning by MIT and MyShell.

Python 29,747 2,927 Updated Aug 21, 2024

Text-to-Audio/Music Generation

Python 2,300 179 Updated Sep 29, 2024

Implementation of Natural Speech 2, Zero-shot Speech and Singing Synthesizer, in Pytorch

Python 1,281 100 Updated Sep 24, 2023

Code for the paper Hybrid Spectrogram and Waveform Source Separation

Python 8,344 1,058 Updated Apr 24, 2024
Python 51 11 Updated Apr 3, 2023

This is the code for the SpeechTokenizer presented in the SpeechTokenizer: Unified Speech Tokenizer for Speech Language Models. Samples are presented on

Python 472 40 Updated Jun 9, 2024

PyTorch implementation of VALL-E (Zero-Shot Text-To-Speech). Reproduced demo: https://lifeiteng.github.io/valle/index.html

Python 2,047 319 Updated Nov 14, 2023

Chat凉宫春日 (Chat Haruhi Suzumiya), an open-source role-playing chatbot by Cheng Li, Ziang Leng, and others.

Jupyter Notebook 1,831 163 Updated Aug 13, 2024

Rectified Rotary Position Embeddings

Python 339 30 Updated May 20, 2024

🦜🔗 Build context-aware reasoning applications

Jupyter Notebook 94,711 15,327 Updated Nov 12, 2024

Let us control diffusion models!

Python 30,338 2,728 Updated Feb 25, 2024

The official implementation of PTQD: Accurate Post-Training Quantization for Diffusion Models

Jupyter Notebook 87 5 Updated Mar 12, 2024

[ICCV 2023] Q-Diffusion: Quantizing Diffusion Models.

Python 327 21 Updated Mar 21, 2024

AutoGPT is the vision of accessible AI for everyone, to use and to build on. Our mission is to provide the tools, so that you can focus on what matters.

Python 168,242 44,378 Updated Nov 12, 2024

State-of-the-art 2D and 3D Face Analysis Project

Python 23,403 5,413 Updated Nov 10, 2024

CVPR2023 (highlight) - UniDistill: A Universal Cross-Modality Knowledge Distillation Framework for 3D Object Detection in Bird's-Eye View

Python 105 10 Updated Aug 5, 2023
Python 41 10 Updated Nov 1, 2022