Jiang-Stan

Jiang-Stan

6 followers · 2 following

Achievements

Stars

MahmoudAshraf97 / whisper-diarization

Automatic Speech Recognition with Speaker Diarization based on OpenAI Whisper

Jupyter Notebook 3,682 325 Updated Oct 27, 2024

2noise / ChatTTS

A generative speech model for daily dialogue.

Python 32,271 3,508 Updated Nov 5, 2024

fishaudio / fish-speech

Brand new TTS solution

Python 14,344 1,078 Updated Nov 10, 2024

open-mmlab / Amphion

Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audi…

Jupyter Notebook 7,533 559 Updated Nov 1, 2024

Stability-AI / stable-audio-tools

Generative models for conditional audio generation

Python 2,710 258 Updated Nov 5, 2024

openvpi / DiffSinger

Forked from MoonInTheRiver/DiffSinger

An advanced singing voice synthesis system with high fidelity, expressiveness, controllability and flexibility based on DiffSinger: Singing Voice Synthesis via Shallow Diffusion Mechanism

Python 2,712 286 Updated Nov 8, 2024

riffusion / riffusion-hobby

Stable diffusion for real-time music generation

Python 3,406 391 Updated Jul 22, 2024

marl / crepe

CREPE: A Convolutional REpresentation for Pitch Estimation -- pre-trained model (ICASSP 2018)

Python 1,119 159 Updated Aug 19, 2024

Labbeti / aac-datasets

Audio Captioning datasets for PyTorch.

Python 106 6 Updated Nov 4, 2024

facebookresearch / audiocraft

Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor / tokenizer, along with MusicGen, a simple and controllable…

Python 20,949 2,142 Updated Nov 11, 2024

pyannote / pyannote-audio

Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding

Jupyter Notebook 6,306 778 Updated Nov 11, 2024

LSimon95 / megatts2

Unoffical implementation of Megatts2

Python 264 35 Updated Mar 23, 2024

myshell-ai / OpenVoice

Instant voice cloning by MIT and MyShell.

Python 29,747 2,927 Updated Aug 21, 2024

haoheliu / AudioLDM2

Text-to-Audio/Music Generation

Python 2,300 179 Updated Sep 29, 2024

lucidrains / naturalspeech2-pytorch

Implementation of Natural Speech 2, Zero-shot Speech and Singing Synthesizer, in Pytorch

Python 1,281 100 Updated Sep 24, 2023

facebookresearch / demucs

Code for the paper Hybrid Spectrogram and Waveform Source Separation

Python 8,344 1,058 Updated Apr 24, 2024

Mikxox / EnCodec_Trainer

Python 51 11 Updated Apr 3, 2023

ZhangXInFD / SpeechTokenizer

This is the code for the SpeechTokenizer presented in the SpeechTokenizer: Unified Speech Tokenizer for Speech Language Models. Samples are presented on

Python 472 40 Updated Jun 9, 2024

lifeiteng / vall-e

PyTorch implementation of VALL-E(Zero-Shot Text-To-Speech), Reproduced Demo https://lifeiteng.github.io/valle/index.html

Python 2,047 319 Updated Nov 14, 2023

LC1332 / Chat-Haruhi-Suzumiya

Chat凉宫春日, An open sourced Role-Playing chatbot Cheng Li, Ziang Leng, and others.

Jupyter Notebook 1,831 163 Updated Aug 13, 2024

bojone / rerope

Rectified Rotary Position Embeddings

Python 339 30 Updated May 20, 2024

langchain-ai / langchain

🦜🔗 Build context-aware reasoning applications

Jupyter Notebook 94,711 15,327 Updated Nov 12, 2024

openai / guided-diffusion

Python 6,256 822 Updated Jul 2, 2024

lllyasviel / ControlNet

Let us control diffusion models!

Python 30,338 2,728 Updated Feb 25, 2024

ziplab / PTQD

The official implementation of PTQD: Accurate Post-Training Quantization for Diffusion Models

Jupyter Notebook 87 5 Updated Mar 12, 2024

Xiuyu-Li / q-diffusion

[ICCV 2023] Q-Diffusion: Quantizing Diffusion Models.

Python 327 21 Updated Mar 21, 2024

Significant-Gravitas / AutoGPT

AutoGPT is the vision of accessible AI for everyone, to use and to build on. Our mission is to provide the tools, so that you can focus on what matters.

Python 168,242 44,378 Updated Nov 12, 2024

deepinsight / insightface

State-of-the-art 2D and 3D Face Analysis Project

Python 23,403 5,413 Updated Nov 10, 2024

megvii-research / CVPR2023-UniDistill

CVPR2023 (highlight) - UniDistill: A Universal Cross-Modality Knowledge Distillation Framework for 3D Object Detection in Bird's-Eye View

Python 105 10 Updated Aug 5, 2023

LiuXiaoxuanPKU / GACT-ICML

Python 41 10 Updated Nov 1, 2022

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Jiang-Stan

Achievements

Achievements

Block or report Jiang-Stan

Stars

MahmoudAshraf97 / whisper-diarization

2noise / ChatTTS

fishaudio / fish-speech

open-mmlab / Amphion

Stability-AI / stable-audio-tools

openvpi / DiffSinger

riffusion / riffusion-hobby

marl / crepe

Labbeti / aac-datasets

facebookresearch / audiocraft

pyannote / pyannote-audio

LSimon95 / megatts2

myshell-ai / OpenVoice

haoheliu / AudioLDM2

lucidrains / naturalspeech2-pytorch

facebookresearch / demucs

Mikxox / EnCodec_Trainer

ZhangXInFD / SpeechTokenizer

lifeiteng / vall-e

LC1332 / Chat-Haruhi-Suzumiya

bojone / rerope

langchain-ai / langchain

openai / guided-diffusion

lllyasviel / ControlNet

ziplab / PTQD

Xiuyu-Li / q-diffusion

Significant-Gravitas / AutoGPT

deepinsight / insightface

megvii-research / CVPR2023-UniDistill

LiuXiaoxuanPKU / GACT-ICML