Kaldi-compatible online & offline feature extraction with PyTorch, supporting CUDA, batch processing, chunk processing, and autograd - Provide C++ & Python API

C++ 187 35 Updated Sep 14, 2024

X-LANCE / VoiceFlow-TTS

[ICASSP 2024] This is the official code for "VoiceFlow: Efficient Text-to-Speech with Rectified Flow Matching"

Python 302 21 Updated Sep 3, 2024

CrossmodalGroup / DynamicVectorQuantization

Official Pytorch Implementation of Our CVPR2023 Paper: "Towards Accurate Image Coding: Improved Autoregressive Image Generation with Dynamic Vector Quantization"

Python 152 6 Updated Jul 23, 2023

lucidrains / e2-tts-pytorch

Implementation of E2-TTS, "Embarrassingly Easy Fully Non-Autoregressive Zero-Shot TTS", in Pytorch

Python 284 25 Updated Oct 13, 2024

Variante / video-postproc-toolbox

针对新的视频后期工作流制作的各种小工具

Python 17 Updated Apr 14, 2024

lucidrains / voicebox-pytorch

Implementation of Voicebox, new SOTA Text-to-speech network from MetaAI, in Pytorch

Python 599 50 Updated Oct 1, 2024

atong01 / conditional-flow-matching

TorchCFM: a Conditional Flow Matching library

Python 1,112 89 Updated Oct 9, 2024

Plachtaa / FAcodec

Training code for FAcodec presented in NaturalSpeech3

Python 166 19 Updated Aug 26, 2024

bdashore3 / flash-attention

Forked from Dao-AILab/flash-attention

Fast and memory-efficient exact attention

Python 237 20 Updated Jul 26, 2024

Python 15 4 Updated Oct 14, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Yushen CHEN SWivid

Achievements

Achievements

Highlights

Block or report SWivid

Stars

mit-han-lab / llm-awq

MahmoudAshraf97 / ctc-forced-aligner

SWivid / F5-TTS

sigoden / dufs

BytedanceSpeech / seed-tts-eval

feizc / FluxMusic

innnky / MagVITS

bfs18 / e2_tts

huggingface / speech-to-speech

csukuangfj / kaldifeat

X-LANCE / VoiceFlow-TTS

CrossmodalGroup / DynamicVectorQuantization

lucidrains / e2-tts-pytorch

Variante / video-postproc-toolbox

lucidrains / voicebox-pytorch

atong01 / conditional-flow-matching

Plachtaa / FAcodec

bdashore3 / flash-attention

dukGuo / valle-audiodec

Plachtaa / VALL-E-X

kale4eat / nisqalib

GitYCC / g2pW

FunAudioLLM / SenseVoice

FunAudioLLM / CosyVoice

OpenNMT / CTranslate2

mobiusml / faster-whisper

SYSTRAN / faster-whisper

Dao-AILab / flash-attention

facebookresearch / seamless_communication

fishaudio / fish-speech