Skip to content
View SWivid's full-sized avatar

Highlights

  • Pro

Block or report SWivid

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

[MLSys 2024 Best Paper Award] AWQ: Activation-aware Weight Quantization for LLM Compression and Acceleration

Python 2,420 184 Updated Oct 10, 2024

Text to speech alignment using CTC forced alignment

Python 100 17 Updated Sep 28, 2024

Official code for "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching"

Python 1,817 161 Updated Oct 14, 2024

A file server that supports static serving, uploading, searching, accessing control, webdav...

Rust 6,170 309 Updated Sep 25, 2024

Text-to-Music Generation with Rectified Flow Transformers

Python 1,556 120 Updated Sep 6, 2024

VITS with phoneme-level prosody modeling based on MaskGIT

Python 74 7 Updated Aug 31, 2024
Python 52 7 Updated Sep 3, 2024

Speech To Speech: an effort for an open-sourced and modular GPT4-o

Python 3,240 340 Updated Oct 14, 2024

Kaldi-compatible online & offline feature extraction with PyTorch, supporting CUDA, batch processing, chunk processing, and autograd - Provide C++ & Python API

C++ 187 35 Updated Sep 14, 2024

[ICASSP 2024] This is the official code for "VoiceFlow: Efficient Text-to-Speech with Rectified Flow Matching"

Python 302 21 Updated Sep 3, 2024

Official Pytorch Implementation of Our CVPR2023 Paper: "Towards Accurate Image Coding: Improved Autoregressive Image Generation with Dynamic Vector Quantization"

Python 152 6 Updated Jul 23, 2023

Implementation of E2-TTS, "Embarrassingly Easy Fully Non-Autoregressive Zero-Shot TTS", in Pytorch

Python 284 25 Updated Oct 13, 2024

针对新的视频后期工作流制作的各种小工具

Python 17 Updated Apr 14, 2024

Implementation of Voicebox, new SOTA Text-to-speech network from MetaAI, in Pytorch

Python 599 50 Updated Oct 1, 2024

TorchCFM: a Conditional Flow Matching library

Python 1,112 89 Updated Oct 9, 2024

Training code for FAcodec presented in NaturalSpeech3

Python 166 19 Updated Aug 26, 2024

Fast and memory-efficient exact attention

Python 237 20 Updated Jul 26, 2024

Inference code for Audiodec-Valle-Wenetspeech4TTS

Python 44 2 Updated Jul 14, 2024

An open source implementation of Microsoft's VALL-E X zero-shot TTS model. Demo is available in https://plachtaa.github.io/vallex/

Python 7,603 757 Updated Feb 11, 2024

This is a Python package for NISQA.

Python 4 2 Updated Apr 9, 2024

Chinese Mandarin Grapheme-to-Phoneme Converter. 中文轉注音或拼音 (INTERSPEECH 2022)

Python 279 38 Updated Jun 16, 2024

Multilingual Voice Understanding Model

Python 3,003 277 Updated Sep 25, 2024

Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.

Python 5,517 568 Updated Sep 29, 2024

Fast inference engine for Transformer models

C++ 3,308 289 Updated Oct 10, 2024

Faster Whisper ASR transcription with CTranslate2

Python 15 4 Updated Oct 14, 2024

Faster Whisper transcription with CTranslate2

Python 11,866 995 Updated Aug 21, 2024

Fast and memory-efficient exact attention

Python 13,757 1,261 Updated Oct 14, 2024

Foundational Models for State-of-the-Art Speech and Text Translation

Jupyter Notebook 10,844 1,056 Updated Aug 15, 2024

Brand new TTS solution

Python 13,325 994 Updated Oct 11, 2024
Next