Stars
This is the official implementation of the SEMamba paper. (Accepted to IEEE SLT 2024)
Automatic headphone equalization from frequency responses
PyTorch Lightning + Hydra. A very user-friendly template for ML experimentation. ⚡🔥⚡
Speech To Speech: an effort for an open-sourced and modular GPT4-o
Transformer with Local Modeling by Convolution for Speech Separation and Enhancement
Instant voice cloning by MIT and MyShell.
Production First and Production Ready End-to-End Text-to-Speech Toolkit
This repository contains some material of speech enhancement and dereverberation. On the one hand, I summarize this work for my further understanding. On the other hand, I hope that all beginners o…
Different implementations of "Weighted Prediction Error" for speech dereverberation
Implementation of Voicebox, new SOTA Text-to-speech network from MetaAI, in Pytorch
Non-Uniform FFT on the CPU and GPU (1D, 2D and 3D)
[Preprint] Why is the State of Neural Network Pruning so Confusing? On the Fairness, Comparison Setup, and Trainability in Network Pruning
[ICLR'23] Trainability Preserving Neural Pruning (PyTorch)
The official Pytorch implementation of "Frame-wise streaming end-to-end speaker diarization with non-autoregressive self-attention-based attractors". [ICASSP 2024]
Active noise cancellation using various algorithms (FxLMS, FuLMS, NLMS) in Matlab, VST and C
AudioLDM training, finetuning, evaluation and inference.
Pytorch implementation of subband decomposition
Official code for MUSE: Flexible Voiceprint Receptive Fields and Multi-Path Fusion Enhanced Taylor Transformer for U-Net-based Speech Enhancement
[ICASSP 2024] 🍵 Matcha-TTS: A fast TTS architecture with conditional flow matching
Code for Adam-mini: Use Fewer Learning Rates To Gain More https://arxiv.org/abs/2406.16793
Multi-lingual large voice generation model, providing full-stack inference, training and deployment capability.
A Real-Recorded and Annotated Microphone Array Dataset for Dynamic Speech Enhancement and Localization
SpeechGPT Series: Speech Large Language Models