Skip to content
View AaZz101's full-sized avatar
🎯
Focusing
🎯
Focusing

Block or report AaZz101

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Beta Lists are currently in beta. Share feedback and report bugs.
Showing results
Python 453 39 Updated Jun 7, 2024

This is the official implementation of the SEMamba paper. (Accepted to IEEE SLT 2024)

Python 114 10 Updated Sep 9, 2024

Automatic headphone equalization from frequency responses

Python 13,118 2,467 Updated Jul 27, 2024

PyTorch Lightning + Hydra. A very user-friendly template for ML experimentation. ⚡🔥⚡

Python 4,076 633 Updated Aug 16, 2024

Speech To Speech: an effort for an open-sourced and modular GPT4-o

Python 2,935 311 Updated Sep 9, 2024

Transformer with Local Modeling by Convolution for Speech Separation and Enhancement

Python 23 4 Updated Aug 1, 2024
Python 37 5 Updated Aug 23, 2024

Instant voice cloning by MIT and MyShell.

Python 28,301 2,773 Updated Aug 21, 2024

Production First and Production Ready End-to-End Text-to-Speech Toolkit

Python 367 55 Updated May 30, 2024

This repository contains some material of speech enhancement and dereverberation. On the one hand, I summarize this work for my further understanding. On the other hand, I hope that all beginners o…

40 12 Updated Jul 6, 2020

Different implementations of "Weighted Prediction Error" for speech dereverberation

Python 473 165 Updated Jun 18, 2024

Implementation of Voicebox, new SOTA Text-to-speech network from MetaAI, in Pytorch

Python 589 49 Updated Feb 16, 2024

Non-Uniform FFT on the CPU and GPU (1D, 2D and 3D)

Python 14 5 Updated Jan 13, 2021

[Preprint] Why is the State of Neural Network Pruning so Confusing? On the Fairness, Comparison Setup, and Trainability in Network Pruning

40 1 Updated Feb 17, 2023

[ICLR'23] Trainability Preserving Neural Pruning (PyTorch)

Python 29 2 Updated May 21, 2023

论文写作与资料分享

2,267 543 Updated Aug 7, 2022
Python 39 2 Updated Jul 29, 2024

The official Pytorch implementation of "Frame-wise streaming end-to-end speaker diarization with non-autoregressive self-attention-based attractors". [ICASSP 2024]

Python 75 4 Updated Jan 24, 2024

List of speech synthesis papers.

991 120 Updated Jul 24, 2023

Active noise cancellation using various algorithms (FxLMS, FuLMS, NLMS) in Matlab, VST and C

MATLAB 327 101 Updated Apr 25, 2022

AudioLDM training, finetuning, evaluation and inference.

Python 187 37 Updated Jun 2, 2024

Pytorch implementation of subband decomposition

HTML 86 13 Updated Jul 26, 2022

Official code for MUSE: Flexible Voiceprint Receptive Fields and Multi-Path Fusion Enhanced Taylor Transformer for U-Net-based Speech Enhancemen

Python 25 4 Updated Jul 21, 2024

[ICASSP 2024] 🍵 Matcha-TTS: A fast TTS architecture with conditional flow matching

Jupyter Notebook 596 75 Updated Sep 2, 2024

Code for Adam-mini: Use Fewer Learning Rates To Gain More https://arxiv.org/abs/2406.16793

Python 280 9 Updated Sep 4, 2024

Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.

Python 4,604 462 Updated Sep 6, 2024

A Real-Recorded and Annotated Microphone Array Dataset for Dynamic Speech Enhancement and Localization

Python 61 6 Updated Sep 2, 2024

SpeechGPT Series: Speech Large Language Models

Python 1,208 79 Updated Jul 22, 2024
Next