Starred repositories

This repository contains the trained models and some audio samples for the tPLCnet.

Python 19 2 Updated Sep 26, 2023
Python 44 3 Updated Sep 23, 2023

[ACMMM2023] "Enhancing Visibility in Nighttime Haze Images Using Guided APSF and Gradient Adaptive Convolution", https://arxiv.org/abs/2308.01738

MATLAB 141 10 Updated Jun 2, 2024

[AAAI23] Estimating Reflectance Layer from A Single Image: Integrating Reflectance Guidance and Shadow/Specular Aware Learning, https://arxiv.org/abs/2211.14751

HTML 87 1 Updated Jan 7, 2024

[ICCV2021] "DC-ShadowNet: Single-Image Hard and Soft Shadow Removal Using Unsupervised Domain-Classifier Guided Network", https://arxiv.org/abs/2207.10434

Python 204 19 Updated Jun 25, 2024

[ECCV2022] "Unsupervised Night Image Enhancement: When Layer Decomposition Meets Light-Effects Suppression", https://arxiv.org/abs/2207.10564

HTML 376 31 Updated Mar 2, 2024

[ACCV22] Structure Representation Network and Uncertainty Feedback Learning for Dense Non-Uniform Fog Removal, https://arxiv.org/abs/2210.03061

Python 141 6 Updated Feb 14, 2024
Python 27 2 Updated Feb 24, 2023
Python 8 1 Updated Apr 5, 2024

Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor / tokenizer, along with MusicGen, a simple and controllable…

Python 20,197 2,022 Updated Jun 19, 2024

Specify what you want it to build, the AI asks for clarification, and then builds it.

Python 51,375 6,681 Updated Jul 7, 2024

AI Audio Datasets (AI-ADS) 🎵, including Speech, Music, and Sound Effects, which can provide training data for Generative AI, AIGC, AI model training, intelligent audio tool development, and audio a…

381 31 Updated Jul 10, 2024

[NeurIPS'23 Spotlight] "Mind2Web: Towards a Generalist Agent for the Web"

Jupyter Notebook 616 86 Updated Apr 29, 2024

StoRM: A Diffusion-based Stochastic Regeneration Model for Speech Enhancement and Dereverberation

Python 152 19 Updated Apr 17, 2024

Uformer: A Unet based dilated complex & real dual-path conformer network for simultaneous speech enhancement and dereverberation

Python 92 14 Updated Jun 29, 2022

Espresso: A Fast End-to-End Neural Speech Recognition Toolkit

Python 940 116 Updated Jul 24, 2023

🔊 A comprehensive list of open-source datasets for voice and sound computing (95+ datasets).

1,612 221 Updated Jun 6, 2024

Augmentation adversarial training for self-supervised speaker recognition

Python 76 10 Updated Aug 15, 2021

Self-Supervised Speech Pre-training and Representation Learning Toolkit

Python 2,156 478 Updated Jun 18, 2024

A fully invertible U-Net for memory efficiency in PyTorch.

Python 120 16 Updated Jul 9, 2022

Official SRFlow training code: Super-Resolution using Normalizing Flow in PyTorch

Jupyter Notebook 824 112 Updated Dec 8, 2022

An unofficial PyTorch implementation of Microsoft's PHASEN

Python 217 49 Updated Apr 10, 2024

🔉 spafe: Simplified Python Audio Features Extraction

Python 442 76 Updated Jun 19, 2024

SEGAN for bandwidth extension

Jupyter Notebook 15 3 Updated Jun 6, 2019
Python 148 31 Updated Dec 20, 2023

A PyTorch Implementation of ClariNet

Python 288 66 Updated Aug 5, 2019

🎨 ML Visuals contains figures and templates which you can reuse and customize to improve your scientific writing.

12,776 1,344 Updated Feb 13, 2023

A vocoder framework which has been widely used in the research community since 1999.

MATLAB 173 41 Updated Dec 24, 2018
Jupyter Notebook 6 4 Updated Sep 21, 2018