-
Ohio State University
- whmrtm.github.io
Block or Report
Block or report whmrtm
Contact GitHub support about this user’s behavior. Learn more about reporting abuse.
Report abuseLanguage
Sort by: Recently starred
Starred repositories
This repository contains the trained models and some audio samples for the tPLCnet.
[ACMMM2023] "Enhancing Visibility in Nighttime Haze Images Using Guided APSF and Gradient Adaptive Convolution", https://arxiv.org/abs/2308.01738
[AAAI23] Estimating Reflectance Layer from A Single Image: Integrating Reflectance Guidance and Shadow/Specular Aware Learning, https://arxiv.org/abs/2211.14751
[ICCV2021]"DC-ShadowNet: Single-Image Hard and Soft Shadow Removal Using Unsupervised Domain-Classifier Guided Network", https://arxiv.org/abs/2207.10434
[ECCV2022] "Unsupervised Night Image Enhancement: When Layer Decomposition Meets Light-Effects Suppression", https://arxiv.org/abs/2207.10564
[ACCV22] Structure Representation Network and Uncertainty Feedback Learning for Dense Non-Uniform Fog Removal, https://arxiv.org/abs/2210.03061
Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor / tokenizer, along with MusicGen, a simple and controllable…
Specify what you want it to build, the AI asks for clarification, and then builds it.
AI Audio Datasets (AI-ADS) 🎵, including Speech, Music, and Sound Effects, which can provide training data for Generative AI, AIGC, AI model training, intelligent audio tool development, and audio a…
[NeurIPS'23 Spotlight] "Mind2Web: Towards a Generalist Agent for the Web"
StoRM: A Diffusion-based Stochastic Regeneration Model for Speech Enhancement and Dereverberation
Uformer: A Unet based dilated complex & real dual-path conformer network for simultaneous speech enhancement and dereverberation
Espresso: A Fast End-to-End Neural Speech Recognition Toolkit
🔊 A comprehensive list of open-source datasets for voice and sound computing (95+ datasets).
Augmentation adversarial training for self-supervised speaker recognition
Self-Supervised Speech Pre-training and Representation Learning Toolkit
A fully invertible U-Net for memory efficiency in Pytorch.
Official SRFlow training code: Super-Resolution using Normalizing Flow in PyTorch
A unofficial Pytorch implementation of Microsoft's PHASEN
🔉 spafe: Simplified Python Audio Features Extraction
SEGAN for bandwidth extension
🎨 ML Visuals contains figures and templates which you can reuse and customize to improve your scientific writing.
A vocoder framework which had been widely used in research community since 1999.