Skip to content
View RoyChao19477's full-sized avatar

Block or report RoyChao19477

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Explicit Estimation of Magnitude and Phase Spectra in Parallel for High-Quality Speech Enhancement

Python 310 45 Updated Oct 28, 2024

Efficient Python library for Extended LSTM with exponential gating, memory mixing, and matrix memory for superior sequence modeling.

Python 258 21 Updated Jun 28, 2024

Mamba SSM architecture

Python 13,090 1,113 Updated Nov 5, 2024

MLX: An array framework for Apple silicon

C++ 17,031 986 Updated Nov 5, 2024

[ICCV 2023] DETRs with Collaborative Hybrid Assignments Training

Python 1,017 113 Updated Nov 5, 2024

Inference code for Llama models

Python 56,294 9,560 Updated Aug 18, 2024

video anomaly detection

Python 78 12 Updated Sep 21, 2022

Starter code for working with the YouTube-8M dataset.

Python 2,321 849 Updated Oct 25, 2021

The largest collection of PyTorch image encoders / backbones. Including train, eval, inference, export scripts, and pretrained weights -- ResNet, ResNeXT, EfficientNet, NFNet, Vision Transformer (V…

Python 32,135 4,756 Updated Nov 5, 2024

PESQ (Perceptual Evaluation of Speech Quality) Wrapper for Python Users (narrow band and wide band)

C 535 98 Updated Sep 5, 2024

Implementation for paper: Multi-Metric Optimization using Generative Adversarial Networks for Near-End Speech Intelligibility Enhancement

Python 20 2 Updated Sep 21, 2021

Support for Clarity Enhancement and Prediction Challenges (obsolete - see README)

Python 46 10 Updated Apr 14, 2022

Clarity Challenge toolkit - software for building Clarity Challenge systems

Python 128 54 Updated Nov 5, 2024

STOI loss function in PyTorch

Python 87 20 Updated Sep 30, 2024

NISQA - Non-Intrusive Speech Quality and TTS Naturalness Assessment

Python 678 117 Updated Mar 8, 2024

VAD using FCN neural network

Python 1 Updated Jun 8, 2022
Python 1 Updated Jul 13, 2022

For DL based SE starters

2 Updated May 25, 2022

[Unofficial] PyTorch implementation of "Conformer: Convolution-augmented Transformer for Speech Recognition" (INTERSPEECH 2020)

Python 955 175 Updated Dec 22, 2023

In defence of metric learning for speaker recognition

Python 1,051 272 Updated Mar 26, 2024

[CVPR 2022--Oral] Restormer: Efficient Transformer for High-Resolution Image Restoration. SOTA for motion deblurring, image deraining, denoising (Gaussian/real data), and defocus deblurring.

Python 1,782 238 Updated Aug 16, 2024

A time-domain extension to "Perceptual Contrast Stretching on Target Feature for Speech Enhancement"

Python 6 2 Updated May 5, 2022

Spectral Normalization for Keras Dense and Convolution Layers

Jupyter Notebook 122 34 Updated Dec 28, 2019

MetricGAN: Generative Adversarial Networks based Black-box Metric Scores Optimization for Speech Enhancement (ICML 2019, with Travel awards)

MATLAB 133 34 Updated Apr 19, 2021

End-to-end waveform utterance enhancement for direct evaluation metrics optimization by fully convolutional neural networks (TASLP 2018)

Python 18 10 Updated Jul 12, 2019
Python 5 4 Updated Mar 6, 2020
Next