Skip to content
View Viceaa's full-sized avatar

Block or report Viceaa

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

NU-Wave: A Diffusion Probabilistic Model for Neural Audio Upsampling @ INTERSPEECH 2021

Python 280 20 Updated Jul 22, 2022

CnSTD: 基于 PyTorch/MXNet 的 中文/英文 场景文字检测(Scene Text Detection)、数学公式检测(Mathematical Formula Detection, MFD)、篇章分析(Layout Analysis)的Python3 包

Python 668 104 Updated Jun 22, 2024

类似按键精灵的鼠标键盘录制和自动化操作 模拟点击和键入 | automate mouse clicks and keyboard input

Python 6,859 1,000 Updated Aug 31, 2024

[.NET] m3u8 downloader 开源的命令行m3u8/HLS/dash下载器,支持普通AES-128-CBC解密,多线程,自定义请求头等. 支持简体中文,繁体中文和英文. English Supported.

C# 14,081 2,134 Updated Jun 3, 2023

A pytorch implementation of the paper "3D-R2N2: A Unified Approach for Single and Multi-view 3D Object Reconstruction" by Choy et al.

Python 23 11 Updated Oct 4, 2023

Single/multi view image(s) to voxel reconstruction using a recurrent neural network

Python 1,346 293 Updated Jul 16, 2021

通过可逆跳跃马尔科夫链蒙特卡洛方法实现一维大地电磁反演

MATLAB 11 1 Updated May 24, 2022

A wrapper around speech quality metrics MOSNet, BSSEval, STOI, PESQ, SRMR, SISDR

Python 887 152 Updated Jul 5, 2023

High-Resolution Image Synthesis with Latent Diffusion Models

Python 22 4 Updated Sep 19, 2023

This research project aims at studying and finding a suitable method to implement audio bandwidth extension to bandlimited audio files.

MATLAB 22 6 Updated Jan 24, 2018

Denoising Diffusion Probabilistic Models

Python 3,598 359 Updated Aug 29, 2023

A fast, high-quality neural vocoder.

Python 270 45 Updated Jul 18, 2023

TFGAN: Time and Frequency Domain Based Generative Adversarial Network for High-fidelity Speech Synthesis

Python 84 19 Updated Feb 23, 2021

iSTFTNet : Fast and Lightweight Mel-spectrogram Vocoder Incorporating Inverse Short-time Fourier Transform

Python 217 47 Updated Mar 14, 2023

HiFi++: a Unified Framework for Neural Vocoding, Bandwidth Extension and Speech Enhancement

Python 151 19 Updated Jul 16, 2022

General Speech Restoration

Python 273 54 Updated Jan 13, 2024

General Speech Restoration

Python 985 129 Updated May 31, 2024
Python 131 15 Updated Jan 9, 2023

Dual-path RNN: efficient long sequence modeling for time-domain single-channel speech separation implemented by Pytorch

Python 403 67 Updated Feb 14, 2023

《Pytorch模型训练实用教程》中配套代码

Python 7,420 1,733 Updated Jul 30, 2024

总结梳理自然语言处理工程师(NLP)需要积累的各方面知识,包括面试题,各种基础知识,工程能力等等,提升核心竞争力

Python 6,686 1,160 Updated Aug 24, 2022

Unofficial PyTorch implementation of Google AI's VoiceFilter system

Python 1,070 225 Updated Jul 25, 2024
MATLAB 2 1 Updated Jun 22, 2015

This is an official implementation of our NeurIPS 2021 paper "HRFormer: High-Resolution Transformer for Dense Prediction".

Python 1 Updated Dec 29, 2021

Unofficial reimplementation of ECAPA-TDNN for speaker recognition (EER=0.86 for Vox1_O when train only in Vox2)

Python 579 111 Updated Apr 11, 2024

In defence of metric learning for speaker recognition

Python 1,021 272 Updated Mar 26, 2024

kaldi-asr/kaldi is the official location of the Kaldi project.

Shell 14,074 5,307 Updated Aug 2, 2024

Speaker embedding(verification and recognition) using Pytorch

Python 363 100 Updated Jul 24, 2020

Keras implementation of ‘’Deep Speaker: an End-to-End Neural Speaker Embedding System‘’ (speaker recognition)

Python 246 81 Updated Apr 27, 2020

Generating room impulse responses

C++ 1 1 Updated Oct 22, 2020
Next