Skip to content
View nshmyrev's full-sized avatar

Block or report nshmyrev

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

Showing results

BRSpeech: A Portuguese Dataset for Speech Synthesis

CSS 6 Updated Aug 20, 2024

A implementation of Power Normalized Cepstral Coefficients: PNCC

Python 50 10 Updated Aug 11, 2019

Web app, command-line interface and Python library for synthesizing Chinese texts into speech.

Python 7 1 Updated Apr 24, 2024

Automatically setup the AISHELL-4 and MSDWild dataset for usage with pyannote-database (and pyannote-audio)

Python 2 Updated Sep 24, 2024

Code for the paper "Toward Fully Self-Supervised Multi-Pitch Estimation".

Python 14 Updated Sep 25, 2024
Python 4 1 Updated Sep 25, 2024
Python 13 1 Updated Jul 15, 2024

A high-throughput and memory-efficient inference and serving engine for Whisper, https://mesolitica.com/blog/vllm-whisper

Python 10 4 Updated Jul 28, 2024

[IEEE/ACM-TASLP 2024] Controllable Accented Text-to-Speech Synthesis with Fine and Coarse-Grained Intensity Rendering

HTML 2 1 Updated Sep 24, 2024

[Neural Networks'2021] FastTalker: A neural text-to-speech architecture with shallow and group autoregression

HTML 2 1 Updated Sep 24, 2024

[ICASSP'2020] Teacher-Student Training for Robust Tacotron-based TTS

HTML 1 1 Updated Sep 24, 2024

Lightweight wrapper for Silero VAD using internal ONNX Runtime and with no python package dependencies

Python 4 Updated Sep 27, 2024
Python 4 2 Updated Oct 14, 2023

Diffusion-based singing voice pitch correction

Python 87 14 Updated Sep 20, 2024

A simple FastAPI Server to run XTTSv2

Python 366 84 Updated Jul 21, 2024
Python 67 3 Updated Sep 24, 2024

The Official Code Repo of SafeEar (Accepted by CCS 2024)

Python 14 3 Updated Sep 24, 2024

PyTorch implementation of WaveFit [2022, Google] which is one of SOTA lightweight/fast speech vocoders.

Python 40 1 Updated Sep 23, 2024

Stutter-Solver: End-to-end Cross-lingual Dysfluency Detection

Jupyter Notebook 5 Updated Jul 20, 2024

StreamHiFiGAN offers a HiFiGAN vocoder model optimized for streaming inference, providing real-time audio synthesis capabilities.

Python 2 Updated Jun 28, 2024

Using Pre-trained SSL Transformer Models for Speaker Verification

Python 4 Updated Sep 22, 2024

The source code for the Interspeech 2024 paper "Lightweight Transducer Based on Frame Level Criterion".

Python 7 1 Updated Sep 23, 2024

Elementary is a JavaScript library for digital audio signal processing.

C 322 29 Updated Jul 29, 2024

SAMO: SPEAKER ATTRACTOR MULTI-CENTER ONE-CLASS LEARNING FOR VOICE ANTI-SPOOFING

Python 33 8 Updated Apr 5, 2023
Python 18 Updated Jul 1, 2024
22 Updated Sep 14, 2024

This is a general framework for fake audio detection using pytorch lightning

Python 9 Updated Sep 11, 2024

MTDA-HSED: Mutual-Assistance Tuning and Dual-Branch Aggregating for Heterogeneous Sound Event Detection

6 Updated Sep 24, 2024
Next