nshmyrev

Nickolay V. Shmyrev nshmyrev

595 followers · 772 following

Alpha Cephei Inc
Astrakhan, Russia
https://alphacephei.com

Starred repositories

freds0 / BRSpeech-Dataset

BRSpeech: A Portuguese Dataset for Speech Synthesis

CSS 6 Updated Aug 20, 2024

supikiti / PNCC

An implementation of Power Normalized Cepstral Coefficients (PNCC)

Python 50 10 Updated Aug 11, 2019

stefantaubert / zh-tts

Web app, command-line interface and Python library for synthesizing Chinese texts into speech.

Python 7 1 Updated Apr 24, 2024

pengzhendong / datasets-pyannote

Forked from FrenchKrab/datasets-pyannote

Automatically set up the AISHELL-4 and MSDWild datasets for use with pyannote-database (and pyannote-audio)

Python 2 Updated Sep 24, 2024

cwitkowitz / ss-mpe

Code for the paper "Toward Fully Self-Supervised Multi-Pitch Estimation".

Python 14 Updated Sep 25, 2024

freds0 / CleanSpecNet

Python 4 1 Updated Sep 25, 2024

Yip-Jia-Qi / codecformer

Python 13 1 Updated Jul 15, 2024

mesolitica / vllm-whisper

Forked from vllm-project/vllm

A high-throughput and memory-efficient inference and serving engine for Whisper, https://mesolitica.com/blog/vllm-whisper

Python 10 4 Updated Jul 28, 2024

ttslr / CTA-TTS

[IEEE/ACM-TASLP 2024] Controllable Accented Text-to-Speech Synthesis with Fine and Coarse-Grained Intensity Rendering

HTML 2 1 Updated Sep 24, 2024

ttslr / FastTalker

[Neural Networks'2021] FastTalker: A neural text-to-speech architecture with shallow and group autoregression

HTML 2 1 Updated Sep 24, 2024

ttslr / ICASSP2020

[ICASSP'2020] Teacher-Student Training for Robust Tacotron-based TTS

HTML 1 1 Updated Sep 24, 2024

daanzu / py-silero-vad-lite

Lightweight wrapper for Silero VAD using an internal ONNX Runtime, with no Python package dependencies

Python 4 Updated Sep 27, 2024

ljuvela / SourceFilterNeuralFormants

Python 9 5 Updated Sep 20, 2024

ljuvela / GlotNet

Python 4 2 Updated Oct 14, 2023

haidog-yaqub / DiffPitcher

Diffusion-based singing voice pitch correction

Python 87 14 Updated Sep 20, 2024

daswer123 / xtts-api-server

A simple FastAPI Server to run XTTSv2

Python 366 84 Updated Jul 21, 2024

xinchen-ai / Westlake-Omni

Python 67 3 Updated Sep 24, 2024

LetterLiGo / SafeEar

The Official Code Repo of SafeEar (Accepted by CCS 2024)

Python 14 3 Updated Sep 24, 2024

yukara-ikemiya / wavefit-pytorch

PyTorch implementation of WaveFit [2022, Google], one of the SOTA lightweight, fast speech vocoders.

Python 40 1 Updated Sep 23, 2024

eureka235 / Stutter-Solver

Stutter-Solver: End-to-end Cross-lingual Dysfluency Detection

Jupyter Notebook 5 Updated Jul 20, 2024

splinter21 / inferStreamHiFiGAN

StreamHiFiGAN offers a HiFiGAN vocoder model optimized for streaming inference, providing real-time audio synthesis capabilities.

Python 2 Updated Jun 28, 2024

BUTSpeechFIT / wespeaker_ssl_public

Using Pre-trained SSL Transformer Models for Speaker Verification

Python 4 Updated Sep 22, 2024

wangmengzhi / Lightweight-Transducer

The source code for the Interspeech 2024 paper "Lightweight Transducer Based on Frame Level Criterion".

Python 7 1 Updated Sep 23, 2024

elemaudio / elementary

Elementary is a JavaScript library for digital audio signal processing.

C 322 29 Updated Jul 29, 2024

sivannavis / samo

SAMO: Speaker Attractor Multi-Center One-Class Learning for Voice Anti-Spoofing

Python 33 8 Updated Apr 5, 2023

Tonyyouyou / Mamba-in-Speech

Python 18 Updated Jul 1, 2024

xuyaoxun / MuCodec

22 Updated Sep 14, 2024

vocaliodmiku / wav2vec2mdd-Text

Python 16 6 Updated Jun 28, 2022

john852517791 / pytorch_lightning_FAD

A general framework for fake audio detection using PyTorch Lightning

Python 9 Updated Sep 11, 2024

Visitor-W / MTDA

MTDA-HSED: Mutual-Assistance Tuning and Dual-Branch Aggregating for Heterogeneous Sound Event Detection

6 Updated Sep 24, 2024

Telegram

speech-to-text

speech-recognition

stt