Skip to content
View janvainer's full-sized avatar
  • Pilsen
  • 18:32 (UTC -12:00)

Block or report janvainer

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

A production-ready implementation of WaveRNN-based autoregressive waveform synthesis.

Python 7 Updated Oct 24, 2021

Multilingual G2P in 100 languages

Jupyter Notebook 286 25 Updated May 26, 2023

Noise reduction in python using spectral gating (speech, bioacoustics, audio, time-domain signals)

Jupyter Notebook 1,471 231 Updated Oct 9, 2024

A Data Streaming Library for Efficient Neural Network Training

Python 1,141 142 Updated Nov 15, 2024

Typed command line interfaces with argparse and pydantic

Python 40 4 Updated Oct 13, 2024
Python 1 Updated Feb 6, 2023

Reference implementation of the ChordPro standard for musical lead sheets.

Perl 324 51 Updated Nov 16, 2024
Python 1 Updated Jan 6, 2022

An implementation of Tacotron 2 that supports multilingual experiments with parameter-sharing, code-switching, and voice cloning.

Python 830 157 Updated Oct 10, 2023
JavaScript 1 Updated Apr 20, 2021

Simple package for binding functions to CLI or config files.

Python 43 4 Updated Aug 11, 2024

Learn make by example

SCSS 4,935 251 Updated Jun 24, 2024

Type annotations and dynamic checking for a tensor's shape, dtype, names, etc.

Python 1,406 34 Updated Aug 1, 2024

Speaker embedding (d-vector) trained with GE2E loss

Python 273 47 Updated Jan 8, 2024

Helpers for running native Python functions as qsub jobs on the CLSP grid

Python 5 2 Updated Sep 16, 2021

Real Time Speech Enhancement in the Waveform Domain (Interspeech 2020)We provide a PyTorch implementation of the paper Real Time Speech Enhancement in the Waveform Domain. In which, we present a ca…

Python 1,684 302 Updated Mar 14, 2023

Haste: a fast, simple, and open RNN library

C++ 325 27 Updated Jul 18, 2023

MADGRAD Optimization Method

Python 802 57 Updated Apr 11, 2023

🐸STT - The deep learning toolkit for Speech-to-Text. Training and deploying STT models has never been so easy.

C++ 2,283 278 Updated Mar 11, 2024

GAN-based Mel-Spectrogram Inversion Network for Text-to-Speech Synthesis

Python 977 214 Updated Aug 28, 2023

[Prototype] Tools for the concurrent manipulation of variably sized Tensors.

Jupyter Notebook 253 28 Updated Nov 14, 2022

Implementation of the AlignTTS

Jupyter Notebook 76 12 Updated Jul 6, 2023

OpenShot Video Editor is an award-winning free and open-source video editor for Linux, Mac, and Windows, and is dedicated to delivering high quality video editing and animation solutions to the world.

Python 4,363 545 Updated Oct 12, 2024

List of speech synthesis papers.

2 Updated Aug 17, 2020

Accompanying code for our paper "Optimizing Short-Time Fourier Transform Parameters via Gradient Descent"

Jupyter Notebook 31 1 Updated Oct 30, 2020

Flowtron is an auto-regressive flow-based generative network for text to speech synthesis with control over speech variation and style transfer

Jupyter Notebook 890 177 Updated Jul 6, 2023

Train/test a variety of open source vocoders using the same input features and dataset. Then infer together for easy side-by-side comparisons.

Python 6 1 Updated Nov 2, 2020

Python implementation of soft-DTW.

Python 547 98 Updated Jun 19, 2024

Fast CUDA implementation of (differentiable) soft dynamic time warping for PyTorch

Python 629 59 Updated Apr 3, 2024

Fast audio data augmentation in PyTorch. Inspired by audiomentations. Useful for deep learning.

Python 960 88 Updated Nov 8, 2024
Next