Skip to content
View dizzyname's full-sized avatar

Block or report dizzyname

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

Showing results

An implementation of WaveNet with fast generation

Jupyter Notebook 975 228 Updated Sep 17, 2020

A Flow-based Generative Network for Speech Synthesis

Python 2,283 530 Updated Oct 19, 2023

Tacotron 2 - PyTorch implementation with faster-than-realtime inference

Jupyter Notebook 5,094 1,385 Updated Jun 12, 2024

State-of-the-Art Deep Learning scripts organized by models - easy to train and deploy with reproducible accuracy and performance on enterprise-grade infrastructure.

Jupyter Notebook 13,543 3,229 Updated Aug 12, 2024

Implements the unsupervised pre-training of convolutional neural networks

Python 249 33 Updated Sep 16, 2021

Python Audio Analysis Library: Feature Extraction, Classification, Segmentation and Applications

Python 5,878 1,196 Updated Mar 31, 2024

The Places365-CNNs for Scene Classification

Python 1,924 536 Updated Jul 16, 2020

🎙Speech recognition using the tensorflow deep learning framework, sequence-to-sequence neural networks

Python 2,166 639 Updated Jan 17, 2024

DeepSpeech is an open source embedded (offline, on-device) speech-to-text engine which can run in real time on devices ranging from a Raspberry Pi 4 to high power GPU servers.

C++ 25,334 3,966 Updated Sep 3, 2024

speaker diarization by uis-rnn and speaker embedding by vgg-speaker-recognition

Python 469 121 Updated Jul 1, 2021

Speaker diarization scripts, based on AaltoASR

Python 190 37 Updated Jan 3, 2019

PyTorch implementation of "Generalized End-to-End Loss for Speaker Verification" by Wan, Li et al.

Python 577 164 Updated Jan 20, 2022

Tensorflow implementation of "Generalized End-to-End Loss for Speaker Verification"

Python 359 102 Updated Oct 9, 2021

A curated list of awesome Speaker Diarization papers, libraries, datasets, and other resources.

1,616 226 Updated Oct 16, 2024

This is the library for the Unbounded Interleaved-State Recurrent Neural Network (UIS-RNN) algorithm, corresponding to the paper Fully Supervised Speaker Diarization.

Python 1,558 319 Updated Sep 25, 2024

Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding

Jupyter Notebook 6,302 777 Updated Nov 11, 2024

End-to-end Automatic Speech Recognition for Madarian and English in Tensorflow

Python 2,843 537 Updated Mar 24, 2023

Python port of SymSpell: 1 million times faster spelling correction & fuzzy search through Symmetric Delete spelling correction algorithm

Python 800 122 Updated Nov 8, 2024

Easy/Updated Tensorflow Image Classification

Python 42 14 Updated Jul 17, 2017

An opinionated list of awesome Python frameworks, libraries, software and resources.

Python 224,333 24,903 Updated Aug 11, 2024

A TensorFlow Implementation of Tacotron: A Fully End-to-End Text-To-Speech Synthesis Model

Python 1,828 436 Updated Jan 17, 2022