dizzyname

dizzyname

1 follower · 2 following

Starred repositories

vincentherrmann / pytorch-wavenet

An implementation of WaveNet with fast generation

Jupyter Notebook 975 228 Updated Sep 17, 2020

NVIDIA / waveglow

A Flow-based Generative Network for Speech Synthesis

Python 2,283 530 Updated Oct 19, 2023

NVIDIA / tacotron2

Tacotron 2 - PyTorch implementation with faster-than-realtime inference

Jupyter Notebook 5,094 1,385 Updated Jun 12, 2024

NVIDIA / DeepLearningExamples

State-of-the-Art Deep Learning scripts organized by models - easy to train and deploy with reproducible accuracy and performance on enterprise-grade infrastructure.

Jupyter Notebook 13,543 3,229 Updated Aug 12, 2024

facebookresearch / DeeperCluster

Implements the unsupervised pre-training of convolutional neural networks

Python 249 33 Updated Sep 16, 2021

tyiannak / pyAudioAnalysis

Python Audio Analysis Library: Feature Extraction, Classification, Segmentation and Applications

Python 5,878 1,196 Updated Mar 31, 2024

CSAILVision / places365

The Places365-CNNs for Scene Classification

Python 1,924 536 Updated Jul 16, 2020

pannous / tensorflow-speech-recognition

🎙Speech recognition using the tensorflow deep learning framework, sequence-to-sequence neural networks

Python 2,166 639 Updated Jan 17, 2024

mozilla / DeepSpeech

DeepSpeech is an open source embedded (offline, on-device) speech-to-text engine which can run in real time on devices ranging from a Raspberry Pi 4 to high power GPU servers.

C++ 25,334 3,966 Updated Sep 3, 2024

taylorlu / Speaker-Diarization

speaker diarization by uis-rnn and speaker embedding by vgg-speaker-recognition

Python 469 121 Updated Jul 1, 2021

aalto-speech / speaker-diarization

Speaker diarization scripts, based on AaltoASR

Python 190 37 Updated Jan 3, 2019

HarryVolek / PyTorch_Speaker_Verification

PyTorch implementation of "Generalized End-to-End Loss for Speaker Verification" by Wan, Li et al.

Python 577 164 Updated Jan 20, 2022

Janghyun1230 / Speaker_Verification

Tensorflow implementation of "Generalized End-to-End Loss for Speaker Verification"

Python 359 102 Updated Oct 9, 2021

wq2012 / awesome-diarization

A curated list of awesome Speaker Diarization papers, libraries, datasets, and other resources.

1,616 226 Updated Oct 16, 2024

google / uis-rnn

This is the library for the Unbounded Interleaved-State Recurrent Neural Network (UIS-RNN) algorithm, corresponding to the paper Fully Supervised Speaker Diarization.

Python 1,558 319 Updated Sep 25, 2024

pyannote / pyannote-audio

Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding

Jupyter Notebook 6,302 777 Updated Nov 11, 2024

zzw922cn / Automatic_Speech_Recognition

End-to-end Automatic Speech Recognition for Madarian and English in Tensorflow

Python 2,843 537 Updated Mar 24, 2023

mammothb / symspellpy

Python port of SymSpell: 1 million times faster spelling correction & fuzzy search through Symmetric Delete spelling correction algorithm

Python 800 122 Updated Nov 8, 2024

AxelAli / Tensorflow-Image-Classification

Easy/Updated Tensorflow Image Classification

Python 42 14 Updated Jul 17, 2017

vinta / awesome-python

An opinionated list of awesome Python frameworks, libraries, software and resources.

Python 224,333 24,903 Updated Aug 11, 2024

Kyubyong / tacotron

A TensorFlow Implementation of Tacotron: A Fully End-to-End Text-To-Speech Synthesis Model

Python 1,828 436 Updated Jan 17, 2022

Provide feedback

Saved searches

Use saved searches to filter your results more quickly