Block or Report
Block or report whinton
Contact GitHub support about this user’s behavior. Learn more about reporting abuse.
Report abuseLanguage: Python
Sort by: Most stars
Starred repositories
Command-line program to download videos from YouTube.com and other video sites
Clone a voice in 5 seconds to generate arbitrary speech in real-time
DeepFaceLab is the leading software for creating deepfakes.
Minimal examples of data structures and algorithms in Python
Jupyter metapackage for installation, docs and chat
Python GUIs for Humans! PySimpleGUI is the top-rated Python application development environment. Launched in 2018 and actively developed, maintained, and supported in 2024. Transforms tkinter, Qt, …
A lightweight, dependency-free Python library (and command-line utility) for downloading YouTube Videos.
Python bindings for FFmpeg - with complex filtering support
A cross platform front-end GUI of the popular youtube-dl written in wxPython.
Manipulate audio with a simple and easy high level interface
Speech recognition module for Python, supporting several engines and APIs, online and offline.
Python Audio Analysis Library: Feature Extraction, Classification, Segmentation and Applications
A python package to analyze and compare voices with deep learning
This library provides common speech features for ASR including MFCCs and filterbank energies.
Starter code for working with the YouTube-8M dataset.
This is the library for the Unbounded Interleaved-State Recurrent Neural Network (UIS-RNN) algorithm, corresponding to the paper Fully Supervised Speaker Diarization.
SincNet is a neural architecture for efficiently processing raw audio samples.
Implementation of the Wave-U-Net for audio source separation
Speech Enhancement Generative Adversarial Network in TensorFlow
🔈 Deep Learning & 3D Convolutional Neural Networks for Speaker Verification
A cross-platform GUI wrapper for yt-dlp written in PySide6
Daily Fantasy Sports lineup optimzer for all popular daily fantasy sports sites
WIP: Open Source Implementation of "MelNet: A Generative Model for Audio in the Frequency Domain"
Speech Recognition with Python examples
Simple d-vector based Speaker Recognition (verification and identification) using Pytorch
Speaker Identification System (upto 100% accuracy); built using Python 2.7 and python_speech_features library
Speaker diarization scripts, based on AaltoASR