Automagically synchronize subtitles with video.
-
Updated
Mar 18, 2024 - Python
Automagically synchronize subtitles with video.
A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.
Silero VAD: pre-trained enterprise-grade Voice Activity Detector
Command-line utility to transcribe/translate from video/audio/subtitles to subtitles
A python package to build AI-powered real-time audio applications
Python AI assistant 🧠
An audio/acoustic activity detection and audio segmentation tool
CNN-based audio segmentation toolkit. Allows to detect speech, music, noise and speaker gender. Has been designed for large scale gender equality studies based on speech time per gender.
Automatically synchronize and translate subtitles, or create new ones by transcribing, using pre-trained DNNs, Forced Alignments and Transformers. https://subaligner.readthedocs.io/
🗣️ A book and repo to get you started programming voice computing applications in Python (10 chapters and 200+ scripts).
Voice Activity Detection based on Deep Learning & TensorFlow
Auto transcribe tool based on whisper
On-device voice activity detection (VAD) powered by deep learning
Pytorch implementation of SELF-ATTENTIVE VAD, ICASSP 2021
Repository for our Interspeech2020 general-purpose voice activity detection (GPVAD) paper
This is the Python library for an unsupervised, fast method for robust voice activity detection (rVAD), as in the paper rVAD: An Unsupervised Segment-Based Robust Voice Activity Detection Method.
The codebase for Data-driven general-purpose voice activity detection.
A python library for voice activity detection (VAD) for speech/non-speech segmentation.
♂️♀️ Detect a person's gender from a voice file (90.7% +/- 1.3% accuracy).
A collection of basic python modules for spoken natural language processing
Add a description, image, and links to the voice-activity-detection topic page so that developers can more easily learn about it.
To associate your repository with the voice-activity-detection topic, visit your repo's landing page and select "manage topics."