Automagically synchronize subtitles with video.
Updated Oct 16, 2024 - Python
CNN-based audio segmentation toolkit. Detects speech, music, noise, and speaker gender. Designed for large-scale gender equality studies based on speech time per gender.
Voice Activity Detection based on Deep Learning & TensorFlow
Cross-platform, real-time, offline speech recognition plugin for Unreal Engine. Based on OpenAI's Whisper technology via whisper.cpp.
Android Voice Activity Detection (VAD) library. Supports WebRTC VAD GMM, Silero VAD DNN, Yamnet VAD DNN models.
Synchronize your subtitles using machine learning
EduSense: Practical Classroom Sensing at Scale
A complete speech segmentation system using Kaldi and x-vectors for voice activity detection (VAD) and speaker diarisation.
Voice activity detection (VAD) library for speech-end detection, based on WebRTC's VAD engine
Identifying individual speakers in an audio stream based on the unique characteristics found in individual voices using Python
Speech Detection 💬
PocketPiglet for iOS
VadRecorder, based on WebRTC's VAD engine and the vo-aac encoder, records valid speech and discards silence/noise data.
PocketPiglet for Android
A mobile application that shows you what you say and the objects around you.
This repository contains scripts for activities covering various deep learning concepts.
iOS Voice Activity Detection (VAD). Supports WebRTC VAD GMM, Silero VAD DNN, Yamnet VAD DNN models.