voice-activity-detection

Here are 57 public repositories matching this topic...

snakers4 / silero-vad

Silero VAD: pre-trained enterprise-grade Voice Activity Detector

voice-commands speech pytorch voice-recognition vad voice-control speech-processing voice-detection voice-activity-detection onnx onnxruntime onnx-runtime

Updated Jul 11, 2024
Python

modelscope / FunASR

Star

A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.

pytorch speech-recognition vad punctuation whisper audio-visual-speech-recognition speaker-diarization voice-activity-detection conformer pretrained-model rnnt dfsmn paraformer speechgpt speechllm

Updated Jul 11, 2024
Python

Picovoice / cobra

Star

On-device voice activity detection (VAD) powered by deep learning

speech-recognition vad voice-activity-detection on-device voice-activity voice-activity-detector

Updated Jul 8, 2024
Python

juanmc2005 / diart

Star

A python package to build AI-powered real-time audio applications

real-time deep-learning transcription speaker-diarization streaming-audio voice-activity-detection speaker-embedding

Updated Jul 8, 2024
Python

nianlonggu / WhisperSeg

Star

Code for ICASSP 2024 paper WhisperSeg: Positive Transfer of the Whisper Speech Transformer to Human and Animal Voice Activity Detection

transformer whisper audio-segmentation voice-activity-detection icassp2024 animal-sound-detection whisperseg

Updated Jul 8, 2024
Python

ina-foss / inaSpeechSegmenter

Star

CNN-based audio segmentation toolkit. Allows to detect speech, music, noise and speaker gender. Has been designed for large scale gender equality studies based on speech time per gender.

Updated Jul 2, 2024
Python

baxtree / subaligner

Star

Automatically synchronize and translate subtitles, or create new ones by transcribing, using pre-trained DNNs, Forced Alignments and Transformers. https://subaligner.readthedocs.io/

Updated Jul 1, 2024
Python

Yifei-ZHAO96 / Tr-VAD

Star

Tr-VAD: An Efficient Transformer based Voice Activity Detection Model

vad voice-activity-detection

Updated Jun 30, 2024
Python

OpenVoiceOS / ovos-vad-plugin-silero

Star

ovos plugin for voice activity detection using silero vad

plugin vad voice-activity-detection ovos openvoiceos

Updated Jun 27, 2024
Python

jim-schwoebel / voice_gender_detection

Star

♂️♀️ Detect a person's gender from a voice file (90.7% +/- 1.3% accuracy).

machine-learning tutorial voice voice-commands voice-recognition workshop-materials voice-control gender-classification voice-assistant machine-learning-modeling gender-detection machine-learning-practice voice-activity-detection machine-learning-tutorial voice-computing machine-learning-model surveylex neurolex

Updated Jun 17, 2024
Python

kristofferv98 / SemanthaVoiceAssistant

Star

A comprehensive AI companion leveraging advanced semantic analysis, sentiment detection, and voice processing to provide personalized and context-aware interactions using Autogen, semantic-router, and VoiceProcessingToolkit.

Updated Jun 2, 2024
Python

zhenghuatan / rVADfast

Star

This is the Python library for an unsupervised, fast method for robust voice activity detection (rVAD), as in the paper rVAD: An Unsupervised Segment-Based Robust Voice Activity Detection Method.

voice-activity-detection

Updated May 21, 2024
Python

HolgerBovbjerg / SSL-PVAD

Star

A repository for code used to produce the results the ICASSP 2024 paper: "SELF-SUPERVISED PRETRAINING FOR ROBUST PERSONALIZED VOICE ACTIVITY DETECTION IN ADVERSE CONDITIONS"

speech-processing voice-activity-detection self-supervised-learning personalized-machine-learning