speech-detection

Star

Here are 19 public repositories matching this topic...

smacke / ffsubsync

Sponsor

Star

Automagically synchronize subtitles with video.

Updated Oct 16, 2024
Python

ina-foss / inaSpeechSegmenter

Star

CNN-based audio segmentation toolkit. Allows to detect speech, music, noise and speaker gender. Has been designed for large scale gender equality studies based on speech time per gender.

Updated Nov 8, 2024
Python

filippogiruzzi / voice_activity_detection

Star

Voice Activity Detection based on Deep Learning & TensorFlow

python machine-learning deep-neural-networks deep-learning time-series tensorflow speech artificial-intelligence speech-recognition vad resnet deeplearning time-series-classification voice-activity-detection librispeech speech-detection librispeech-dataset mfcc-features

Updated Mar 24, 2023
Python

gtreshchev / RuntimeSpeechRecognizer

Star

Cross-platform, real-time, offline speech recognition plugin for Unreal Engine. Based on Whisper OpenAI technology, whisper.cpp.

voice-recognition speech-recognition openai unreal-engine ue4 speech-to-text whisper speech-processing audio-processing unreal-engine-4 ue4-plugin speech-detection whis ue5 unreal-engine-5 ue5-plugin whisper-cpp whisper-ai

Updated Nov 14, 2024
C++

gkonovalov / android-vad

Star

Android Voice Activity Detection (VAD) library. Supports WebRTC VAD GMM, Silero VAD DNN, Yamnet VAD DNN models.

Updated Feb 12, 2024
C

tympanix / subsync

Star

Synchronize your subtitles using machine learning

machine-learning neural-network delay subtitles subtitle fix mfcc shift subsync speech-detection shift-subtitle

Updated Sep 18, 2023
Python

edusense / edusense

Star

EduSense: Practical Classroom Sensing at Scale

audio teachers classroom tracking machine-learning computer-vision pedagogy instructors posture gaze sensing speech-detection hand-raise

Updated Oct 28, 2024
Python

bbc / bbc-speech-segmenter

Star

A complete speech segmentation system using Kaldi and x-vectors for voice activity detection (VAD) and speaker diarisation.

automatic-speech-recognition speech-to-text voice-activity-detection speech-detection x-vectors endpoint-detection

Updated Jun 17, 2024
Shell

sepnic / litevad

Star

Voice activity detection (VAD) library for speech-end detection, based on WebRTC's VAD engine

webrtc voice-activity-detection speech-detection

Updated Jun 21, 2024
C

PranavPutsa1006 / Speaker-Diarization

Star

Identifying individual speakers in an audio stream based on the unique characteristics found in individual voices using Python

deep-learning neural-networks speech-to-text mfcc speaker-diarization spectral-clustering voice-activity-detection speech-segmentation speech-detection speech-transcription embeddings-extraction

Updated Jun 18, 2023
Jupyter Notebook

isbendiyarovanezrin / SpeechDetection

Star

Speech Detection 💬

vanilla-javascript web-speech-api speech-recognition javascript30 speech-to-text speech-detection

Updated Mar 22, 2022
CSS

pocketpiglet / pocketpiglet-ios

Star

PocketPiglet for iOS

game ios qt multimedia qml voice qt5 animations pet vad talking voice-activity-detection speech-detection

Updated Nov 29, 2022
QML

sepnic / vadrecorder

Star

VadRecorder based webrtc's VAD engine and vo-aac encoder, recording valid speech and discarding silence/noise data

webrtc audio-recorder audio-encoder speech-detection

Updated Jun 21, 2024
C++

andreahergert / speech_detection

Star

30 Days of Javascript Day 20

javascript speech-detection

Updated Mar 23, 2023
JavaScript

pocketpiglet / pocketpiglet-android

Star

PocketPiglet for Android

android game qt multimedia qml voice qt5 animations pet vad talking voice-activity-detection speech-detection

Updated Feb 18, 2023
QML

DPigeon / SeeText

Star

A mobile application that shows you what you say and objects around.

android mobile ai computer-vision languages object-detection definitions speech-detection ctext

Updated Sep 8, 2020
Java

Mihir998 / Deep-Learning-Activities

Star

This repository contains scripts of activities performed on various deep learning concepts

data-science data deep-neural-networks deep-learning neural-network speech-detection

Updated Nov 15, 2022
Jupyter Notebook

Mixa26 / Spoken-digits-recognizer-with-dynamic-time-warping

Star

sound speech-recognition digit-recognition speech-detection spoken-digits-recognition spoken-digits

Updated Dec 13, 2023
Jupyter Notebook

baochuquan / ios-vad

Star

iOS Voice Activity Detection (VAD). Supports WebRTC VAD GMM, Silero VAD DNN, Yamnet VAD DNN models.

Updated Nov 14, 2024
Swift

Improve this page

Add a description, image, and links to the speech-detection topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the speech-detection topic, visit your repo's landing page and select "manage topics."

Learn more

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

speech-detection

Here are 19 public repositories matching this topic...

smacke / ffsubsync

ina-foss / inaSpeechSegmenter

filippogiruzzi / voice_activity_detection

gtreshchev / RuntimeSpeechRecognizer

gkonovalov / android-vad

tympanix / subsync

edusense / edusense

bbc / bbc-speech-segmenter

sepnic / litevad

PranavPutsa1006 / Speaker-Diarization

isbendiyarovanezrin / SpeechDetection

pocketpiglet / pocketpiglet-ios

sepnic / vadrecorder

andreahergert / speech_detection

pocketpiglet / pocketpiglet-android

DPigeon / SeeText

Mihir998 / Deep-Learning-Activities

Mixa26 / Spoken-digits-recognizer-with-dynamic-time-warping

baochuquan / ios-vad

Improve this page

Add this topic to your repo