A Python library for rVAD, a fast, unsupervised method for robust voice activity detection, described in the paper "rVAD: An Unsupervised Segment-Based Robust Voice Activity Detection Method".

Python 126 21 Updated May 21, 2024

X-LANCE / MSDWILD

[INTERSPEECH 2022] This dataset is designed for multi-modal speaker diarization and lip-speech synchronization in the wild.

HTML 33 1 Updated Jan 24, 2024

joonaskalda / PixIT

Companion repo for the paper "PixIT: Joint Training of Speaker Diarization and Speech Separation from Real-world Multi-speaker Recordings" published at Odyssey 2024

Python 23 Updated Jun 19, 2024

jagabandhumishra / W2V-E2E-Language-Diarization

Python 7 Updated Sep 4, 2023

VundleVim / Vundle.vim

Vundle, the plug-in manager for Vim

Vim Script 23,883 2,568 Updated Jul 30, 2024

ohmyzsh / ohmyzsh

🙃 A delightful community-driven (with 2,300+ contributors) framework for managing your zsh configuration. Includes 300+ optional plugins (rails, git, macOS, hub, docker, homebrew, node, php, python…

Shell 172,319 25,792 Updated Sep 5, 2024

jungwoo-ha / WeeklyArxivTalk

[Zoom & Facebook Live] Weekly AI Arxiv Season 2

970 41 Updated Aug 27, 2023

etri / kmsav

Python 10 3 Updated Mar 18, 2024

HaoFengyuan / EEND-IAAE

The implementation of "End-to-End Neural Speaker Diarization with an Iterative Adaptive Attractor Estimation", which is accepted by Neural Networks.

Python 9 2 Updated Aug 27, 2023

desh2608 / diarizer

Clustering-based methods for overlapping diarization

Python 68 8 Updated Jan 12, 2024

MahmoudAshraf97 / whisper-diarization

Automatic Speech Recognition with Speaker Diarization based on OpenAI Whisper

Jupyter Notebook 3,287 272 Updated Sep 5, 2024

liyunlongaaa / NSD-MS2S

CHiME-7/8 diarization champion system: neural speaker diarization using memory-aware multi-speaker embedding with sequence-to-sequence architecture

Shell 60 4 Updated May 17, 2024

egruttadauria98 / SSpaVAlDo

Jupyter Notebook 27 2 Updated Apr 4, 2024

DongKeon / Awesome-Speaker-Diarization

Some comprehensive papers about speaker diarization

187 3 Updated Aug 13, 2024

popcornell / SparseLibriMix

Python 54 7 Updated Feb 15, 2021

huggingface / audio-transformers-course

The Hugging Face Course on Transformers for Audio

MDX 311 96 Updated Aug 15, 2024

EmulationAI / awesome-large-audio-models

Collection of resources on the applications of Large Language Models (LLMs) in Audio AI.

530 30 Updated Aug 3, 2024

Audio-WestlakeU / FS-EEND

The official PyTorch implementation of "Frame-wise streaming end-to-end speaker diarization with non-autoregressive self-attention-based attractors". [ICASSP 2024]

Python 75 4 Updated Jan 24, 2024

Mu-Y / DiariST

Python 17 3 Updated Sep 19, 2023

BUTSpeechFIT / DiaPer

Python 41 2 Updated Feb 8, 2024

trimstray / the-book-of-secret-knowledge

A collection of inspiring lists, manuals, cheatsheets, blogs, hacks, one-liners, cli/web tools and more.

142,987 9,413 Updated Aug 21, 2024

modelscope / 3D-Speaker

A Repository for Single- and Multi-modal Speaker Verification, Speaker Recognition and Speaker Diarization

Python 1,064 93 Updated Aug 18, 2024

google / speaker-id

This repository contains audio samples and supplementary materials accompanying publications by the "Speaker, Voice and Language" team at Google.

Python 342 40 Updated Sep 5, 2024