DongKeon

Use your Neovim like using Cursor AI IDE!

Lua 4,681 158 Updated Sep 8, 2024
Jupyter Notebook 13 Updated Jul 16, 2024

Multi-Stage Face-Voice Association Learning with Keynote Speaker Diarization (ACM MM 2024)

Python 10 Updated Jul 25, 2024

Scripts for data generation, scoring and data manifest preparation for CHiME-8 DASR task.

Python 19 2 Updated Jul 10, 2024

The official code repo of "HTS-AT: A Hierarchical Token-Semantic Audio Transformer for Sound Classification and Detection"

Python 342 62 Updated Aug 16, 2024

📄 Awesome CV is a LaTeX template for your outstanding job application

TeX 22,728 4,745 Updated Aug 8, 2024

This is the Python library for an unsupervised, fast method for robust voice activity detection (rVAD), as in the paper rVAD: An Unsupervised Segment-Based Robust Voice Activity Detection Method.

Python 126 21 Updated May 21, 2024

[INTERSPEECH 2022] This dataset is designed for multi-modal speaker diarization and lip-speech synchronization in the wild.

HTML 33 1 Updated Jan 24, 2024

Companion repo for the paper "PixIT: Joint Training of Speaker Diarization and Speech Separation from Real-world Multi-speaker Recordings" published at Odyssey 2024

Python 23 Updated Jun 19, 2024

Vundle, the plug-in manager for Vim

Vim Script 23,883 2,568 Updated Jul 30, 2024

🙃 A delightful community-driven (with 2,300+ contributors) framework for managing your zsh configuration. Includes 300+ optional plugins (rails, git, macOS, hub, docker, homebrew, node, php, python…

Shell 172,319 25,792 Updated Sep 5, 2024

[Zoom & Facebook Live] Weekly AI Arxiv Season 2

970 41 Updated Aug 27, 2023
Python 10 3 Updated Mar 18, 2024

The implementation of "End-to-End Neural Speaker Diarization with an Iterative Adaptive Attractor Estimation", which is accepted by Neural Networks.

Python 9 2 Updated Aug 27, 2023

Clustering-based methods for overlapping diarization

Python 68 8 Updated Jan 12, 2024

Automatic Speech Recognition with Speaker Diarization based on OpenAI Whisper

Jupyter Notebook 3,287 272 Updated Sep 5, 2024

CHiME-7/8 diarization champion system: neural speaker diarization using memory-aware multi-speaker embedding with sequence-to-sequence architecture

Shell 60 4 Updated May 17, 2024
Jupyter Notebook 27 2 Updated Apr 4, 2024

Some comprehensive papers about speaker diarization

187 3 Updated Aug 13, 2024
Python 54 7 Updated Feb 15, 2021

The Hugging Face Course on Transformers for Audio

MDX 311 96 Updated Aug 15, 2024

Collection of resources on the applications of Large Language Models (LLMs) in Audio AI.

530 30 Updated Aug 3, 2024

The official PyTorch implementation of "Frame-wise streaming end-to-end speaker diarization with non-autoregressive self-attention-based attractors". [ICASSP 2024]

Python 75 4 Updated Jan 24, 2024
Python 17 3 Updated Sep 19, 2023
Python 41 2 Updated Feb 8, 2024

A collection of inspiring lists, manuals, cheatsheets, blogs, hacks, one-liners, cli/web tools and more.

142,987 9,413 Updated Aug 21, 2024

A Repository for Single- and Multi-modal Speaker Verification, Speaker Recognition and Speaker Diarization

Python 1,064 93 Updated Aug 18, 2024

This repository contains audio samples and supplementary materials accompanying publications by the "Speaker, Voice and Language" team at Google.

Python 342 40 Updated Sep 5, 2024