The code for our INTERSPEECH 2020 paper - Jointly Fine-Tuning "BERT-like" Self Supervised Models to Improve Multimodal Speech Emotion Recognition

Python 109 10 Updated Feb 26, 2021

[ICASSP 2023] Official TensorFlow implementation of "Temporal Modeling Matters: A Novel Temporal Emotional Modeling Approach for Speech Emotion Recognition".

Python 154 25 Updated May 15, 2024

Speech emotion recognition implemented in PyTorch

Python 103 19 Updated Jul 2, 2024
Jupyter Notebook 1 Updated May 18, 2024

This code package implements tools to finetune and evaluate end-to-end neural speaker diarization models.

Python 2 Updated May 3, 2024

CHiME-7 diarization champion system: neural speaker diarization using memory-aware multi-speaker embedding with sequence-to-sequence architecture

Shell 55 3 Updated May 17, 2024

This project tackles speaker diarization using audio features, face recognition, and video features extracted from face images and mouth tracking.

Python 14 6 Updated Feb 10, 2019

Multimodal speaker diarization using pre-trained audio-visual synchronization model

Python 8 6 Updated May 12, 2020

A Python package to build AI-powered real-time audio applications

Python 903 76 Updated Jul 8, 2024

Diarization scoring tools.

Python 201 41 Updated Mar 28, 2023

The PyTorch 1.6 and Python 3.7 implementation for the paper Graph Convolutional Networks for Text Classification

Python 104 20 Updated Oct 7, 2020

A PyTorch implementation of the paper "ANSD-MA-MSE: Adaptive Neural Speaker Diarization Using Memory-Aware Multi-Speaker Embedding"

Shell 38 2 Updated May 28, 2024
Python 42 3 Updated Nov 24, 2022

Research and Production Oriented Speaker Verification, Recognition and Diarization Toolkit

Python 597 104 Updated Jul 10, 2024

PyTorch implementation of Densely Connected Time Delay Neural Network

Python 83 24 Updated May 4, 2023

Deploying a YOLOv5 object detection model to the public web on Alibaba Cloud, with a Flask back end and a Vue front end

Python 174 25 Updated Apr 22, 2024

The official PyTorch implementation of "Frame-wise streaming end-to-end speaker diarization with non-autoregressive self-attention-based attractors". [ICASSP 2024]

Python 65 4 Updated Jan 24, 2024
Python 1 Updated Sep 30, 2023

Dual-path RNN: efficient long sequence modeling for time-domain single-channel speech separation, implemented in PyTorch

Python 400 68 Updated Feb 14, 2023

A Repository for Single- and Multi-modal Speaker Verification, Speaker Recognition and Speaker Diarization

Python 931 77 Updated Jun 25, 2024

This repository contains code to run (i.e., train, perform inference with, and evaluate) a diarization method called EEND-vector-clustering.

Python 69 17 Updated Oct 18, 2022

Speaker diarization based on Kaldi x-vectors, tuned for 16k microphone data

Python 95 29 Updated Jul 6, 2023

Self-supervised Speaker Diarization Interspeech 2022 Implementation

Python 10 Updated Sep 13, 2022
Python 42 Updated Jan 12, 2023

Variational Bayes HMM over x-vectors diarization

Python 245 57 Updated Jan 15, 2024

🎙️ Enhanced Speaker Diarisation 📒 with OSD, SS, and Advanced VAD🗣️.

Python 4 1 Updated Aug 27, 2023

A PyTorch implementation of End-to-End Neural Diarization

Python 95 15 Updated Jun 19, 2023