NSD-MA-MSE

This repository is a PyTorch implementation of the paper "ANSD-MA-MSE: Adaptive Neural Speaker Diarization Using Memory-Aware Multi-Speaker Embedding". For more details, please see the paper.

Results

Pretrained Model

The model trained on the AMI dataset can be found here. It scores 11.19 on the AMI development set and 11.81 on the AMI test set (oracle VAD, th=0.5).

Training

For the training preparation process, see here.

Decoding

You can check the decoding results with the following commands:

 # AMI dev
 bash decode_MULTI_SE_MA_MSE_NSD_AMI.sh --stage 3 --data AMI_Headset_dev --sets dev
 
 # AMI test
 bash decode_MULTI_SE_MA_MSE_NSD_AMI.sh --stage 3 --data AMI_Headset_test --sets test
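
If you want to decode both subsets in one pass, the two commands above can be generated from a small loop. This is only a convenience sketch: the helper name `decode_cmd` is hypothetical and not part of the repository, and it assumes nothing beyond the script name and the `--stage`, `--data`, and `--sets` flags shown above. As written it prints each command instead of running it.

```shell
#!/usr/bin/env bash
# Hypothetical helper (not part of the repository): build the decoding
# command for one AMI subset, reusing only the flags shown above.
decode_cmd() {
  local subset="$1"   # "dev" or "test"
  printf 'bash decode_MULTI_SE_MA_MSE_NSD_AMI.sh --stage 3 --data AMI_Headset_%s --sets %s\n' \
    "$subset" "$subset"
}

# Print the command for each subset (dry run).
for subset in dev test; do
  decode_cmd "$subset"
done
```

To actually run the decoding, pipe the output to `bash` from the repository root, or replace the `printf` with a direct call to the script.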

Citation

If you find this code useful in your research, please consider citing the following paper:

@ARTICLE{10093997,
  author={He, Mao-Kui and Du, Jun and Liu, Qing-Feng and Lee, Chin-Hui},
  journal={IEEE/ACM Transactions on Audio, Speech, and Language Processing}, 
  title={ANSD-MA-MSE: Adaptive Neural Speaker Diarization Using Memory-Aware Multi-Speaker Embedding}, 
  year={2023},
  volume={31},
  number={},
  pages={1561-1573},
  doi={10.1109/TASLP.2023.3265199}}
