Skip to content
/ McNet Public
forked from Audio-WestlakeU/McNet

The official repo: "McNet: Fuse Multiple Cues for Multichannel Speech Enhancement" submitted to ICASSP 2023

Notifications You must be signed in to change notification settings

ishine/McNet

 
 

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

14 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

McNet

The official repo: McNet: Fuse Multiple Cues for Multichannel Speech Enhancement submitted to ICASSP 2023 (https://arxiv.org/pdf/2211.08872.pdf). Examples can be found at https://audio.westlake.edu.cn/Research/McNet.htm.

Table 1. Performance of offline speech enhancement.* means scores are quoted from the original papers.

Method NB-PESQ WB-PESQ STOI SDR
Noisy 1.82 1.27 87.0 7.5
MNMF Beamforming * [20] - - 94.0 16.2
Oracle MVDR 2.49 1.94 97.0 17.3
CA Dense U-net * [12] - 2.44 - 18.6
Narrow-band Net [11] 2.74 2.13 95.0 16.6
FT-JNF [14] 3.17 2.48 96.2 17.7
McNet (prop.) 3.38 2.73 97.6 19.6

Table 2. Performance of online speech enhancement.

Method NB-PESQ WB-PESQ STOI SDR
Noisy 1.82 1.27 87.0 7.5
Narrow-band Net [11] 2.70 2.15 94.7 16.0
FT-JNF [14] 2.80 2.23 95.4 16.9
McNet (prop.) 3.29 2.67 97.2 19.0

Train & Test

Reminder: This project is built on the pytorch-lightning package, in particular its command line interface (CLI). To understand the commands below and config file, you need to have some basic knowledge about the CLI in lightning.

Train:

python McNetCLI.py fit --config config\mc_net_online.yaml

Test:

python McNetCLI.py test --config config\mc_net_online.yaml

About

The official repo: "McNet: Fuse Multiple Cues for Multichannel Speech Enhancement" submitted to ICASSP 2023

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

  • Python 100.0%