Single channel voice separation for unknown number of speakers under reverberant and noisy settings

Chazan, Shlomo E.; Wolf, Lior; Nachmani, Eliya; Adi, Yossi

Computer Science > Sound

arXiv:2011.02329 (cs)

[Submitted on 4 Nov 2020]

Title:Single channel voice separation for unknown number of speakers under reverberant and noisy settings

Authors:Shlomo E. Chazan, Lior Wolf, Eliya Nachmani, Yossi Adi

View PDF

Abstract:We present a unified network for voice separation of an unknown number of speakers. The proposed approach is composed of several separation heads optimized together with a speaker classification branch. The separation is carried out in the time domain, together with parameter sharing between all separation heads. The classification branch estimates the number of speakers while each head is specialized in separating a different number of speakers. We evaluate the proposed model under both clean and noisy reverberant set-tings. Results suggest that the proposed approach is superior to the baseline model by a significant margin. Additionally, we present a new noisy and reverberant dataset of up to five different speakers speaking simultaneously.

Subjects:	Sound (cs.SD); Machine Learning (cs.LG); Audio and Speech Processing (eess.AS)
Cite as:	arXiv:2011.02329 [cs.SD]
	(or arXiv:2011.02329v1 [cs.SD] for this version)
	https://doi.org/10.48550/arXiv.2011.02329

Submission history

From: Shlomo Chazan [view email]
[v1] Wed, 4 Nov 2020 14:59:14 UTC (365 KB)

Full-text links:

Access Paper:

view license

Current browse context:

cs.SD

< prev | next >

new | recent | 2020-11

Change to browse by:

cs
cs.LG
eess
eess.AS

References & Citations

DBLP - CS Bibliography

listing | bibtex

Shlomo E. Chazan
Lior Wolf
Eliya Nachmani
Yossi Adi

export BibTeX citation

Computer Science > Sound

Title:Single channel voice separation for unknown number of speakers under reverberant and noisy settings

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Sound

Title:Single channel voice separation for unknown number of speakers under reverberant and noisy settings

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators