This project applies an autoencoder deep neural network to the multichannel speech enhancement problem, covering the full pipeline from dataset generation to model training.
To train the model, you need a dataset containing mixture signals and the corresponding clean target signals; the dataset is then converted to magnitude spectra. You can use the code snippets in the Dataset Generation folder to create your own dataset. Note that you will need to supply your own speech and noise corpora. These scripts handle mixture generation and STFT conversion, producing the data in a structured form.
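As a rough sketch of what such a pipeline does (not the repository's actual scripts), the example below mixes a speech signal with noise at a chosen SNR and converts both the mixture and the clean target to magnitude spectra with `scipy.signal.stft`. The sampling rate, FFT size, and random placeholder signals are assumptions; in practice you would load real speech and noise recordings.

```python
import numpy as np
from scipy.signal import stft

def mix_at_snr(speech, noise, snr_db):
    # Scale the noise so the mixture reaches the requested signal-to-noise ratio
    speech_power = np.mean(speech ** 2)
    noise_power = np.mean(noise ** 2)
    scale = np.sqrt(speech_power / (noise_power * 10 ** (snr_db / 10)))
    return speech + scale * noise

def magnitude_spectrum(signal, fs=16000, n_fft=512):
    # STFT, then drop the phase: magnitude-domain features for training
    _, _, Z = stft(signal, fs=fs, nperseg=n_fft)
    return np.abs(Z)

# Placeholder 1-second signals; replace with your own speech/noise data
rng = np.random.default_rng(0)
speech = rng.standard_normal(16000)
noise = rng.standard_normal(16000)

mixture = mix_at_snr(speech, noise, snr_db=5)
X = magnitude_spectrum(mixture)   # network input (noisy magnitude)
Y = magnitude_spectrum(speech)    # clean target magnitude
```

Each (X, Y) pair then forms one training example: the autoencoder learns to map the noisy magnitude spectrum to the clean one.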
You can download the thesis from here: https://github.com/furkanarius/Multichannel-Speech-Enhancement-with-Deep-Neural-Networks/blob/master/652282.pdf