Skip to content

Main repository for the "Modern Methods of Speech Recognition and Synthesis" course project.

License

Notifications You must be signed in to change notification settings

neyudin/wavenetglow

Repository files navigation

WaveGlow

Unofficial implementation of WaveGlow model.

Project participants

Preliminary settings

Clone this repository and initialize submodule, install project requirements:

    git clone https://github.com/neyudin/wavenetglow.git
    cd wavenetglow
    git submodule init
    git submodule update
    pip install -r requirements.txt

Train the model

  1. Download LJ Speech dataset and unpack it to repository root.

  2. Set training configuration in file.

  3. Start training process:

    python train.py -c config.json

The whole training process can be monitored via Tensorboard while executing the following command (<log_dir> — directory with saved logs for Tensorboard, log_dir parameter in configuration file stands for it, <port_num> — port number to watch training information via https://localhost:<port_num>):

    tensorboard --logdir <log_dir> --port <port_num>

Text-to-Speech inference with pretrained Tacotron2 model

  1. Train WaveGlow model from scratch on data.

  2. Download pretrained Tacotron2 model.

  3. Run inference demo in notebook.

About

Main repository for the "Modern Methods of Speech Recognition and Synthesis" course project.

Topics

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published