Skip to content

Commit

Permalink
Update README.md
Browse files Browse the repository at this point in the history
  • Loading branch information
BogiHsu committed Nov 9, 2020
1 parent 5cb7590 commit aafde38
Showing 1 changed file with 2 additions and 2 deletions.
4 changes: 2 additions & 2 deletions README.md
Original file line number Diff line number Diff line change
Expand Up @@ -2,7 +2,7 @@
Yet another PyTorch implementation of [Natural TTS Synthesis by Conditioning WaveNet on Mel Spectrogram Predictions](https://arxiv.org/pdf/1712.05884.pdf). The project is highly based on [these](#References). I made some modification to improve speed and performance of both training and inference.

## TODO
- [ ] Combine with [WG-WaveNet](https://bogihsu.github.io/WG-WaveNet/).
- [ ] Combine with [WG-WaveNet](https://github.com/BogiHsu/WG-WaveNet).
- [ ] Add Colab demo.

## Requirements
Expand Down Expand Up @@ -52,7 +52,7 @@ python3 inference.py --ckpt_pth=<pth/to/model> --img_pth=<pth/to/save/alignment>
You can download pretrained models from [here](https://www.dropbox.com/sh/vk2erozpkoltao6/AABCk4WryQtrt4BYthIKzbK7a?dl=0) (git commit: [9e7c26d](https://github.com/BogiHsu/Tacotron2-PyTorch/commit/9e7c26d93ea9d93332b1c316ac85c58771197d4f)). The hyperparameter for training is also in the directory.

## Vocoder
A vocoder is not implemented yet. For now it just reconstucts the linear spectrogram from the Mel spectrogram directly and uses Griffim-Lim to synthesize the waveform. A pipeline for [WG-WaveNet](https://bogihsu.github.io/WG-WaveNet/) is in progress. Or you can refer to [WaveNet](https://github.com/r9y9/wavenet_vocoder), [FFTNet](https://github.com/syang1993/FFTNet), or [WaveGlow](https://github.com/NVIDIA/waveglow).
A vocoder is not implemented yet. For now it just reconstucts the linear spectrogram from the Mel spectrogram directly and uses Griffim-Lim to synthesize the waveform. A pipeline for [WG-WaveNet](https://github.com/BogiHsu/WG-WaveNet) is in progress. Or you can refer to [WaveNet](https://github.com/r9y9/wavenet_vocoder), [FFTNet](https://github.com/syang1993/FFTNet), or [WaveGlow](https://github.com/NVIDIA/waveglow).

## Results
You can find some samples in [results](https://github.com/BogiHsu/Tacotron2-PyTorch/tree/master/results). These results are generated using either pseudo inverse or WaveNet.
Expand Down

0 comments on commit aafde38

Please sign in to comment.