-
Notifications
You must be signed in to change notification settings - Fork 8.6k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
How we can slow down the output audio? #1067
Comments
A more advanced solution is to save the attention layer alignments from inference, stretch them by the desired slowdown amount, then run the decoder loop again replacing the attention network output with the stretched alignments. This has the effect of forcing the decoder to use more frames when synthesizing each letter of every word, slowing down the output. With similar techniques, it's also possible to slow down selected words, or speed up. |
No description provided.
The text was updated successfully, but these errors were encountered: