Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

How we can slow down the output audio? #1067

Open
ayush431 opened this issue May 19, 2022 · 2 comments
Open

How we can slow down the output audio? #1067

ayush431 opened this issue May 19, 2022 · 2 comments

Comments

@ayush431
Copy link

No description provided.

@raccoonML
Copy link

A more advanced solution is to save the attention layer alignments from inference, stretch them by the desired slowdown amount, then run the decoder loop again replacing the attention network output with the stretched alignments. This has the effect of forcing the decoder to use more frames when synthesizing each letter of every word, slowing down the output. With similar techniques, it's also possible to slow down selected words, or speed up.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

3 participants
@ayush431 @raccoonML and others