-
Notifications
You must be signed in to change notification settings - Fork 57
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Compare to VoiceFlow TTS #66
Comments
They are! I met the author @cantabile-kwok just last week (super nice guy), it is interesting we both made certain decisions to improve the speed relative to just conditional flow matching. One way to speed up that they employed was to improve the paths by "rectifying" the learned paths by flow matching which is a two-step approach and quite effective. For us, we felt that the same speedup could be achieved by improving the architecture instead so we improved the U-net architecture and got a similar speedup. Hope that helps. Shivam |
Thank you for your work and sharing!
It seems MATCHA-TTS and VoiceFLow-TTS (https://github.com/X-LANCE/VoiceFlow-TTS) are very similar?
What is the main diffences between these two methods?
And How about the performace on voice quality, for example prosody, and the inference speed?
The text was updated successfully, but these errors were encountered: