-
Notifications
You must be signed in to change notification settings - Fork 10
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Trivial questions about the used models #3
Comments
Hi, no worry, thanks for your very interesting question! You r right, according to the work of ATST-Frame, C2F model performances better than ATST-Frame only. The reasons that we did not use it in this work are in two folds:
All models trained/fine-tuned in the ATST-Frame will be released in the audiossl repo. We still need some time (in one month) to organize the codes and ckpt files. |
I get it, thanks for your reply~ |
I close this issue if there is no further question. You are welcome to ask any other question in a new issue : ) |
Dear author, really sorry to bother your again.
I find that the atst-c2f model generally performs better than the atst-frame model no matter in tagging or detection tasks. Why don't you utilize this model to conduct downstream desed training? By the way, will the atst-c2f model be publicly available?
The text was updated successfully, but these errors were encountered: