Trivial questions about the used models #3

Ming-er · 2023-11-13T04:07:55Z

Dear author, really sorry to bother your again.
I find that the atst-c2f model generally performs better than the atst-frame model no matter in tagging or detection tasks. Why don't you utilize this model to conduct downstream desed training? By the way, will the atst-c2f model be publicly available?

SaoYear · 2023-11-13T05:54:39Z

Hi, no worry, thanks for your very interesting question!

You r right, according to the work of ATST-Frame, C2F model performances better than ATST-Frame only. The reasons that we did not use it in this work are in two folds:

Simply because of the poorer performance of C2F when we use it in the first training stage of ATST-SED. In this work, we utilized the model fine-tuned on AS-2M as the pretrained model. However, since C2F unavoidably contains the clip-level information distilled from the ATST-Clip, it actually performed poorer than ATST-Frame. The CLS token of ATST-Clip has a negative affect on the SED, according to our previous experience. And the distillation in AS-2M (C2F) trains the ATST-Frame to perform similarly as the ATST-Clip CLS token.
The right way of using C2F in the DESED set is to fine-tune the ATST-Clip-AS_2M first and then distill it to ATST-Frame-AS_2M. However, we did not implement this process in the ATST-SED. Because our main focus was to fine-tune the pre-trained model, and we did not want to complicate the fine-tuning procedure in the development stage. Honestly, we did not know the performance of fine-tuning C2F model yet.

All models trained/fine-tuned in the ATST-Frame will be released in the audiossl repo. We still need some time (in one month) to organize the codes and ckpt files.

Ming-er · 2023-11-13T07:11:24Z

I get it, thanks for your reply~

SaoYear · 2023-11-16T04:03:17Z

I close this issue if there is no further question. You are welcome to ask any other question in a new issue : )

SaoYear closed this as completed Nov 16, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Trivial questions about the used models #3

Trivial questions about the used models #3

Ming-er commented Nov 13, 2023

SaoYear commented Nov 13, 2023

Ming-er commented Nov 13, 2023

SaoYear commented Nov 16, 2023

Trivial questions about the used models #3

Trivial questions about the used models #3

Comments

Ming-er commented Nov 13, 2023

SaoYear commented Nov 13, 2023

Ming-er commented Nov 13, 2023

SaoYear commented Nov 16, 2023