Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

transformer预训练模型是如何训练的? #61

Open
dzyanshan opened this issue Nov 22, 2022 · 1 comment
Open

transformer预训练模型是如何训练的? #61

dzyanshan opened this issue Nov 22, 2022 · 1 comment

Comments

@dzyanshan
Copy link

作者您好!以我目前的浅薄理解,训练的过程是transformer模型直接加载笔划权重pretrain_transformer_stroke_decomposition.pth,计算sr图与hr图的结果,l1loss回传给生成模型,预测过程是lr图经过生成模型获取sr图,使用crnn直接预测结果吗?transformer模型的参数在中途是不是不变啊,笔划部分是如何训练的呢?

@liujie316316
Copy link

您好,请问一下回传给生成模型的是l1损失吗?不是l2损失吗?回传过去的是mse loss吗?

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants