-
Notifications
You must be signed in to change notification settings - Fork 28
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
cost #23
Comments
APE的训练是16张V100-32G,大概30-40天,中间有少量中断。 |
有没有包括预训练那部分的时间啊,我也正在做类似的项目,1024*1024的分辨率,batchsize 就每张卡一张图,花费的时间就非常非常长,你们具体是什么一个情况呢 |
APE是直接用基于CLIP预训练后的ViT和文本编码器。APE本身只训练一次。 我们训练也确实很慢,大概3s一个step,APE-D总共训练1080k个steps。 前期我们是用R50验证有提升,后面才用更大模型。 |
你们当时有没有考虑用EVA_CLIP中VIT 作为BACK BONE ,或者有没有做这样得尝试呢 |
APE模型已经是用了EVA_CLIP的视觉和文本编码器。 |
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
这个APE(D)在多少张卡上训练的啊,一共用了多少卡时??
The text was updated successfully, but these errors were encountered: