
matmul use fp32 compute_type #8733

Merged (1 commit, Dec 29, 2022)

Conversation

zhangting2020 (Contributor) commented:

Fix the low AMP training accuracy of the rec v3 model: matmul in this model invokes BatchedGEMM, and the framework accumulates in FP16 by default, which loses precision.

This PR sets the environment variable FLAGS_gemm_use_half_precision_compute_type=False so that BatchedGEMM accumulates in FP32; with this change, AMP-O1 and AMP-O2 training reach accuracy parity with the FP32 baseline.
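As an illustration (not the exact change in this PR's diff), here is a minimal sketch of how the flag can be applied: it must be set in the process environment before Paddle initializes, after which AMP training runs unchanged. The tiny Linear model and AMP loop below are hypothetical placeholders, not code from this PR.

```python
import os

# Set the flag before importing Paddle so it is picked up when the
# framework initializes; this forces batched GEMM to accumulate in FP32
# rather than FP16 under AMP.
os.environ["FLAGS_gemm_use_half_precision_compute_type"] = "False"

import paddle

# AMP-O1 forward/backward as usual; matmul's BatchedGEMM now uses an
# FP32 compute type for accumulation. Model and data are placeholders.
model = paddle.nn.Linear(64, 64)
scaler = paddle.amp.GradScaler(init_loss_scaling=1024)
x = paddle.randn([8, 64], dtype="float32")
with paddle.amp.auto_cast(level="O1"):
    loss = model(x).mean()
scaler.scale(loss).backward()
```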

paddle-bot bot commented Dec 29, 2022

Thanks for your contribution!

WenmuZhou merged commit 4f735db into PaddlePaddle:dygraph on Dec 29, 2022

2 participants