Fix integer overflow in quantization #129127

Flamefire · 2024-06-20T10:12:15Z

The static_cast<int64_t> can overflow for large float values and/or a small scale (e.g. 9.2e14 & 1e-4)
Fix a similar issue in the mask calculation where std::lrint is used which may convert to a 32 bit float returning an implementation defined value on overflow.

Stay in float mode using std::round and fmin/fmax to avoid this.

Fixes #111471

I actually fixed the CUDA code first and copied that.

I didn't touch the code duplication which can likely be removed by using a fitting AT_DISPATCH but for some reason the zero_point is int32_t while the limits are int64_t which to me doesn't make much sense and the actual type could always be used.

After #113861 failed for some reason

cc @jgong5 @mingfeima @XiaobingSuper @sanchitintel @ashokei @jingxu10

The `static_cast<int64_t>` can overflow for large float values and/or a small scale (e.g. 9.2e14 & 1e-4) Fix a similar issue in the mask calculation where `std::lrint` is used which may convert to a 32 bit float returning an implementation defined value on overflow. Stay in float mode using `std::round` and `fmin/fmax` to avoid this. Fixes pytorch#111471

pytorch-bot · 2024-06-20T10:12:19Z

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/129127

📄 Preview Python docs built from this PR
📄 Preview C++ docs built from this PR
❓ Need help or want to give feedback on the CI? Visit the bot commands wiki or our office hours

Note: Links to docs will display an error until the docs builds have been completed.

✅ No Failures

As of commit 2af912f with merge base 54b0006 ():
💚 Looks good so far! There are no failures yet. 💚

This comment was automatically generated by Dr. CI and updates every 15 minutes.

Flamefire and others added 5 commits January 22, 2024 22:56

Replace std::round by std::nearbyint

6099d4b

Merge branch 'pytorch:main' into fix-fake_quant

a164cae

Merge branch 'pytorch:main' into fix-fake_quant

e15879a

Merge branch 'pytorch:main' into fix-fake_quant

2af912f

Flamefire requested review from jerryzh168, salilsdesai, kimishpatel, digantdesai and jianyuh as code owners June 20, 2024 10:12

pytorch-bot bot added module: cpu CPU specific problem (e.g., perf, algorithm) release notes: quantization release notes category labels Jun 20, 2024

pytorchbot added the open source label Jun 20, 2024

zou3519 added the triaged This issue has been looked at a team member, and triaged and prioritized into an appropriate module label Jun 21, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fix integer overflow in quantization #129127

Fix integer overflow in quantization #129127

Flamefire commented Jun 20, 2024 •

edited by pytorch-bot bot

Loading

pytorch-bot bot commented Jun 20, 2024 •

edited

Loading

Fix integer overflow in quantization #129127

Are you sure you want to change the base?

Fix integer overflow in quantization #129127

Conversation

Flamefire commented Jun 20, 2024 • edited by pytorch-bot bot Loading

pytorch-bot bot commented Jun 20, 2024 • edited Loading

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/129127

✅ No Failures

Flamefire commented Jun 20, 2024 •

edited by pytorch-bot bot

Loading

pytorch-bot bot commented Jun 20, 2024 •

edited

Loading