Solving Oscillation Problem in Post-Training Quantization Through a Theoretical Perspective

Ma, Yuexiao; Li, Huixia; Zheng, Xiawu; Xiao, Xuefeng; Wang, Rui; Wen, Shilei; Pan, Xin; Chao, Fei; Ji, Rongrong

Computer Science > Computer Vision and Pattern Recognition

arXiv:2303.11906v1 (cs)

[Submitted on 21 Mar 2023 (this version), latest version 4 Apr 2023 (v2)]

Title:Solving Oscillation Problem in Post-Training Quantization Through a Theoretical Perspective

Authors:Yuexiao Ma, Huixia Li, Xiawu Zheng, Xuefeng Xiao, Rui Wang, Shilei Wen, Xin Pan, Fei Chao, Rongrong Ji

View PDF

Abstract:Post-training quantization (PTQ) is widely regarded as one of the most efficient compression methods practically, benefitting from its data privacy and low computation costs. We argue that an overlooked problem of oscillation is in the PTQ methods. In this paper, we take the initiative to explore and present a theoretical proof to explain why such a problem is essential in PTQ. And then, we try to solve this problem by introducing a principled and generalized framework theoretically. In particular, we first formulate the oscillation in PTQ and prove the problem is caused by the difference in module capacity. To this end, we define the module capacity (ModCap) under data-dependent and data-free scenarios, where the differentials between adjacent modules are used to measure the degree of oscillation. The problem is then solved by selecting top-k differentials, in which the corresponding modules are jointly optimized and quantized. Extensive experiments demonstrate that our method successfully reduces the performance drop and is generalized to different neural networks and PTQ methods. For example, with 2/4 bit ResNet-50 quantization, our method surpasses the previous state-of-the-art method by 1.9%. It becomes more significant on small model quantization, e.g. surpasses BRECQ method by 6.61% on MobileNetV2*0.5.

Comments:	Accepted by CVPR 2023
Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2303.11906 [cs.CV]
	(or arXiv:2303.11906v1 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2303.11906

Submission history

From: Yuexiao Ma [view email]
[v1] Tue, 21 Mar 2023 14:52:52 UTC (627 KB)
[v2] Tue, 4 Apr 2023 08:04:19 UTC (889 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:Solving Oscillation Problem in Post-Training Quantization Through a Theoretical Perspective

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:Solving Oscillation Problem in Post-Training Quantization Through a Theoretical Perspective

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators