This repository has been archived by the owner on Nov 17, 2023. It is now read-only.

[int8] Add MobileNetV2_1.0 & ResNet18 Quantization #14823

Merged
merged 5 commits into apache:master on Apr 30, 2019

Conversation

xinyu-intel
Contributor

Description

Add MobileNetV2_1.0 & ResNet18 Quantization.

ResNet18 Performance on Skylake 8180 28c

resnet18_v1 throughput (images/sec):

batch size    fp32      int8      speedup
1             309.61    492.18    1.59
64            810.82    1341.55   1.65

accuracy (top-1/top-5): fp32 70.07%/89.30%, int8 69.85%/89.23%

#14819 will further improve MobileNetV2 fp32/int8 performance; the fp32_opt/int8_opt columns include that optimization.

mobilenetv2_1.0 throughput (images/sec):

batch size    fp32      int8      speedup    fp32_opt   int8_opt   speedup
1             75.22     162.12    2.16       240.51     413.92     1.72
64            291.63    469.28    1.61       795.86     3137.77    3.94

accuracy (top-1/top-5): fp32 70.14%/89.60%, int8 63.62%/84.84%; with opt: fp32 70.14%/89.60%, int8 69.53%/89.24%
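The speedup columns in both tables are simply int8/fp32 throughput ratios; a quick sanity check, using only the numbers reported above:

```python
# Verify the reported speedups as int8/fp32 throughput ratios.
# All throughput numbers are copied from the tables in this PR description.
resnet18 = {1: (309.61, 492.18), 64: (810.82, 1341.55)}        # bs -> (fp32, int8)
mobilenetv2 = {1: (75.22, 162.12), 64: (291.63, 469.28)}
mobilenetv2_opt = {1: (240.51, 413.92), 64: (795.86, 3137.77)}

for name, table in [('resnet18_v1', resnet18),
                    ('mobilenetv2_1.0', mobilenetv2),
                    ('mobilenetv2_1.0 (opt)', mobilenetv2_opt)]:
    for batch, (fp32, int8) in sorted(table.items()):
        print('%s bs=%d speedup=%.2f' % (name, batch, int8 / fp32))
```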

@pengzhao-intel @ZhennanQin

Checklist

Essentials

Please feel free to remove inapplicable items for your PR.

  • The PR title starts with [MXNET-$JIRA_ID], where $JIRA_ID refers to the relevant JIRA issue created (except PRs with tiny changes)
  • Changes are complete (i.e. I finished coding on this PR)
  • All changes have test coverage:
  • Unit tests are added for small changes to verify correctness (e.g. adding a new operator)
  • Nightly tests are added for complicated/long-running ones (e.g. changing distributed kvstore)
  • Build tests will be added for build configuration changes (e.g. adding a new build option with NCCL)
  • Code is well-documented:
  • For user-facing API changes, API doc string has been updated.
  • For new C++ functions in header files, their functionalities and arguments are documented.
  • For new examples, README.md is added to explain what the example does, the source of the dataset, expected performance on the test set, and a reference to the original paper if applicable
  • Check the API doc at https://mxnet-ci-doc.s3-accelerate.dualstack.amazonaws.com/PR-$PR_ID/$BUILD_ID/index.html
  • To my best knowledge, examples are either not affected by this change, or have been fixed to be compatible with this change

Changes

  • Feature1, tests, (and when applicable, API doc)
  • Feature2, tests, (and when applicable, API doc)

Comments

  • If this change is a backward incompatible change, why must this change be made.
  • Interesting edge cases to note here

@xinyu-intel xinyu-intel requested a review from szha as a code owner April 28, 2019 07:34
@pengzhao-intel
Contributor

cc @zhreshold

@pengzhao-intel pengzhao-intel added this to Review in progress in CPU Performance and Quantization Apr 28, 2019
CPU Performance and Quantization automation moved this from Review in progress to Reviewer approved Apr 28, 2019
Contributor

@pengzhao-intel pengzhao-intel left a comment


LGTM, let's wait for CI to come back.

@@ -392,6 +392,7 @@ mkldnn_memory_format_t GetDefaultFormat(const mkldnn::memory::desc &desc) {
case mkldnn_gOhwi8o:
case mkldnn_gOhwi16o:
case mkldnn_gOhIw16o4i:
case mkldnn_Goihw16g_s8s8:
Member


What's this? Is this the first time we've had this format?

Contributor Author


Yes, it's needed when quantizing s8s8 group convolution.
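For readers unfamiliar with blocked weight layouts: `Goihw16g` stores grouped-convolution weights with the group dimension split into blocks of 16 and the 16-group chunk placed innermost, so int8 kernels can load 16 groups' weights with one contiguous vector read (the `_s8s8` suffix marks the signed-input/signed-weight variant). The exact definition lives in MKL-DNN; the index arithmetic below is an illustrative sketch of such a blocked layout, not the library's code:

```python
def goihw16g_offset(g, o, i, h, w, O, I, H, W):
    """Flat offset into a Goihw16g-blocked weight buffer (illustrative).

    Logical dims: groups G, out-channels-per-group O, in-channels-per-group I,
    kernel H x W. Groups are split into G/16 outer blocks; the remaining
    16-group chunk is the innermost, contiguous dimension.
    """
    g_outer, g_inner = divmod(g, 16)
    return ((((g_outer * O + o) * I + i) * H + h) * W + w) * 16 + g_inner
```

With a 3x3 depthwise-style kernel (O=I=1), group 16 starts a new outer block, so its first weight sits 1*1*3*3*16 = 144 elements in.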

@@ -234,6 +242,12 @@ def save_params(fname, arg_params, aux_params, logger=None):
'mobilenet0_pool0_fwd']
if exclude_first_conv:
excluded_sym_names += ['mobilenet0_conv0_fwd']
elif args.model == 'mobilenetv2_1.0':
rgb_mean = '123.68,116.779,103.939'
Member


Is there any case where rgb_mean and std differ between models? Otherwise, coding them repeatedly looks redundant.

Contributor


Agree with @zhreshold. As we enable more models, we don't need to show how to reproduce each one, since most of the commands are very similar. We need to define a template that lets users reproduce any existing classification model.

Contributor Author


@zhreshold @pengzhao-intel Agreed, I'll refactor this script when enabling more models next time.
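One way to do that refactor is a single per-model configuration table, so preprocessing constants and excluded layers become data instead of an `elif` chain. A minimal sketch, where only `rgb_mean` and the mobilenet layer names come from this PR's diff; everything else (the table structure, the mobilenetv2 entries marked as placeholders, the helper name) is hypothetical:

```python
# Hypothetical config table replacing the per-model elif chain in the
# calibration script. rgb_mean and the mobilenet1.0 layer names appear in
# the diff above; the mobilenetv2 entries here are placeholders.
IMAGENET_MEAN = '123.68,116.779,103.939'

MODEL_CONFIGS = {
    'mobilenet1.0': {
        'rgb_mean': IMAGENET_MEAN,
        'excluded': ['mobilenet0_pool0_fwd'],
        'first_conv': 'mobilenet0_conv0_fwd',
    },
    'mobilenetv2_1.0': {
        'rgb_mean': IMAGENET_MEAN,
        'excluded': [],       # placeholder
        'first_conv': None,   # placeholder
    },
}

def calibration_config(model, exclude_first_conv=False):
    """Look up one model's calibration settings instead of branching."""
    cfg = MODEL_CONFIGS[model]
    excluded = list(cfg['excluded'])
    if exclude_first_conv and cfg['first_conv']:
        excluded.append(cfg['first_conv'])
    return cfg['rgb_mean'], excluded
```

Adding a new model then means adding one dict entry rather than another branch in the script.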

@roywei
Member

roywei commented Apr 29, 2019

@mxnet-label-bot add [Quantization, Example]

@marcoabreu marcoabreu added Example Quantization Issues/Feature Requests related to Quantization labels Apr 29, 2019
@pengzhao-intel
Contributor

pengzhao-intel commented Apr 30, 2019

Merging now. @xinyu-intel will refactor the script in the next PR.

@pengzhao-intel pengzhao-intel merged commit bde1b84 into apache:master Apr 30, 2019
CPU Performance and Quantization automation moved this from Reviewer approved to Done Apr 30, 2019
access2rohit pushed a commit to access2rohit/incubator-mxnet that referenced this pull request May 14, 2019
* add resnet18 and mobilenetv2 models

* add readme

* support mkldnn s8s8 goihw16g weight format

* fix_readme_typo
haohuanw pushed a commit to haohuanw/incubator-mxnet that referenced this pull request Jun 23, 2019
* add resnet18 and mobilenetv2 models

* add readme

* support mkldnn s8s8 goihw16g weight format

* fix_readme_typo