[Quantization] Support zero-size tensor input for quantization flow #15031

ciyongch · 2019-05-22T02:43:47Z

Description

This PR is to support zero-size tensor input for quantization flow.
Let's take RNN related model as an example, the begin_state is always initialized into shape (0, self._num_hidden), it worked well in FP32 pass, but failed in INT8 pass due to unknown dimension error with latest MXNet code base.
With this patch, models with such inputs are able to be quantized and run in INT8 mode.

@pengzhao-intel @TaoLv @ZhennanQin

Checklist

Essentials

Please feel free to remove inapplicable items for your PR.

The PR title starts with [MXNET-$JIRA_ID], where $JIRA_ID refers to the relevant JIRA issue created (except PRs with tiny changes)
Changes are complete (i.e. I finished coding on this PR)
All changes have test coverage:
Unit tests are added for small changes to verify correctness (e.g. adding a new operator)
Nightly tests are added for complicated/long-running ones (e.g. changing distributed kvstore)
Build tests will be added for build configuration changes (e.g. adding a new build option with NCCL)
Code is well-documented:
For user-facing API changes, API doc string has been updated.
For new C++ functions in header files, their functionalities and arguments are documented.
For new examples, README.md is added to explain the what the example does, the source of the dataset, expected performance on test set and reference to the original paper if applicable
Check the API doc at http:https://mxnet-ci-doc.s3-accelerate.dualstack.amazonaws.com/PR-$PR_ID/$BUILD_ID/index.html
To the my best knowledge, examples are either not affected by this change, or have been fixed to be compatible with this change

Changes

Feature1, tests, (and when applicable, API doc)
Feature2, tests, (and when applicable, API doc)

Comments

If this change is a backward incompatible change, why must this change be made.
Interesting edge cases to note here

src/operator/quantization/quantize-inl.h

src/operator/quantization/quantized_activation.cc

pengzhao-intel · 2019-05-22T06:52:11Z

@ZhennanQin @TaoLv @xinyu-intel please help take a review.

pengzhao-intel · 2019-05-22T08:32:34Z

Please also add a test case

src/operator/subgraph/subgraph_property.h.bak

ciyongch · 2019-05-22T11:07:32Z

@pengzhao-intel @TaoLv test case is added, redundant file is remove. Please help to review again:)

pengzhao-intel

Thanks for the improvements.

LGTM

ZhennanQin · 2019-05-23T01:38:05Z

LGTM.

pengzhao-intel · 2019-05-23T01:38:47Z

Thanks for your contribution. Merging now.

…pache#15031) * [Quantization] Support zero-size tensor input for quantization flow * Comment out quantized_act and quantized_sum * retrigger CI * Add test cases

ciyongch added 3 commits May 22, 2019 10:13

[Quantization] Support zero-size tensor input for quantization flow

007c737

Comment out quantized_act and quantized_sum

a866e05

retrigger CI

fd16c3c

pengzhao-intel reviewed May 22, 2019

View reviewed changes

src/operator/quantization/quantize-inl.h Show resolved Hide resolved

pengzhao-intel reviewed May 22, 2019

View reviewed changes

src/operator/quantization/quantized_activation.cc Outdated Show resolved Hide resolved

pengzhao-intel mentioned this pull request May 22, 2019

MKL-DNN QuantizedFullyConnectedOp Error #14467

Closed

pengzhao-intel added this to Review in progress in CPU Performance and Quantization May 22, 2019

pengzhao-intel mentioned this pull request May 22, 2019

[Discussion] 1.5.0 Roadmap #14619

Closed

TaoLv reviewed May 22, 2019

View reviewed changes

src/operator/subgraph/subgraph_property.h.bak Outdated Show resolved Hide resolved

Add test cases

bc9760c

ciyongch force-pushed the zero-size-quantization branch from 652af82 to bc9760c Compare May 22, 2019 11:05

pengzhao-intel approved these changes May 22, 2019

View reviewed changes

CPU Performance and Quantization automation moved this from Review in progress to Reviewer approved May 22, 2019

TaoLv approved these changes May 23, 2019

View reviewed changes

pengzhao-intel merged commit d4e458e into apache:master May 23, 2019

CPU Performance and Quantization automation moved this from Reviewer approved to Done May 23, 2019

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[Quantization] Support zero-size tensor input for quantization flow #15031

[Quantization] Support zero-size tensor input for quantization flow #15031

ciyongch commented May 22, 2019

pengzhao-intel commented May 22, 2019

pengzhao-intel commented May 22, 2019

ciyongch commented May 22, 2019

pengzhao-intel left a comment

ZhennanQin commented May 23, 2019

pengzhao-intel commented May 23, 2019

[Quantization] Support zero-size tensor input for quantization flow #15031

[Quantization] Support zero-size tensor input for quantization flow #15031

Conversation

ciyongch commented May 22, 2019

Description

Checklist

Essentials

Changes

Comments

pengzhao-intel commented May 22, 2019

pengzhao-intel commented May 22, 2019

ciyongch commented May 22, 2019

pengzhao-intel left a comment

Choose a reason for hiding this comment

ZhennanQin commented May 23, 2019

pengzhao-intel commented May 23, 2019