This repository has been archived by the owner on Nov 17, 2023. It is now read-only.

Support 3D input for MKL-DNN softmax operator #14818

Merged: 13 commits into apache:master on May 17, 2019

Conversation

@TaoLv (Member) commented Apr 27, 2019

Description

  1. Support 3D softmax layers in GluonNLP BERT (needs #14783, "Update MKL-DNN submodule to v0.19", for better performance); a short illustrative sketch follows this list.
  2. Fix in-place softmax.
  3. Remove the ctx.is_train check so the C++ test for softmax can work.
  4. Enhance the checks in SupportMKLDNNSoftmax.
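
To make items 1 and 2 concrete, here is a minimal sketch of the calls this PR is meant to accelerate: a softmax over the last axis of a 3D tensor such as BERT attention scores, plus the in-place variant. The (batch * heads, seq_len, seq_len) shape is an illustrative assumption, not taken from this PR.

import mxnet as mx

# Illustrative 3D attention-score shape: (batch * heads, seq_len, seq_len).
scores = mx.nd.random.uniform(shape=(12 * 8, 128, 128))

# Softmax over the last axis -- the 3D case this PR routes to MKL-DNN.
probs = mx.nd.softmax(scores, axis=-1)

# In-place softmax (item 2): write the result back into the input buffer.
mx.nd.softmax(scores, axis=-1, out=scores)

print(probs.shape, scores.shape)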

Checklist

Essentials

Please feel free to remove inapplicable items for your PR.

  • The PR title starts with [MXNET-$JIRA_ID], where $JIRA_ID refers to the relevant JIRA issue created (except PRs with tiny changes)
  • Changes are complete (i.e. I finished coding on this PR)
  • All changes have test coverage:
  • Unit tests are added for small changes to verify correctness (e.g. adding a new operator)
  • Nightly tests are added for complicated/long-running ones (e.g. changing distributed kvstore)
  • Build tests will be added for build configuration changes (e.g. adding a new build option with NCCL)
  • Code is well-documented:
  • For user-facing API changes, API doc string has been updated.
  • For new C++ functions in header files, their functionalities and arguments are documented.
  • For new examples, README.md is added to explain what the example does, the source of the dataset, expected performance on the test set, and a reference to the original paper if applicable
  • Check the API doc at http:https://mxnet-ci-doc.s3-accelerate.dualstack.amazonaws.com/PR-$PR_ID/$BUILD_ID/index.html
  • To the best of my knowledge, examples are either not affected by this change or have been fixed to be compatible with this change

Changes

  • Feature1, tests, (and when applicable, API doc)
  • Feature2, tests, (and when applicable, API doc)

Comments

  • If this change is a backward incompatible change, why must this change be made.
  • Interesting edge cases to note here

@TaoLv (Member, Author) commented Apr 27, 2019

@pengzhao-intel (Contributor) commented:
@TaoLv thanks for the PR.

Is there a test for the 1D softmax, and could you show the performance of the MKL-DNN primitive against the original implementation?

CPU Performance and Quantization automation moved this from Review in progress to Reviewer approved Apr 27, 2019
@pengzhao-intel (Contributor) left a comment:

LGTM :)

@TaoLv (Member, Author) commented Apr 27, 2019

Tests should be covered by
https://github.com/apache/incubator-mxnet/blob/master/tests/python/unittest/test_operator.py#L4697
and
https://github.com/apache/incubator-mxnet/blob/master/tests/cpp/operator/mkldnn_operator_test.cc#L1288.
I used the code snippet below for performance benchmarking:

import time
import mxnet as mx

def test_performance():
    shapes = [(1024,), (96, 512), (96, 128, 128), (96, 256, 256), (1, 8, 1024, 1024)]
    for sh in shapes:
        a = mx.nd.random.uniform(shape=sh)
        # warm up
        b = mx.nd.softmax(a, axis=-1)
        b.wait_to_read()

        tic = time.time()
        for i in range(1000):
            b = mx.nd.softmax(a, axis=-1)
            b.wait_to_read()
        toc = time.time()

        # average latency per call, in milliseconds
        print("softmax %s, take %f ms" % (sh, (toc - tic)/1000*1000.0))

Some performance numbers are as follows.
Baseline: mxnet==1.5.0b20190426

softmax (1024,), take 0.103340 ms
softmax (96, 512), take 0.127465 ms
softmax (96, 128, 128), take 1.655400 ms
softmax (96, 256, 256), take 6.369653 ms
softmax (1, 8, 1024, 1024), take 11.450656 ms

This PR with MKL-DNN backend:

softmax (1024,), take 0.062743 ms
softmax (96, 512), take 0.104104 ms
softmax (96, 128, 128), take 0.385350 ms
softmax (96, 256, 256), take 0.463220 ms
softmax (1, 8, 1024, 1024), take 1.704757 ms
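
As a side note, a correctness check could accompany a benchmark like the one above; the snippet below compares the MXNet result against a NumPy reference softmax. It is an illustrative addition (the tolerances are assumptions), not part of the original measurement.

import mxnet as mx
import numpy as np

def numpy_softmax(x, axis=-1):
    # numerically stable reference softmax
    x = x - x.max(axis=axis, keepdims=True)
    e = np.exp(x)
    return e / e.sum(axis=axis, keepdims=True)

a = mx.nd.random.uniform(shape=(96, 128, 128))
b = mx.nd.softmax(a, axis=-1)
np.testing.assert_allclose(b.asnumpy(), numpy_softmax(a.asnumpy()), rtol=1e-5, atol=1e-6)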

@TaoLv changed the title from "Support 3D input for MKL-DNN softmax operator" to "[WIP] Support 3D input for MKL-DNN softmax operator" on Apr 28, 2019
@TaoLv (Member, Author) commented Apr 28, 2019

Pending the MKL-DNN submodule update (#14783) for better performance.

@TaoLv (Member, Author) commented May 4, 2019

Fall back for all softmax operations when axis != last dimension, because that case is not optimized in MKL-DNN; see the short illustration below.
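
As an illustration of the fallback described above, the call below reduces over a non-last axis, which per this comment is handled by the default CPU implementation rather than MKL-DNN. The transpose-based rewrite is shown only as a hypothetical last-axis equivalent, not something required by this PR.

import mxnet as mx

a = mx.nd.random.uniform(shape=(96, 128, 128))

# axis=1 is not the last dimension, so (per this PR) this call falls back
# to the default CPU softmax implementation.
b = mx.nd.softmax(a, axis=1)

# Hypothetical last-axis equivalent: move the reduction axis to the end,
# run softmax, then move it back.
c = mx.nd.transpose(mx.nd.softmax(mx.nd.transpose(a, axes=(0, 2, 1)), axis=-1), axes=(0, 2, 1))

print(mx.nd.abs(b - c).max().asscalar())  # expected to be ~0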

@pengzhao-intel (Contributor) commented:

@TaoLv I have merged MKL-DNN 0.19; please rebase the code and see if everything is OK :)

@TaoLv changed the title from "[WIP] Support 3D input for MKL-DNN softmax operator" to "Support 3D input for MKL-DNN softmax operator" on May 16, 2019
@pengzhao-intel (Contributor) commented:

@TaoLv please rebase and retrigger again; the CI issue is fixed now.

@pengzhao-intel (Contributor) commented:
Merging now :) Thanks for your contribution.

@pengzhao-intel pengzhao-intel merged commit 8d6ac4a into apache:master May 17, 2019
CPU Performance and Quantization automation moved this from Reviewer approved to Done May 17, 2019
haohuanw pushed a commit to haohuanw/incubator-mxnet that referenced this pull request Jun 23, 2019
* add 3d softmax

* fix

* handle req type

* clean code

* remove check

* check axis

* retrigger ci