This repository has been archived by the owner on Nov 17, 2023. It is now read-only.

fallback to dense version for grad(reshape), grad(expand_dims) #13599

Merged
merged 5 commits on Dec 20, 2018

Conversation

yzhliu
Member

@yzhliu yzhliu commented Dec 10, 2018

Description

The current reshape implementation uses _backward_copy for its gradient calculation.

However, the sparse path of _backward_copy requires the input and output to have the same shape - which is reasonable, since sparsity is bound to the number of rows and columns.

And although reshape itself has no sparse version, it is easy to construct a network in which the backward input and output of reshape are sparse - see the test case.

We therefore need to provide a dense fallback for the backward computation of reshape, and likewise for expand_dims.
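The shape mismatch can be illustrated outside MXNet. A minimal sketch using scipy.sparse (a stand-in for MXNet's row-sparse storage, not the actual operator code): a row-sparse gradient is described by which rows are nonzero, so once a reshape changes the row count, that sparsity pattern no longer applies and the safe fallback is to densify before reshaping.

```python
import numpy as np
import scipy.sparse as sp

# Hypothetical illustration (not MXNet code): a row-sparse gradient of
# shape (4, 6) where only rows 0 and 2 carry nonzero values.
dense = np.zeros((4, 6))
dense[0] = 1.0
dense[2] = 2.0
grad = sp.csr_matrix(dense)

# Reshaping to (2, 12) changes the number of rows, so the row-wise
# sparsity pattern of the original gradient cannot describe the result.
# The fallback is to convert to dense first, then reshape.
fallback = grad.toarray().reshape(2, 12)
print(fallback.shape)  # (2, 12)
```

This mirrors the situation in the PR: the backward pass may receive a sparse gradient even though reshape itself has no sparse implementation, so the dense path must be taken explicitly.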

@eric-haibin-lin @zheng-da

Checklist

Essentials

Please feel free to remove inapplicable items for your PR.

  • The PR title starts with [MXNET-$JIRA_ID], where $JIRA_ID refers to the relevant JIRA issue created (except PRs with tiny changes)
  • Changes are complete (i.e. I finished coding on this PR)
  • All changes have test coverage:
  • Unit tests are added for small changes to verify correctness (e.g. adding a new operator)
  • Nightly tests are added for complicated/long-running ones (e.g. changing distributed kvstore)
  • Build tests will be added for build configuration changes (e.g. adding a new build option with NCCL)
  • Code is well-documented:
  • For user-facing API changes, API doc string has been updated.
  • For new C++ functions in header files, their functionalities and arguments are documented.
  • For new examples, README.md is added to explain what the example does, the source of the dataset, expected performance on the test set, and a reference to the original paper if applicable
  • Check the API doc at https://mxnet-ci-doc.s3-accelerate.dualstack.amazonaws.com/PR-$PR_ID/$BUILD_ID/index.html
  • To my best knowledge, examples are either not affected by this change, or have been fixed to be compatible with this change

.set_attr<FResourceRequest>("FResourceRequest", [](const NodeAttrs& n) {
return std::vector<ResourceRequest>{ResourceRequest::kTempSpace};
})
#endif
Contributor

Does MKL-DNN support reshape?

If you want to optimize for MKL-DNN, you should add FComputeEx.

Contributor

FYI, #12980 enabled the MKL-DNN-supported forward pass of reshape, but the backward pass is still WIP @huangzhiyuan

Member Author

Then I guess we need to remove this for now.

@zheng-da
Contributor

Otherwise, it looks good to me.

BTW, I don't think it has anything to do with sparse.

[](const NodeAttrs& attrs){
return std::vector<std::pair<int, int> >{{0, 0}};
})
.set_attr<FInferStorageType>("FInferStorageType", ElemwiseStorageType<1, 1, false, false, false>)
Member

If an op only supports dense tensors and FCompute, FInferStorageType is not needed.

Member Author

updated, thanks!

@eric-haibin-lin eric-haibin-lin merged commit 59f4395 into apache:master Dec 20, 2018
rondogency pushed a commit to rondogency/incubator-mxnet that referenced this pull request Jan 9, 2019
…e#13599)

* fallback to dense version for grad(reshape), grad(expand_dims)

* add _backward_reshape gpu version

* reshape test case comments

* fix gpu test

* remove mkldnn support for _backward_reshape
haohuanw pushed a commit to haohuanw/incubator-mxnet that referenced this pull request Jun 23, 2019

4 participants