Add matrix inversion operator in linalg #14963

arcadiaphy · 2019-05-15T11:23:30Z

As title.

Checklist

Essentials

Please feel free to remove inapplicable items for your PR.

The PR title starts with [MXNET-$JIRA_ID], where $JIRA_ID refers to the relevant JIRA issue created (except PRs with tiny changes)
Changes are complete (i.e. I finished coding on this PR)
All changes have test coverage:
Unit tests are added for small changes to verify correctness (e.g. adding a new operator)
Nightly tests are added for complicated/long-running ones (e.g. changing distributed kvstore)
Build tests will be added for build configuration changes (e.g. adding a new build option with NCCL)
Code is well-documented:
For user-facing API changes, API doc string has been updated.
For new C++ functions in header files, their functionalities and arguments are documented.
For new examples, README.md is added to explain what the example does, the source of the dataset, expected performance on test set and reference to the original paper if applicable
Check the API doc at http://mxnet-ci-doc.s3-accelerate.dualstack.amazonaws.com/PR-$PR_ID/$BUILD_ID/index.html
To the best of my knowledge, examples are either not affected by this change, or have been fixed to be compatible with this change

Changes

Feature1, tests, (and when applicable, API doc)
Feature2, tests, (and when applicable, API doc)

Comments

If this change is a backward incompatible change, why must this change be made.
Interesting edge cases to note here

larroy · 2019-05-16T00:55:37Z

src/operator/linalg_impl.h

+struct set_matrix : public mxnet::op::mxnet_op::tunable {
+ template<typename DType>
+ MSHADOW_XINLINE static void Map(int i, DType **p, DType *m, int step) {
+ p[i] = m + i * step;


Right, I'll fix it.
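
For readers following the thread, here is a minimal host-side sketch of what the `set_matrix` kernel quoted above computes. It is not code from the PR: `make_matrix_pointers` is a hypothetical name, and the motivation is an assumption, namely that batched cuBLAS routines such as `cublasSgetrfBatched` take an array of pointers with one entry per matrix, so a contiguous batch needs such a table built first.

```cpp
#include <cstddef>
#include <vector>

// Hypothetical host-side analogue of set_matrix (illustration only).
// Entry i of the returned table points at the i-th matrix of a contiguous
// batch in which consecutive matrices are `step` elements apart.
template <typename DType>
std::vector<DType*> make_matrix_pointers(DType* base, std::size_t batch,
                                         std::size_t step) {
  std::vector<DType*> ptrs(batch);
  for (std::size_t i = 0; i < batch; ++i) {
    ptrs[i] = base + i * step;  // same arithmetic as p[i] = m + i * step
  }
  return ptrs;
}
```

For a batch of two 3x3 matrices stored back to back, `step` would be 9 and the table would hold `base` and `base + 9`.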

larroy

Nice stuff!

larroy · 2019-05-16T00:56:23Z

src/operator/linalg_impl.h

+LINALG_CPU_GETRF(dgetrf, double)
+
+#ifdef __CUDACC__
+


Could you add a comment explaining what this does and why the matrix is filled this way?

larroy · 2019-05-16T00:59:45Z

src/operator/linalg_impl.h

+ DType **A_ptr = static_cast<DType **>(A_ptr_buf.dptr); \
+ const Tensor<gpu, 3, DType> temp(work.dptr_, A.shape_, s); \
+ int *pivot = reinterpret_cast<int *>(temp.dptr_ + temp.shape_.Size()); \
+ int *info = pivot + A.size(0) * A.size(1); \


What happens on int32 overflow?

Pivot values lie in [0, matrix_dim), and I don't think it is possible to create a square matrix whose dimension overflows int32.

I think you know much more about this, but as I understand it, A is the input matrix, which gets overwritten by its LU factorization, no? My question was what happens if the product of A.size(0) and A.size(1) overflows. This can happen if both are bigger than 2^16, unless I'm mistaken. I have seen this bug in other places, before we call BLAS, and it was nasty.

I tried to write an overflow case, but it always fails the size checks in Tensor or Blob. I think that is the right behavior: an ndarray whose size overflows should fail up front and never reach the code above.
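
The concern in this exchange can be illustrated with a small sketch. It is not code from the PR, and `checked_mul_int32` is a hypothetical helper: it multiplies two int32 values in 64-bit arithmetic, where the product cannot overflow, and reports whether the result fits back into int32. Whether the operands in the snippet above are actually int32 depends on how the index type is configured in the build, which this thread does not state.

```cpp
#include <cstdint>
#include <limits>

// Hypothetical helper (illustration only): multiply two int32 values
// without signed overflow. Returns true and stores the product in *out
// if it fits in int32; returns false and leaves *out untouched otherwise.
bool checked_mul_int32(std::int32_t a, std::int32_t b, std::int32_t* out) {
  // Widen first: |a * b| is at most 2^62, so the 64-bit product is exact.
  const std::int64_t wide =
      static_cast<std::int64_t>(a) * static_cast<std::int64_t>(b);
  if (wide > std::numeric_limits<std::int32_t>::max() ||
      wide < std::numeric_limits<std::int32_t>::min()) {
    return false;  // product does not fit in int32
  }
  *out = static_cast<std::int32_t>(wide);
  return true;
}
```

With this helper, 65536 * 65536 (2^16 * 2^16) is rejected, while 1000 * 1000 is accepted. Rejecting oversized shapes before they reach the pointer arithmetic matches the behavior described in the last reply.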

larroy · 2019-05-16T01:00:31Z

src/operator/linalg_impl.h

+ DType **B_ptr = static_cast<DType **>(B_ptr_buf.dptr); \
+ Tensor<gpu, 3, DType> temp(work.dptr_, A.shape_, s); \
+ int *pivot = reinterpret_cast<int *>(temp.dptr_ + temp.shape_.Size()); \
+ int *info = pivot + A.size(0) * A.size(1); \


Same comment as above.

larroy · 2019-05-16T01:13:54Z

src/operator/tensor/la_op.cc

+.add_argument("A", "NDArray-or-Symbol", "Tensor of square matrix");
+
+NNVM_REGISTER_OP(_backward_linalg_inverse)
+.set_num_inputs(3)


Why do we have 3 inputs?

Because I use ElemwiseGradUseInOut, so the 3 inputs are out_grad, input, output. Actually, input is not used in computing in_grad, so I'll change it to ElemwiseGradUseOut.

Makes sense. It looked strange to me.
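
A short derivation shows why the forward input is not needed for the gradient. This is the standard matrix-calculus identity, not code taken from the PR, and the symbols are chosen here for illustration: $B = A^{-1}$ is the operator output, $\bar{B}$ is the incoming out_grad, and $\bar{A}$ is the resulting in_grad for a scalar loss $L$.

```latex
\begin{aligned}
AB = I \;&\Rightarrow\; dA\,B + A\,dB = 0 \;\Rightarrow\; dB = -B\,dA\,B, \\
dL = \operatorname{tr}\!\left(\bar{B}^{\top} dB\right)
   &= -\operatorname{tr}\!\left(B\,\bar{B}^{\top} B\,dA\right)
   \;\Rightarrow\; \bar{A} = -B^{\top}\,\bar{B}\,B^{\top}.
\end{aligned}
```

The final expression involves only the output $B$ and the incoming gradient $\bar{B}$, which is consistent with registering the backward pass with two inputs instead of three.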

arcadiaphy · 2019-05-20T13:38:56Z

I'll merge this PR now and create another PR on matrix determinant, which depends on this one.

* add inverse cpu
* add comment
* add inverse backward cpu
* add inverse gpu
* able to compile
* fix
* fix
* guard for lower version cuda
* update docs
* update docs
* fix misaligned memory
* add test
* fix lint
* fix android
* fix indent
* change transfer gradient
* fix
* refactor test
* delete unnecessary copy
* trigger CI
* fix test

arcadiaphy added 12 commits May 13, 2019 19:46

add inverse cpu

e0e6694

add comment

bb0e0c4

add inverse backward cpu

8beee0e

add inverse gpu

d142477

able to compile

bb49adb

fix

7b45f60

fix

77cba1a

guard for lower version cuda

decc6af

update docs

b031614

update docs

fa8e1ed

fix misaligned memory

cccef9e

add test

470ba36

arcadiaphy requested a review from szha as a code owner May 15, 2019 11:23

arcadiaphy mentioned this pull request May 15, 2019

Improve linear algebra functions #14962

Open

arcadiaphy added 2 commits May 15, 2019 19:54

fix lint

39551bc

fix android

10a5aa6

larroy reviewed May 16, 2019


arcadiaphy added 4 commits May 16, 2019 13:50

fix indent

8d16468

change transfer gradient

889fc6e

fix

808104d

refactor test

d647bfa

arcadiaphy force-pushed the pr_linalg branch from e728d22 to d647bfa Compare May 16, 2019 09:46

arcadiaphy added 3 commits May 17, 2019 02:10

delete unnecessary copy

9616d87

trigger CI

09d8125

fix test

412a181

arcadiaphy merged commit 3cbfe48 into apache:master May 20, 2019

arcadiaphy deleted the pr_linalg branch May 20, 2019 13:40

arcadiaphy mentioned this pull request May 21, 2019

Add matrix determinant operator in linalg #15007

Merged

7 tasks

arcadiaphy mentioned this pull request Jul 17, 2019

supporting matrix inversion and determinant #14360

Open
