Implement remaining nn_basic ops in opperf #17456

connorgoggins · 2020-01-28T19:34:02Z

Description

This PR serves to implement the remaining operators from the nn_basic category in opperf.

Checklist

Essentials

Please feel free to remove inapplicable items for your PR.

Changes are complete (i.e. I finished coding on this PR)
All changes have test coverage
Code is well-documented
To the best of my knowledge, examples are either not affected by this change, or have been fixed to be compatible with this change

Changes

M benchmark/opperf/nd_operations/nn_basic_operators.py
M benchmark/opperf/rules/default_params.py
M benchmark/opperf/utils/benchmark_utils.py
M benchmark/opperf/utils/op_registry_utils.py

Comments

Tested on c5.18xl-ubuntu 16.04 and Mac OS with:

run_performance_test on individual ops
Checkout branch and call function run_nn_basic_operators_benchmarks
Checkout branch and run opperf.py (full run of all ops)

@apeforest @access2rohit

ChaiBapchya

As discussed offline
Not a good practice to add duplicate (redundant code)
30-40% of the lines can be taken care of with a function. So lets get that across this entire file.

Thanks for the contribution again!

benchmark/opperf/utils/benchmark_utils.py

benchmark/opperf/rules/default_params.py

benchmark/opperf/utils/benchmark_utils.py

connorgoggins · 2020-01-30T23:42:15Z

Group of operator test - all NN Basic ops (GPU)
Group of operator test - all NN Basic ops (CPU)

Full OpPerf test (GPU)
Full OpPerf test (CPU)

*Note: couldn't run SoftmaxOutput op backwards on GPU - see previously documented issue and couldn't run im2col either forwards or backwards on GPU - see new documented issue. In both cases, simply switching the context from cpu to gpu leads the op to fail.

ChaiBapchya · 2020-01-30T23:58:56Z

I was able to run GPU tests with topk - https://gist.github.com/ChaiBapchya/fac7310f7d167d1361854451e7daa342

That shouldn't be a problem. Please confirm.

benchmark/opperf/nd_operations/nn_basic_operators.py

benchmark/opperf/utils/benchmark_utils.py

CONTRIBUTORS.md

benchmark/opperf/utils/op_registry_utils.py

access2rohit · 2020-01-31T00:32:17Z

Can we change the logic here to ctx == mx.cpu() or op not in gpu_disabled_ops for readability?

Address the quoted and profiler comment made by @apeforest. Rest LGTM !

connorgoggins · 2020-01-31T01:58:46Z

Updated Full OpPerf GPU Results with topk perf: https://gist.github.com/connorgoggins/2d4d2ff6dca61494eb8151a5106fec6c

ChaiBapchya · 2020-02-04T02:47:34Z

Incorrect way to rebase @connorgoggins

First get your master branch updated

git remote add upstream https://github.com/apache/incubator-mxnet
git checkout master
git fetch upstream master
git merge upstream/master
git push origin master

Once your fork master is sync'ed with remote master, rebase your branch on master

git checkout <branchname>
git rebase master
git push origin <branchname>

Let me know if this works!

connorgoggins · 2020-02-05T23:47:09Z

@mxnet-label-bot add [pr-awaiting-review]

benchmark/opperf/utils/op_registry_utils.py

access2rohit

lgtm!

apeforest

LGTM. Thanks for your contribution

* Added SoftmaxOutput * Added LinearRegressionOutput * Added other regression ops * Added SVMOutput * Added L2, layer and instance norm * gamma and beta to ndarray * Reworked layer/instance norm * Added Embedding * Disabled backward on embedding * Added Correlation * Added data1 and 2 to ndarray * Added SpatialTransformer * Made loc ndarray type * Run backward test * Added IdentityAttachKLSparseReg * Dropping grad * Added sparseness target * Added grad back * Disabling backward for IdentityAttachKLSparseReg * Trying to debug * Print problematic op * Another log * Removing IdentityAttachKLSparseReg test for now * Removed faulty test * Added im2col * Added col2im * Added GroupNorm * Added RNN * Added paramters and state to ndarray * Added LRN * Added preloaded_multi_mp_sgd_mom_update * Added lamb_update_phase1 * Added lamb_update_phase2 * Dropped reversal * Finalized nn basic ops * Cleaned up code for linter * Refactored individual tests into generalized framework * Refined logic, added default params * Fixed LRN param placement * Refactored default params for clarity * Fixed lint errors * Fixed BatchNorm issue * Removed debugging comment * Cleaned up indentation * Added axis param for LayerNorm op * Fixed loc param issues * Linked Embedding backward issue in run_performance_test * Disabling problematic runs on gpu * Added myself to CONTRIBUTORS.md * Addressed PR comments * Fixed DEFAULT_LABEL issue * Tightend up logic, established consistency with master * Fixed indent

ChaiBapchya suggested changes Jan 28, 2020

View reviewed changes

apeforest mentioned this pull request Jan 29, 2020

[mxnet 2.0] [item 2.4] Turning on large tensor support by default #17331

Open

ChaiBapchya reviewed Jan 30, 2020

View reviewed changes

benchmark/opperf/utils/benchmark_utils.py Outdated Show resolved Hide resolved

benchmark/opperf/rules/default_params.py Outdated Show resolved Hide resolved

ChaiBapchya reviewed Jan 30, 2020

View reviewed changes

benchmark/opperf/utils/benchmark_utils.py Outdated Show resolved Hide resolved

apeforest reviewed Jan 31, 2020

View reviewed changes

benchmark/opperf/nd_operations/nn_basic_operators.py Show resolved Hide resolved

apeforest reviewed Jan 31, 2020

View reviewed changes

benchmark/opperf/utils/benchmark_utils.py Outdated Show resolved Hide resolved

ChaiBapchya reviewed Jan 31, 2020

View reviewed changes

CONTRIBUTORS.md Show resolved Hide resolved

apeforest reviewed Jan 31, 2020

View reviewed changes

benchmark/opperf/utils/op_registry_utils.py Show resolved Hide resolved

connorgoggins requested review from aaronmarkham, anirudh2290, eric-haibin-lin, marcoabreu, nswamy and szha as code owners February 4, 2020 02:07

connorgoggins force-pushed the implement_nn_basic_ops branch 3 times, most recently from fbebf35 to f5279be Compare February 5, 2020 20:08

lanking520 added the pr-awaiting-review PR is waiting for code review label Feb 5, 2020

connorgoggins force-pushed the implement_nn_basic_ops branch from 48d0667 to 3a8fbc4 Compare February 6, 2020 18:41

ChaiBapchya reviewed Feb 6, 2020

View reviewed changes

benchmark/opperf/utils/op_registry_utils.py Outdated Show resolved Hide resolved

ChaiBapchya approved these changes Feb 6, 2020

View reviewed changes

connorgoggins force-pushed the implement_nn_basic_ops branch 2 times, most recently from 9970fb7 to a5889ae Compare February 10, 2020 22:18

access2rohit approved these changes Feb 11, 2020

View reviewed changes

connorgoggins added 25 commits February 17, 2020 12:28

Added paramters and state to ndarray

089ad8b

Added LRN

8641f9f

Added preloaded_multi_mp_sgd_mom_update

903c8dd

Added lamb_update_phase1

24859f4

Added lamb_update_phase2

e4e8f78

Dropped reversal

03b44ae

Finalized nn basic ops

4c3898f

Cleaned up code for linter

bf321cd

Refactored individual tests into generalized framework

f41bc8b

Refined logic, added default params

944b153

Fixed LRN param placement

5f4f639

Refactored default params for clarity

82b138a

Fixed lint errors

e9d7a38

Fixed BatchNorm issue

0e34266

Removed debugging comment

bf0d7d6

Added axis param for LayerNorm op

b74af9a

Fixed loc param issues

65b86a9

Linked Embedding backward issue in run_performance_test

ca9dcb3

Disabling problematic runs on gpu

82dd18e

Added myself to CONTRIBUTORS.md

87991c8

Addressed PR comments

72b857e

Cleaned up indentation

6d39346

Fixed DEFAULT_LABEL issue

8456520

Tightend up logic, established consistency with master

d836fc2

Fixed indent

e0dc303

connorgoggins force-pushed the implement_nn_basic_ops branch from bd8041a to e0dc303 Compare February 17, 2020 20:28

apeforest approved these changes Feb 19, 2020

View reviewed changes

apeforest merged commit c2aff58 into apache:master Feb 19, 2020

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Implement remaining nn_basic ops in opperf #17456

Implement remaining nn_basic ops in opperf #17456

connorgoggins commented Jan 28, 2020 •

edited

Loading

ChaiBapchya left a comment

connorgoggins commented Jan 30, 2020 •

edited

Loading

ChaiBapchya commented Jan 30, 2020

access2rohit commented Jan 31, 2020 •

edited

Loading

connorgoggins commented Jan 31, 2020

ChaiBapchya commented Feb 4, 2020

connorgoggins commented Feb 5, 2020

access2rohit left a comment

apeforest left a comment

Implement remaining nn_basic ops in opperf #17456

Implement remaining nn_basic ops in opperf #17456

Conversation

connorgoggins commented Jan 28, 2020 • edited Loading

Description

Checklist

Essentials

Changes

Comments

ChaiBapchya left a comment

Choose a reason for hiding this comment

connorgoggins commented Jan 30, 2020 • edited Loading

ChaiBapchya commented Jan 30, 2020

access2rohit commented Jan 31, 2020 • edited Loading

connorgoggins commented Jan 31, 2020

ChaiBapchya commented Feb 4, 2020

connorgoggins commented Feb 5, 2020

access2rohit left a comment

Choose a reason for hiding this comment

apeforest left a comment

Choose a reason for hiding this comment

connorgoggins commented Jan 28, 2020 •

edited

Loading

connorgoggins commented Jan 30, 2020 •

edited

Loading

access2rohit commented Jan 31, 2020 •

edited

Loading