Use In-place operator to prevent memory spikes in optimizer updates #13960

anirudhacharya · 2019-01-22T21:06:42Z

Description

The update rules in Nadam, Adadelta, Adamax and SGLD optimizers have been changed to using in-place operators to prevent memory spikes during execution.

Checklist

Essentials

Please feel free to remove inapplicable items for your PR.

The PR title starts with [MXNET-$JIRA_ID], where $JIRA_ID refers to the relevant JIRA issue created (except PRs with tiny changes)
Changes are complete (i.e. I finished coding on this PR)
All changes have test coverage:
Unit tests are added for small changes to verify correctness (e.g. adding a new operator)
Nightly tests are added for complicated/long-running ones (e.g. changing distributed kvstore)
Build tests will be added for build configuration changes (e.g. adding a new build option with NCCL)
Code is well-documented:
For user-facing API changes, API doc string has been updated.
For new C++ functions in header files, their functionalities and arguments are documented.
For new examples, README.md is added to explain the what the example does, the source of the dataset, expected performance on test set and reference to the original paper if applicable
Check the API doc at http:https://mxnet-ci-doc.s3-accelerate.dualstack.amazonaws.com/PR-$PR_ID/$BUILD_ID/index.html
To the my best knowledge, examples are either not affected by this change, or have been fixed to be compatible with this change

anirudhacharya · 2019-01-22T21:20:12Z

Some statistics from profiler.dump() on running with an mnist example.

Adadelta Old

Profile Statistics.
	Note that counter items are counter values and not time units.
Device Storage
=================
Name                          Total Count        Time (ms)    Min Time (ms)    Max Time (ms)    Avg Time (ms)
----                          -----------        ---------    -------------    -------------    -------------
Memory: cpu/0                        1908         627.2000           0.0000        2007.0400        1003.5200

Adadelta New

Profile Statistics.
	Note that counter items are counter values and not time units.
Device Storage
=================
Name                          Total Count        Time (ms)    Min Time (ms)    Max Time (ms)    Avg Time (ms)
----                          -----------        ---------    -------------    -------------    -------------
Memory: cpu/0                        1764         627.2000           0.0000        1606.1440         803.0720

Adamax Old

Profile Statistics.
	Note that counter items are counter values and not time units.
Device Storage
=================
Name                          Total Count        Time (ms)    Min Time (ms)    Max Time (ms)    Avg Time (ms)
----                          -----------        ---------    -------------    -------------    -------------
Memory: cpu/0                        1728         627.2000           0.0000        2009.6000        1004.8000

Adamax New

Profile Statistics.
	Note that counter items are counter values and not time units.
Device Storage
=================
Name                          Total Count        Time (ms)    Min Time (ms)    Max Time (ms)    Avg Time (ms)
----                          -----------        ---------    -------------    -------------    -------------
Memory: cpu/0                        1656         627.2000           0.0000        1606.6560         803.3280

anirudhacharya · 2019-01-22T21:22:02Z

@mxnet-label-bot add [pr-awaiting-review]

@szha @eric-haibin-lin

anirudhacharya · 2019-01-25T08:18:59Z

For 5 batches of training with a deep embeddings example the decrease in memory consumption is ~0.27 factor for Nesterov Adam optimizer.

Old Nadam -

New Nadam( with in-place operators) -

vandanavk

LGTM

vandanavk · 2019-02-05T22:46:57Z

@mxnet-label-bot update [pr-awaiting-merge]

ankkhedia · 2019-02-14T23:51:54Z

@sandeep-krishnamurthy Could you please review/merge this PR?

anirudhacharya · 2019-02-15T01:31:22Z

Thanks for merging and thanks to @szha for the tip here - #13683 (comment)

in place updates

6775340

anirudhacharya requested a review from eric-haibin-lin as a code owner January 22, 2019 21:06

marcoabreu added the pr-awaiting-review PR is waiting for code review label Jan 22, 2019

eric-haibin-lin approved these changes Jan 25, 2019

View reviewed changes

vandanavk approved these changes Feb 5, 2019

View reviewed changes

marcoabreu added pr-awaiting-merge Review and CI is complete. Ready to Merge and removed pr-awaiting-review PR is waiting for code review labels Feb 5, 2019

eric-haibin-lin merged commit a4e249b into apache:master Feb 15, 2019

anirudhacharya deleted the opt_mem branch February 15, 2019 01:29

stephenrawls pushed a commit to stephenrawls/incubator-mxnet that referenced this pull request Feb 16, 2019

In-place updates for Nadam, Adadelta, Adamax and SGLD (apache#13960)

f51c8cf

jessr92 pushed a commit to jessr92/incubator-mxnet that referenced this pull request Feb 19, 2019

In-place updates for Nadam, Adadelta, Adamax and SGLD (apache#13960)

6059a4a

drivanov pushed a commit to drivanov/incubator-mxnet that referenced this pull request Mar 4, 2019

In-place updates for Nadam, Adadelta, Adamax and SGLD (apache#13960)

f39148c

vdantu pushed a commit to vdantu/incubator-mxnet that referenced this pull request Mar 31, 2019

In-place updates for Nadam, Adadelta, Adamax and SGLD (apache#13960)

88af4fe

haohuanw pushed a commit to haohuanw/incubator-mxnet that referenced this pull request Jun 23, 2019

In-place updates for Nadam, Adadelta, Adamax and SGLD (apache#13960)

7874b31

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Use In-place operator to prevent memory spikes in optimizer updates #13960

Use In-place operator to prevent memory spikes in optimizer updates #13960

anirudhacharya commented Jan 22, 2019 •

edited

Loading

anirudhacharya commented Jan 22, 2019 •

edited

Loading

anirudhacharya commented Jan 22, 2019

anirudhacharya commented Jan 25, 2019 •

edited

Loading

vandanavk left a comment

vandanavk commented Feb 5, 2019

ankkhedia commented Feb 14, 2019

anirudhacharya commented Feb 15, 2019

Use In-place operator to prevent memory spikes in optimizer updates #13960

Use In-place operator to prevent memory spikes in optimizer updates #13960

Conversation

anirudhacharya commented Jan 22, 2019 • edited Loading

Description

Checklist

Essentials

anirudhacharya commented Jan 22, 2019 • edited Loading

Adadelta Old

Adadelta New

Adamax Old

Adamax New

anirudhacharya commented Jan 22, 2019

anirudhacharya commented Jan 25, 2019 • edited Loading

vandanavk left a comment

Choose a reason for hiding this comment

vandanavk commented Feb 5, 2019

ankkhedia commented Feb 14, 2019

anirudhacharya commented Feb 15, 2019

anirudhacharya commented Jan 22, 2019 •

edited

Loading

anirudhacharya commented Jan 22, 2019 •

edited

Loading

anirudhacharya commented Jan 25, 2019 •

edited

Loading