This repository has been archived by the owner on Nov 17, 2023. It is now read-only.

Making MKL-DNN default on MXNet master #13681

Merged: 116 commits merged into apache:master from the mkldnn-default branch on Jan 3, 2019

Conversation

mseth10
Contributor

@mseth10 mseth10 commented Dec 19, 2018

Description

This PR continues the work of PR #13464 and aims to make MKL-DNN the default on MXNet master for Linux systems with Intel/AMD (x86_64) processors.

Checklist

Essentials

Please feel free to remove inapplicable items for your PR.

  • Changes are complete (i.e. I finished coding on this PR)
  • All changes have test coverage:
  • Unit tests are added for small changes to verify correctness (e.g. adding a new operator)
  • Nightly tests are added for complicated/long-running ones (e.g. changing distributed kvstore)
  • Build tests will be added for build configuration changes (e.g. adding a new build option with NCCL)
  • Code is well-documented:
  • For user-facing API changes, API doc string has been updated.
  • For new C++ functions in header files, their functionalities and arguments are documented.
  • For new examples, README.md is added to explain what the example does, the source of the dataset, the expected performance on the test set, and a reference to the original paper if applicable
  • Check the API doc at http://mxnet-ci-doc.s3-accelerate.dualstack.amazonaws.com/PR-$PR_ID/$BUILD_ID/index.html
  • To my best knowledge, examples are either not affected by this change, or have been fixed to be compatible with this change

Changes

  • Feature1, tests, (and when applicable, API doc)
  • Feature2, tests, (and when applicable, API doc)

Comments

  • If this change is a backward incompatible change, why must this change be made.
  • Interesting edge cases to note here

@mseth10
Contributor Author

mseth10 commented Dec 26, 2018

@xinyu-intel No, we do not need to modify the USE_MKLDNN flag in the config.mk file. USE_MKLDNN=1 is set by default for select kernels/processors in the Makefile and in the CMake build.
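(For illustration, a minimal CMake sketch of this kind of conditional default, modeled on the option condition shown in the diff later in this thread; the actual build uses MXNet's mxnet_option helper, and the Makefile applies an analogous OS/architecture check.)

# Sketch: enable MKL-DNN by default only for native, non-Apple, non-MSVC x86_64 builds.
if(USE_MKL_IF_AVAILABLE AND (NOT APPLE) AND (NOT MSVC)
   AND (CMAKE_SYSTEM_PROCESSOR MATCHES x86_64))
  option(USE_MKLDNN "Use MKLDNN variant of MKL (if MKL found)" ON)  # user can still pass -DUSE_MKLDNN=OFF
else()
  set(USE_MKLDNN OFF)  # MKL-DNN stays off on other platforms
endif()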

Member

@TaoLv TaoLv left a comment

Looks good to me now. Thanks for your effort, @mseth10. It seems there is a problem with CI. Do you mind taking a look at the failure?

@pengzhao-intel
Contributor

@mseth10 @azai91 could you rebase and pass the CI? This will be the final step before the merge :)
Thanks for all of your efforts.

Contributor

@pengzhao-intel pengzhao-intel left a comment

LGTM

@azai91
Contributor

azai91 commented Dec 29, 2018

mseth is out for the next two weeks, so I am continuing his PR in #13744. @pengzhao-intel @TaoLv please review that version (all I did was retrigger, as it fails on what I believe is a flaky test in the windows-gpu stage, documented here: #13743).

@pengzhao-intel
Contributor

@azai91 could you ask @mseth10 to add you to the collaborator list so that we can continue working in this PR?

https://help.github.com/articles/inviting-collaborators-to-a-personal-repository/

@azai91
Contributor

azai91 commented Dec 29, 2018

That would be ideal. It would be a lot cleaner to just retrigger on this PR than to start a new one just for one additional commit. However, seth is out in India without his laptop until 1/11. I tried to find a way to push directly onto his repo / retrigger from the Jenkins GUI, to no avail. @sandeep-krishnamurthy, since you are a committer, do you have permission to retrigger the last commit on this PR from the Jenkins GUI?

@mseth10
Contributor Author

mseth10 commented Dec 29, 2018

@azai91 I have added you as a collaborator to my mxnet repo. You should be able to re-trigger now. Sorry for the delayed response.
FYI, committers have the power to re-trigger a particular CI stage (e.g. windows gpu) from the GUI. That would be a lot faster. Can you please re-trigger that failed stage, @nswamy @sandeep-krishnamurthy?

@TaoLv
Member

TaoLv commented Jan 2, 2019

@mseth10 @azai91 Thank you. Now this PR looks good to me. Before merging, I want to double check with @szha and @marcoabreu whether we need to adjust the nightly build release process for this change. I assume that pip install mxnet==1.5.0b20190103 will have USE_MKLDNN=0, while pip install mxnet-mkl==1.5.0b20190103 will have USE_MKLDNN=1.

https://pypi.org/project/mxnet/#history
https://pypi.org/project/mxnet-mkl/#history

@azai91
Contributor

azai91 commented Jan 2, 2019

It is explicitly turned off (https://www.dropbox.com/s/fk2jipmiivfjog0/Screenshot%202019-01-02%2010.11.43.png?dl=0).

@pengzhao-intel
Contributor

Thanks for the info, @azai91. I think the PR is good to merge.

@TaoLv @sandeep-krishnamurthy could you help merge the code?

@TaoLv
Member

TaoLv commented Jan 3, 2019

@azai91 Thank you for the information.

@TaoLv TaoLv merged commit 855f8b9 into apache:master Jan 3, 2019
@pengzhao-intel
Contributor

Added the next step here: #12591 (comment)

@@ -20,7 +20,7 @@ mxnet_option(USE_F16C "Build with x86 F16C instruction support" ON)
 mxnet_option(USE_LAPACK "Build with lapack support" ON)
 mxnet_option(USE_MKL_IF_AVAILABLE "Use MKL if found" ON)
 mxnet_option(USE_MKLML_MKL "Use MKLDNN variant of MKL (if MKL found)" ON IF USE_MKL_IF_AVAILABLE AND (NOT APPLE))
-mxnet_option(USE_MKLDNN "Use MKLDNN variant of MKL (if MKL found)" ON IF USE_MKL_IF_AVAILABLE AND (NOT APPLE))
+mxnet_option(USE_MKLDNN "Use MKLDNN variant of MKL (if MKL found)" ON IF USE_MKL_IF_AVAILABLE AND (NOT APPLE) AND (NOT MSVC) AND (CMAKE_SYSTEM_PROCESSOR MATCHES x86_64))
Contributor

Actually, CMAKE_SYSTEM_PROCESSOR will not work for cross compilation. You could reuse the variable CMAKE_CROSSCOMPILING for this, as shown here.

Contributor Author

Do you suggest we check for
SYSTEM_ARCHITECTURE STREQUAL "x86_64" AND NOT CMAKE_CROSSCOMPILING
instead of
CMAKE_SYSTEM_PROCESSOR MATCHES x86_64?

Also, will CMAKE_HOST_SYSTEM_PROCESSOR MATCHES x86_64 help in case of cross compilation?

Contributor

Yes, I think SYSTEM_ARCHITECTURE STREQUAL "x86_64" AND NOT CMAKE_CROSSCOMPILING should work. But you need to add the trick for CMAKE_CROSSCOMPILING from here as well:

# workaround to store CMAKE_CROSSCOMPILING because is getting reset by the project command
if(CMAKE_CROSSCOMPILING)
  set(__CMAKE_CROSSCOMPILING ${CMAKE_CROSSCOMPILING})
  set(__CMAKE_CROSSCOMPILING_OVERRIDE ON)
endif()

project(mxnet C CXX)

if(__CMAKE_CROSSCOMPILING_OVERRIDE)
  set(CMAKE_CROSSCOMPILING ${__CMAKE_CROSSCOMPILING})
endif()

Contributor Author

@mseth10 mseth10 Jan 17, 2019

@lebeg I checked; CMAKE_SYSTEM_PROCESSOR works for cross compilation. CMAKE_CROSSCOMPILING is TRUE only for the ARM v6, v7, and v8 builds, and in all these cases CMAKE_SYSTEM_PROCESSOR is set and CMAKE_SYSTEM_PROCESSOR MATCHES x86_64 returns FALSE. Hence, I don't think any change is needed. Please correct me if I am missing anything.
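(To make this concrete, a minimal sketch under the assumption of a typical ARM cross build driven by a CMake toolchain file; these are not the exact files used in MXNet's CI.)

# Hypothetical toolchain file, passed via -DCMAKE_TOOLCHAIN_FILE=arm-toolchain.cmake:
#   set(CMAKE_SYSTEM_NAME Linux)
#   set(CMAKE_SYSTEM_PROCESSOR armv7-a)  # setting CMAKE_SYSTEM_NAME also makes CMAKE_CROSSCOMPILING TRUE
#
# In the top-level CMakeLists.txt, the option condition then behaves as follows:
if(CMAKE_SYSTEM_PROCESSOR MATCHES x86_64)
  message(STATUS "Native x86_64 build: MKL-DNN defaults to ON")
else()
  message(STATUS "Cross build for ${CMAKE_SYSTEM_PROCESSOR}: the x86_64 match fails, so USE_MKLDNN stays OFF")
endif()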

Contributor

Note the semantics of mxnet_option: if the condition is not satisfied, the option is forced off, rather than merely having its default value set to off.
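(For readers unfamiliar with this pattern, a hedged sketch of how such an option macro typically behaves; this is an illustration only, and MXNet's actual mxnet_option implementation may differ in detail.)

macro(sketch_option variable description value)
  set(__condition ${ARGN})
  list(REMOVE_ITEM __condition "IF")        # drop the IF keyword, keep the condition itself
  if("${__condition}" STREQUAL "")
    set(__condition 2 GREATER 1)            # no condition supplied: treat it as always true
  endif()
  if(${__condition})
    option(${variable} "${description}" ${value})  # condition holds: a normal cached option the user can override
  else()
    unset(${variable} CACHE)
    set(${variable} OFF)                    # condition fails: the option is forced OFF, ignoring any user value
  endif()
endmacro()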

@aaronmarkham
Contributor

Does this update have anything to do with this nightly test failure (cmake + mkldnn)?

rondogency pushed a commit to rondogency/incubator-mxnet that referenced this pull request Jan 9, 2019
* mkldnn is default in makefile and explicitly turned off for builds

* add endif

* retrigger

* retrigger

* build mkldnn as static lib

* update makefile to statically build mkldnn

* build static mkldnn

* fix static name

* fix static name

* update static for mac

* rename mkldnn dep in ci

* remove moving mkldnn dynamic lib

* retrigger

* remove commented code

* retrigger

* remove mkldnn dynamic for unittest

* retrigger

* retrigger

* force static for mkldnn lib

* turn off mkldnn on arm builds

* remove dynamic mkldnn bind

* update jenkins to use only mkldnn

* remove last flag

* turn on mkldnn by default on mac

* move mkldnn files for GPU MKLDNN build

* copy lib mxnet in gpu build

* only link windows

* add mkldnn.mk

* try force linking

* retrigger

* retrigger

* remove mkldnn dynamic check

* use ifndef

* remove test mkldnn install

* fix spacing

* fix index

* remove cp of mkldnn since statically linked

* add libmkldnn.a to list of files to pack

* include mkl_ml

* add mkldnn to pack

* add libiomp to ci pack

* move static libs

* fix typo

* pack mkldnn

* retrigger

* add linux artifacts

* move libmkldnn in gpu cmake build

* move libmkldnn and libiomp5 on gpu workspace

* move linked files

* fix typo

* fix typo

* add artifacts for tensorrt

* move mkldnn lib in scala build

* move mkldnn lib on cpu scala

* create dir for binding

* rename libmkldnn in scala

* move mklml dep in scala builds

* move mkl to another linked folder

* move libmkl to another dir

* add libmklml

* move mkldnn

* move mkldnn on centos

* specify new dynamic path

* retrigger

* remove mkldnn dynamic lib

* remove moving mkldnn artifact

* add ld path

* retrigger

* Revert "remove moving mkldnn artifact"

This reverts commit 16cca19.

* Revert "remove mkldnn dynamic lib"

This reverts commit d510436.

* update makefile

* Revert RPATH change and trigger CI

* correcting use-mkldnn flags for two tests

* mkldnn default on linux for starters

* reverting naming rules of pack_lib

* adding mkldnn=0 flags to centos non mkldnn builds

* adding mkldnn=0 flags to ubuntu gpu non mkldnn builds

* removing mkldnn binary operation for ubuntu gpu cmake non mkldnn build

* removing mkldnn binary operation for centos non-mkldnn unittest

* adding explicit USE_MKLDNN=0 flags for clang builds

* adding explicit USE_MKLDNN=0 flags for cpu ubuntu builds

* removing mkldnn binaries from non mkldnn builds scala gpu

* adding explicit flag mkldnn=0 for tensorrt gpu build

* adding explicit flag mkldnn=0 for ubuntu cmake asan

* adding centos cpu mkldnn tests to CI

* adding CentOS GPU MKLDNN build and unittest

* not keeping mkldnn default for mac os

* setting mkldnn default for x86_64 only

* running docs with mkldnn=0 flag

* removing CentOS CPU Scala MKLDNN test

* setting mkldnn default for x86_64 only

* not making mkldnn default on windows

* removing Centos MKLDNN tests from CI

* retrigger

* retrigger

* retrigger
haohuanw pushed a commit to haohuanw/incubator-mxnet that referenced this pull request Jun 23, 2019
@mseth10 mseth10 deleted the mkldnn-default branch June 1, 2020 10:46