This repository has been archived by the owner on Nov 17, 2023. It is now read-only.

Refactor ImageRecordIter #14824

Merged
merged 6 commits into apache:master from loader_cpu on May 5, 2019

Conversation

ZhennanQin
Contributor

Description

This PR brings the following changes to ImageRecordIter:

  • Add a new parameter dtype, making ImageRecordIter(dtype='uint8') equivalent to ImageRecordUInt8Iter.
  • Add a new optional parameter ctx, which indicates the device context the loader runs on. When ctx='cpu' is specified, a CPU-backend-optimized data loader is used.
  • Add a new CPU-backend-optimized implementation in which the data loader runs as an engine operator. Overall throughput improves, and the data-loading overhead can be profiled with the built-in profiler. A usage sketch follows this list.
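
A minimal usage sketch of the new parameters (the file path, data shape, and batch size below are placeholders; the profiler calls assume the standard mx.profiler API):

import mxnet as mx

# Hypothetical usage sketch: path_imgrec, data_shape, and batch_size are placeholders.
data_iter = mx.io.ImageRecordIter(
    path_imgrec='data/train.rec',   # placeholder RecordIO file
    data_shape=(3, 224, 224),
    batch_size=32,
    dtype='uint8',                  # new: behaves like ImageRecordUInt8Iter
    ctx='cpu')                      # new: selects the CPU-optimized backend

# Optionally profile the data-loading overhead with the built-in profiler.
mx.profiler.set_config(profile_all=True, filename='profile_dataloader.json')
mx.profiler.set_state('run')
for batch in data_iter:
    pass                            # consume batches as usual
mx.profiler.set_state('stop')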

@pengzhao-intel @TaoLv @xinyu-intel @anirudh2290

Checklist

Essentials

Please feel free to remove inapplicable items for your PR.

  • The PR title starts with [MXNET-$JIRA_ID], where $JIRA_ID refers to the relevant JIRA issue created (except PRs with tiny changes)
  • Changes are complete (i.e. I finished coding on this PR)
  • All changes have test coverage:
  • Unit tests are added for small changes to verify correctness (e.g. adding a new operator)
  • Nightly tests are added for complicated/long-running ones (e.g. changing distributed kvstore)
  • Build tests will be added for build configuration changes (e.g. adding a new build option with NCCL)
  • Code is well-documented:
  • For user-facing API changes, API doc string has been updated.
  • For new C++ functions in header files, their functionalities and arguments are documented.
  • For new examples, README.md is added to explain what the example does, the source of the dataset, expected performance on the test set, and a reference to the original paper if applicable
  • Check the API doc at http://mxnet-ci-doc.s3-accelerate.dualstack.amazonaws.com/PR-$PR_ID/$BUILD_ID/index.html
  • To my best knowledge, examples are either not affected by this change, or have been fixed to be compatible with this change

Changes

  • Feature1, tests, (and when applicable, API doc)
  • Feature2, tests, (and when applicable, API doc)

Comments

  • If this change is a backward incompatible change, why must this change be made.
  • Interesting edge cases to note here

@ZhennanQin ZhennanQin requested a review from szha as a code owner April 28, 2019 07:54
@pengzhao-intel pengzhao-intel added this to Review in progress in CPU Performance and Quantization Apr 28, 2019
Contributor

@pengzhao-intel pengzhao-intel left a comment


This change makes things more convenient for CPU users by avoiding the need to tune how many threads are used for data loading.

n_parsed_ = 0;
overflow = false;
rnd_.seed(kRandMagic + record_param_.seed);
int maxthread, threadget;
#pragma omp parallel
{
-  // be conservative, set number of real cores
-  maxthread = std::max(omp_get_num_procs() / 2 - 1, 1);
+  maxthread = std::max(omp_get_num_procs(), 1);
Contributor


Does this get the number of logical cores?

Contributor Author


Yes. I think we should let the user make the decision, just as with a normal operator. Before this change, the data loader would only use n/2 cores even with export OMP_NUM_THREADS=n; see the numeric sketch below.
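
A minimal numeric sketch of the effect, assuming omp_get_num_procs() reports 8 (mirroring the std::max expressions in the diff above):

num_procs = 8                              # assumed value of omp_get_num_procs()
old_threads = max(num_procs // 2 - 1, 1)   # previous cap: 3 threads
new_threads = max(num_procs, 1)            # new cap: 8 threads
print(old_threads, new_threads)            # prints: 3 8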

Contributor


Makes sense.

Member


Now that the iterator is pushed to the engine, will this OMP threading cause trouble?

@pengzhao-intel
Contributor

@wkcn could you help take a review?

CPU Performance and Quantization automation moved this from Review in progress to Reviewer approved Apr 29, 2019
Member

@wkcn wkcn left a comment


LGTM. Thanks for your contribution!

Could you please provide a performance comparison?

@pengzhao-intel
Contributor

@szha to confirm the API enhancement.

@roywei
Member

roywei commented Apr 29, 2019

@mxnet-label-bot add [Data-loading]

Contributor

@pengzhao-intel pengzhao-intel left a comment


LGTM

Let's wait a moment to see if there are other comments; otherwise, I will merge this PR within 24 hours.

@pengzhao-intel
Contributor

@xinyu-intel please help rebase the code and paste the performance data, as requested by @wkcn.

@szha szha requested a review from zhreshold May 1, 2019 20:28
Member

@zhreshold zhreshold left a comment


The wrapper looks neat.

@pengzhao-intel pengzhao-intel merged commit 621b391 into apache:master May 5, 2019
CPU Performance and Quantization automation moved this from Reviewer approved to Done May 5, 2019
access2rohit pushed a commit to access2rohit/incubator-mxnet that referenced this pull request May 14, 2019
* cpu optimized data loader

* Fix CI

* Fix CI

* Fix ci

* Fix doc
@ZhennanQin ZhennanQin deleted the loader_cpu branch May 31, 2019 02:07
haohuanw pushed a commit to haohuanw/incubator-mxnet that referenced this pull request Jun 23, 2019
* cpu optimized data loader

* Fix CI

* Fix CI

* Fix ci

* Fix doc