[OpPerf] Fixed Python profiler bug #17642

connorgoggins · 2020-02-20T23:35:25Z

Description

Currently, the Python profiler for the opperf utility is broken. This fix changes the way args are passed to the underlying op during testing.

Fixes #17640

Checklist

Essentials

Changes are complete (i.e. I finished coding on this PR)
All changes have test coverage
Code is well-documented
To the best of my knowledge, examples are either not affected by this change, or have been fixed to be compatible with this change

Changes

M benchmark/opperf/utils/profiler_utils.py

Comments

Test suite was run with Python profiler on Mac OS X, building the latest version of MXNet (with my fix) from source.

Full OpPerf test suite - CPU (Native profiler)

Full OpPerf test suite - CPU (Python profiler)

@apeforest

connorgoggins · 2020-02-20T23:36:46Z

@mxnet-label-bot add [pr-awaiting-review]

apeforest · 2020-02-21T00:46:22Z

benchmark/opperf/utils/profiler_utils.py

@@ -248,12 +248,11 @@ def python_profile(func):
 @functools.wraps(func)
 def python_profile_it(*args, **kwargs):
 runs = args[1]
- modified_args = (args[0], 1, args[2])


I don't remember the reason why we need to write this way in the first place. @ChaiBapchya could you please review. @connorgoggins Have you run through existing tests to make sure this does not break any existing usage?

@apeforest yes, ran full OpPerf suite with Python profiler with this change and everything passed (see results here).

So the args that are passed to this function are
args[0] = op
args[1] = warmup / runs (number of times to run for warmup or number of times to run)
args[2] - rest of the args

https://github.com/apache/incubator-mxnet/blob/9dcf71d8fe33f77ed316a95fcffaf1f7f883ff70/benchmark/opperf/utils/benchmark_utils.py#L114

https://github.com/apache/incubator-mxnet/blob/9dcf71d8fe33f77ed316a95fcffaf1f7f883ff70/benchmark/opperf/utils/benchmark_utils.py#L121

The way it worked for native MXNet CPP profiler is that you could pass the runs (and it would capture the time for each value along with mean/max, etc)

But for Python's time it function, we had to manually run the for loop for the number of runs.
So that's what I did there

we copy the number of runs in a variable in run and then run it that many number of times

For each run, we use python time it function to time it and then take average, mean, max, etc values for each of those individual python time runs.

Makes sense? @apeforest

@ChaiBapchya So basically you are saying we don't need to do this modified_args for python profiler, right? So @connorgoggins change is valid?

We do need to modify the args to meet the requirement for capturing per run timing info using python's time_it function. This needs to handled in a way that doesn't break existing native profiler.

ChaiBapchya · 2020-02-21T01:17:53Z

preloaded and multi_* ops aren't being tracked for some reason. could you fix that too? @connorgoggins

apeforest · 2020-02-21T19:01:49Z

preloaded and multi_* ops aren't being tracked for some reason. could you fix that too? @connorgoggins

Let's fix that in a separate PR.

ChaiBapchya · 2020-02-21T19:24:59Z

benchmark/opperf/utils/profiler_utils.py

 times = []

 for _ in range(runs):
 start_time = time.perf_counter() # 1
- res = func(*modified_args, **kwargs)


@connorgoggins @apeforest
If we pass the *args as is, it will still have
args[0] as op
args[1] as runs
For eg if user passed runs as 10

So the native profiler would run 10 times and so will the for loop run 10 times (for timing Python profiler)

Coz the func here is nd_forward_backward_profile or nd_forward_profile (both take runs as a parameter)

ChaiBapchya · 2020-02-21T19:37:17Z

benchmark/opperf/utils/profiler_utils.py

@@ -248,7 +248,7 @@ def python_profile(func):
 @functools.wraps(func)
 def python_profile_it(*args, **kwargs):
 runs = args[1]
- modified_args = (args[0], 1, args[2])
+ modified_args = (args[0], 1)


Now it looks good!

ChaiBapchya

Looks good now! Thanks for the fix!

* Changed arg structure in op func call * Length check to prevent index out of bounds error * Dropping args[2] as it is no longer used (only using kwargs)

lanking520 added the pr-awaiting-review PR is waiting for code review label Feb 20, 2020

connorgoggins mentioned this pull request Feb 20, 2020

[OPPERF] opperf error out if I use python timer instead of the builtin mxnet profiler #17640

Closed

apeforest reviewed Feb 21, 2020

View reviewed changes

apeforest approved these changes Feb 21, 2020

View reviewed changes

ChaiBapchya reviewed Feb 21, 2020

View reviewed changes

ChaiBapchya approved these changes Feb 21, 2020

View reviewed changes

connorgoggins added 3 commits February 24, 2020 09:50

Changed arg structure in op func call

db46553

Length check to prevent index out of bounds error

9cbf836

Dropping args[2] as it is no longer used (only using kwargs)

850260a

connorgoggins force-pushed the opperf_python_timer_fix branch from c4711c4 to 850260a Compare February 24, 2020 17:51

apeforest merged commit 1906eff into apache:master Feb 26, 2020

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[OpPerf] Fixed Python profiler bug #17642

[OpPerf] Fixed Python profiler bug #17642

connorgoggins commented Feb 20, 2020 •

edited

Loading

connorgoggins commented Feb 20, 2020

apeforest Feb 21, 2020

connorgoggins Feb 21, 2020 •

edited

Loading

ChaiBapchya Feb 21, 2020 •

edited

Loading

apeforest Feb 21, 2020

ChaiBapchya Feb 21, 2020

ChaiBapchya commented Feb 21, 2020

apeforest commented Feb 21, 2020

ChaiBapchya Feb 21, 2020

ChaiBapchya Feb 21, 2020

ChaiBapchya left a comment

[OpPerf] Fixed Python profiler bug #17642

[OpPerf] Fixed Python profiler bug #17642

Conversation

connorgoggins commented Feb 20, 2020 • edited Loading

Description

Checklist

Essentials

Changes

Comments

connorgoggins commented Feb 20, 2020

apeforest Feb 21, 2020

Choose a reason for hiding this comment

connorgoggins Feb 21, 2020 • edited Loading

Choose a reason for hiding this comment

ChaiBapchya Feb 21, 2020 • edited Loading

Choose a reason for hiding this comment

apeforest Feb 21, 2020

Choose a reason for hiding this comment

ChaiBapchya Feb 21, 2020

Choose a reason for hiding this comment

ChaiBapchya commented Feb 21, 2020

apeforest commented Feb 21, 2020

ChaiBapchya Feb 21, 2020

Choose a reason for hiding this comment

ChaiBapchya Feb 21, 2020

Choose a reason for hiding this comment

ChaiBapchya left a comment

Choose a reason for hiding this comment

connorgoggins commented Feb 20, 2020 •

edited

Loading

connorgoggins Feb 21, 2020 •

edited

Loading

ChaiBapchya Feb 21, 2020 •

edited

Loading