Set static alloc for quantized models #755

xinyu-intel · 2019-04-24T07:47:21Z

When gluon model hybridize with static_shape=True, static_alloc=True, cached_op with static mode will be used. For this situation, we should try to cache operator state for better performance. This PR is to enable this feature along with MXNet #14785 to speed up gluon inference speed, especially for small batch sizes.

mli · 2019-04-24T08:25:55Z

Job PR-755-1 is done.
Docs are uploaded to http:https://gluon-vision-staging.s3-website-us-west-2.amazonaws.com/PR-755/1/index.html
Code coverage of this PR: vs. Master:

zhreshold · 2019-04-24T18:07:45Z

I'll wait for apache/mxnet#14785

zhreshold · 2019-04-26T17:53:22Z

merged together with apache/mxnet#14785

pengzhao-intel · 2019-04-27T07:36:13Z

@xinyu-intel do we need to update the performance in the tutorial?

xinyu-intel · 2019-04-27T10:40:34Z

@pengzhao-intel Throughput has a little bit improvement. Plan to update them along with some other models and waiting for 2nd gen Xeon online.

xinyu-intel added 2 commits April 24, 2019 15:39

set static alloc when quantized

ebfcf25

Merge remote-tracking branch 'origin/master' into static_alloc

dc9b4c9

xinyu-intel requested a review from zhreshold April 24, 2019 07:47

xinyu-intel mentioned this pull request Apr 25, 2019

Improve cached_op performance for static mode apache/mxnet#14785

Merged

7 tasks

zhreshold merged commit 59d5b74 into dmlc:master Apr 26, 2019

xinyu-intel added this to Done in CPU Performance and Quantization Oct 24, 2019

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Set static alloc for quantized models #755

Set static alloc for quantized models #755

xinyu-intel commented Apr 24, 2019

mli commented Apr 24, 2019

zhreshold commented Apr 24, 2019

zhreshold commented Apr 26, 2019

pengzhao-intel commented Apr 27, 2019

xinyu-intel commented Apr 27, 2019

Set static alloc for quantized models #755

Set static alloc for quantized models #755

Conversation

xinyu-intel commented Apr 24, 2019

mli commented Apr 24, 2019

zhreshold commented Apr 24, 2019

zhreshold commented Apr 26, 2019

pengzhao-intel commented Apr 27, 2019

xinyu-intel commented Apr 27, 2019