peformance issue #4048

kernel8liang · 2016-12-01T07:55:23Z

I compiled mxnet with MXNET_USE_CUDA=1, sometimes I didn't use gpu, in this situation it made me to wait almost 2 minutes to watch the iterations running, compare with before I didn't use MXNET_USE_CUDA=1.

debuged around, found in src/kvstore/comm.h

   Comm() {
     pinned_ctx_ = (MXNET_USE_CUDA != 0) ? Context::CPUPinned(0) : Context::CPU();
   }

Comm() use MXNET_USE_CUDA a compile macro to get the context, which is called from _initialize_kvstore in model.py, finally, it case to active gpu, this made me to wait a long time(8 Tesla M40 crads on board) even i didn't use gpu.

I think, it's better to determine pinned_ctx by a runtime variable like a command argument, rather than a compile macro.

The text was updated successfully, but these errors were encountered:

mli · 2017-01-05T21:11:27Z

should be fixed by #4550

ChaiBapchya · 2019-08-02T03:26:42Z

@szha @mli since #4550 has merged is this good to close?
OR
@nswamy is this pending?

mli added the enhancement label Dec 2, 2016

mli self-assigned this Dec 2, 2016

pono unassigned mli Jul 28, 2017

szha removed Feature labels Nov 14, 2018

nswamy added the Feature request label Nov 14, 2018

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

peformance issue #4048

peformance issue #4048

kernel8liang commented Dec 1, 2016 •

edited

Loading

mli commented Jan 5, 2017

ChaiBapchya commented Aug 2, 2019

peformance issue #4048

peformance issue #4048

Comments

kernel8liang commented Dec 1, 2016 • edited Loading

mli commented Jan 5, 2017

ChaiBapchya commented Aug 2, 2019

kernel8liang commented Dec 1, 2016 •

edited

Loading