The saved MFK result takes too much space #149

xuzhengChai · 2019-06-05T14:18:06Z

Hi

In case:
3000 sampling points
8 features
using MFK to create surrogate model
e.g.
from smt.extensions import MFK
sm_MFK = MFK(theta0=numpy.ones(8), eval_noise=True, noise0=5)
sm_MFK.set_training_values(X, Y)
sm_MFK.train()

The saved MFK model (sm_MFK) will take around 0.4GB, which is far beyond my expectation.

So is it possible to reduce the taken memory?

Thanks in advance!

relf · 2019-06-13T08:43:13Z

How do you save the MFK model?

xuzhengChai · 2019-06-13T15:47:50Z

I used dill.dump to save it.
I tried to use pickle.dump, but got PicklingError: Can't pickle <class 'function'>: attribute lookup function on builtins failed

relf · 2019-06-14T16:10:10Z

Ok. I've just fixed the pickle problem with #154, but I do not think it will solve your problem.
Taking a look at the MFK code, it seems you can "nonify" the D_all instance member which is not used in prediction, like this:

sm_MFK.D_all = None
dill.dump(...)

It should decrease the size of the model.

xuzhengChai · 2019-06-16T19:38:38Z

Great! Thank you vary much!

By the way, is it also possible to improve the prediction speed?
Still the same surrogate mode, it takes around 440 seconds to predict 1200000 times. And these 1200000 poitns have to be divided into e.g. 100 groups to do prediction separately, otherwise it will raise a MemoryError.

I built the same model using GaussianProcessRegressor in scikit-learn, which takes only 90s to run 1200000 times of prediction, but the model fitting time is too much (~40 minutes), while the fitting time in MFK is only 2 mins!!!

Since I will use surrogate in Bayesian inference, which requires hundreds of thousands of predictions and I also have over one hundred surrogate model. The reduction of both fitting time and prediction time makes sense to me.

Thanks in advance!!!!

relf · 2019-06-17T08:13:54Z

Well... Feel free to make a pull request if you have a way to improve the current implementation. For the meantime, I close the issue.

relf closed this as completed Jun 17, 2019

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

The saved MFK result takes too much space #149

The saved MFK result takes too much space #149

xuzhengChai commented Jun 5, 2019

relf commented Jun 13, 2019

xuzhengChai commented Jun 13, 2019

relf commented Jun 14, 2019

xuzhengChai commented Jun 16, 2019

relf commented Jun 17, 2019

The saved MFK result takes too much space #149

The saved MFK result takes too much space #149

Comments

xuzhengChai commented Jun 5, 2019

relf commented Jun 13, 2019

xuzhengChai commented Jun 13, 2019

relf commented Jun 14, 2019

xuzhengChai commented Jun 16, 2019

relf commented Jun 17, 2019