
[MRG+2] Modification of GaussianMixture class. #7123

Merged: 5 commits into scikit-learn:master on Aug 10, 2016

Conversation

tguillemot
Copy link
Contributor

This PR is to simplify the integration of the BayesianGaussianMixture class (#6651).
I've simplified the GaussianMixture class by integrating a function which computes the determinant of the Cholesky decomposition of the precision matrix (which will be useful for BayesianGaussianMixture).
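For illustration only, a minimal sketch of such a helper for the 'full' covariance type, assuming each precision matrix factors as L @ L.T with L lower triangular (the function name and the convention are assumptions of this sketch, not the exact scikit-learn internals):

import numpy as np

def log_det_precision_cholesky(precisions_chol):
    """Log-determinant of each Cholesky factor of the precision matrices.

    precisions_chol : array, shape (n_components, n_features, n_features)
        Lower-triangular factors L_k with precision_k = L_k @ L_k.T
        (illustrative convention).
    """
    # The determinant of a triangular matrix is the product of its diagonal,
    # so its log-determinant is the sum of the logs of the diagonal entries.
    return np.sum(np.log(np.diagonal(precisions_chol, axis1=1, axis2=2)), axis=1)

# Toy usage: one component with identity precision, log-det of its factor is 0.
chol = np.linalg.cholesky(np.eye(3))[np.newaxis, :, :]
print(log_det_precision_cholesky(chol))  # [0.]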

I've also corrected a bug in the EM process: normally, the lower bound should be computed after the M-step, not after the E-step. It's not a problem for GaussianMixture, but it creates problems for BayesianGaussianMixture.
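To make the ordering concrete, here is a self-contained toy 1-D EM loop (illustrative only, not scikit-learn code; the "bound" is simply the mean log-likelihood here): the value used for the convergence check is assigned only after the M-step, so it always refers to the freshly updated parameters.

import numpy as np
from scipy.stats import norm

rng = np.random.RandomState(0)
X = np.concatenate([rng.normal(-2.0, 1.0, 200), rng.normal(3.0, 1.0, 200)])

weights = np.array([0.5, 0.5])
means = np.array([-1.0, 1.0])
stds = np.array([1.0, 1.0])

lower_bound = -np.inf
for n_iter in range(100):
    prev_lower_bound = lower_bound

    # E-step: responsibilities under the current parameters.
    weighted = weights * norm.pdf(X[:, None], means, stds)
    resp = weighted / weighted.sum(axis=1, keepdims=True)

    # M-step: update the parameters from the responsibilities.
    nk = resp.sum(axis=0)
    weights = nk / X.shape[0]
    means = (resp * X[:, None]).sum(axis=0) / nk
    stds = np.sqrt((resp * (X[:, None] - means) ** 2).sum(axis=0) / nk)

    # The bound (mean log-likelihood of the updated model) is computed
    # after the M-step, mirroring the ordering fix described above.
    weighted = weights * norm.pdf(X[:, None], means, stds)
    lower_bound = np.log(weighted.sum(axis=1)).mean()

    if abs(lower_bound - prev_lower_bound) < 1e-6:
        break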

@agramfort @ogrisel @amueller Can you have a look?

The purpose here is to prepare the integration of BayesianGaussianMixture.
for init in range(n_init):
    self._print_verbose_msg_init_beg(init)

    if do_init:
        self._initialize_parameters(X)
    current_log_likelihood, resp = self._e_step(X)
    self.lower_bound_ = np.infty
Member

shouldn't you document the new attribute?

Contributor Author

Indeed, sorry for that mistake.

Member

I didn't know np.infty also returns np.inf... Interesting...

Contributor

Shouldn't it be self.lower_bound_ = -np.infty?

Contributor

It has been merged, but I re-ask the question: shouldn't it be -np.infty?

Contributor Author

@ngoix Sorry, indeed I forgot that. I don't know why I missed your comment, sorry about that.
I've fixed it in #7180.
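For context, the sign matters because the best run over the n_init initializations is selected by keeping the maximum lower bound, and that running maximum has to start at -inf so that the first finite value always replaces it. A toy sketch (the helper name and signature are hypothetical, not the scikit-learn code):

import numpy as np

def best_of_inits(run_one_init, n_init=5):
    """Keep the result of the initialization with the highest lower bound.

    run_one_init is assumed to return a (lower_bound, params) pair for a
    single initialization; both the name and signature are hypothetical.
    """
    max_lower_bound = -np.inf  # starting at +inf would never be improved upon
    best_params = None
    for _ in range(n_init):
        lower_bound, params = run_one_init()
        if lower_bound > max_lower_bound:
            max_lower_bound, best_params = lower_bound, params
    return max_lower_bound, best_params

# Toy usage with a fake "EM run" that just draws a random score.
rng = np.random.RandomState(0)
print(best_of_inits(lambda: (rng.randn(), None)))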

@agramfort
Member

that's it for me

@tguillemot tguillemot changed the title Modification of GaussianMixture class. [MRG] Modification of GaussianMixture class. Aug 1, 2016
@tguillemot tguillemot changed the title [MRG] Modification of GaussianMixture class. [MRG+1] Modification of GaussianMixture class. Aug 2, 2016
@tguillemot
Contributor Author

@amueller @ogrisel @raghavrv If you have time to do another review :)

@@ -136,7 +136,7 @@ def _initialize_parameters(self, X):
        ----------
        X : array-like, shape (n_samples, n_features)
        """
        n_samples = X.shape[0]
        n_samples, _ = X.shape
Member

Why this change?

Member

Sorry if this was suggested before...

Contributor Author
tguillemot commented Aug 2, 2016

It's just that I prefer it like this, and it also checks that the shape of X has exactly two dimensions.
Sorry for these little modifications.

Member

Okay. Thx!

Member

it also checks implicitly that X.ndim == 2. I do the same in my code.
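As a small standalone illustration of that implicit check (not code from the PR):

import numpy as np

X = np.arange(6.0).reshape(3, 2)
n_samples, _ = X.shape         # fine: X is 2-D, n_samples == 3

X_1d = np.arange(3.0)
try:
    n_samples, _ = X_1d.shape  # 1-D input: the unpacking fails
except ValueError as exc:
    print(exc)                 # not enough values to unpack (expected 2, got 1)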

@@ -563,6 +553,9 @@ class GaussianMixture(BaseMixture):

    n_iter_ : int
        Number of step used by the best fit of EM to reach the convergence.

    lower_bound_ : float
Member

lower_bound_ -> best_log_likelihood_ ?

Contributor Author

In fact, for GMM this is the best log-likelihood, but for VBGMM it is the lower bound.
I've chosen lower_bound_ because it is the most understandable name for both.

Contributor Author

Done.
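For reference, a sketch of how the attribute entry might read in the class docstring; the wording is illustrative, not the exact text that was merged:

class GaussianMixture:
    """(docstring excerpt)

    Attributes
    ----------
    lower_bound_ : float
        Lower bound value on the log-likelihood of the best fit of EM.
        For plain GaussianMixture this is the best log-likelihood itself;
        for the variational (Bayesian) variant it is the variational
        lower bound.
    """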

@tguillemot
Contributor Author

@amueller @ogrisel I really need this to be merged before you review #6651.

@tguillemot
Contributor Author

@ngoix This PR is related to the Bayesian Gaussian Mixture; could you review it if you have some time?
Thx in advance :)

@agramfort
Member

one more +1 and we're good here...


for n_iter in range(self.max_iter):
    prev_log_likelihood = current_log_likelihood
    prev_log_likelihood = self.lower_bound_
Contributor

prev_log_likelihood -> prev_lower_bound ?

@ngoix
Contributor

ngoix commented Aug 8, 2016

While testing the code, I realized that there is a problem which was here before this PR:
in gaussian_mixture._set_parameters(), self.covariances_ is not updated. As a result, with n_init > 1 the covariances returned do not correspond to the means.


    def _estimate_log_weights(self):
        return np.log(self.weights_)

    def _compute_lower_bound(self, _, log_prob_norm):
        return log_prob_norm

Contributor

In _set_parameters below, compute self.covariances_ from self.precisions_.
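A minimal sketch of how the covariances could be recovered from the precision Cholesky factors for the 'full' covariance type, assuming the convention precision_k = L_k @ L_k.T with L_k lower triangular (the function name and the convention are assumptions of this sketch, not the scikit-learn internals):

import numpy as np
from scipy import linalg

def covariances_from_precisions_cholesky(precisions_chol):
    """Recover full covariance matrices from Cholesky factors of the precisions."""
    covariances = np.empty_like(precisions_chol)
    for k, prec_chol in enumerate(precisions_chol):
        # covariance = inv(precision) = inv(L @ L.T) = inv(L).T @ inv(L)
        inv_l = linalg.solve_triangular(
            prec_chol, np.eye(prec_chol.shape[0]), lower=True)
        covariances[k] = inv_l.T @ inv_l
    return covariances

# Toy check: for a diagonal precision, the covariance is its element-wise inverse.
prec = np.diag([4.0, 1.0])
chol = np.linalg.cholesky(prec)[np.newaxis]
print(covariances_from_precisions_cholesky(chol))  # [[[0.25, 0.], [0., 1.]]]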

@ngoix
Contributor

ngoix commented Aug 8, 2016

That's all from me!

@tguillemot
Contributor Author

@ngoix Indeed, you're right; I'd missed that. Thanks.
I've corrected it and added a test to check it.

@tguillemot
Contributor Author

@agramfort merge?

@tguillemot tguillemot changed the title [MRG+1] Modification of GaussianMixture class. [MRG+2] Modification of GaussianMixture class. Aug 8, 2016
@tguillemot
Contributor Author

@amueller Can you merge, please?

@agramfort agramfort merged commit 65b7d7a into scikit-learn:master Aug 10, 2016
@agramfort
Member

Thanks @tguillemot

@tguillemot
Contributor Author

@agramfort @ngoix Thanks

TomDLT pushed a commit to TomDLT/scikit-learn that referenced this pull request Oct 3, 2016
* Modification of GaussianMixture class.

The purpose here is to prepare the integration of BayesianGaussianMixture.

* Fix comments.

* Modification of the Docstring.

* Add license and author.

* Fix review and add tests for init.