Support L2 regularization & cross validation for Classifier #135
This PR adds an `l2_penalty` parameter to `Classifier.fit` with a default value of 0.1. This is a change from the previous behavior, where there was no penalty by default. I initially tried to exactly imitate the behavior of scikit-learn's `C` inverse regularization parameter, but I couldn't quite figure out how they compute the final loss. Based on their code, it seems like they sum the BCE loss over the samples rather than taking the average, which changes the scale of everything and makes it dependent on the number of samples. But that didn't give exactly the same results either, so I gave up on imitating it exactly; the tests only check that when `l2_penalty` is set to 0.0, the results are approximately the same.
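To make the scaling issue concrete, here's a minimal sketch of the loss as I understand it, next to my reading of scikit-learn's objective. The names (`w`, `logits`, `labels`) are illustrative, not the PR's actual code:

```python
import torch
import torch.nn.functional as F

def penalized_loss(logits, labels, w, l2_penalty=0.1):
    """Sketch: mean BCE plus an L2 penalty on the weights (not the PR's exact code)."""
    bce = F.binary_cross_entropy_with_logits(logits, labels)  # averaged over samples
    return bce + l2_penalty * w.square().sum()

# scikit-learn, as far as I can tell, effectively sums the BCE over samples and
# penalizes with 1/(2*C), so its objective scales with the sample count n:
#   sum_i BCE_i + (1 / (2 * C)) * ||w||^2
# Dividing through by n, that would roughly correspond to l2_penalty ≈ 1 / (2 * C * n),
# though I couldn't get the results to match exactly.
```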
This PR also adds a relatively well-optimized `fit_cv` method that uses warm-starting to get at least a 2x speedup over a naive approach where you start from a zero initialization every time. I initially wanted to parallelize this code over the folds, but that seemed like it would be a real pain that would complicate the code substantially, and I'm not sure there would be a significant speed boost at the end of the day (at least not without rewriting PyTorch's LBFGS optimizer, which I don't want to do right now).
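For reference, here's a rough sketch of the warm-starting idea; the actual `fit_cv` signature and fold logic almost certainly differ:

```python
import torch
import torch.nn.functional as F

def fit_cv_sketch(x, y, penalties=(1.0, 0.1, 0.01), n_folds=5):
    """Hypothetical warm-started CV. x: (n, d) floats, y: (n,) float 0/1 labels."""
    n, d = x.shape
    fold_ids = torch.arange(n) % n_folds  # simple unshuffled fold assignment
    scores = {p: 0.0 for p in penalties}

    for k in range(n_folds):
        train, val = fold_ids != k, fold_ids == k
        w = torch.zeros(d, requires_grad=True)  # zero init once per fold...
        for p in penalties:  # ...then warm-start across the penalty grid, strong to weak
            opt = torch.optim.LBFGS([w], max_iter=100)

            def closure():
                opt.zero_grad()
                loss = F.binary_cross_entropy_with_logits(
                    x[train] @ w, y[train]
                ) + p * w.square().sum()
                loss.backward()
                return loss

            opt.step(closure)
            with torch.no_grad():
                val_loss = F.binary_cross_entropy_with_logits(x[val] @ w, y[val])
            scores[p] += val_loss.item() / n_folds

    return min(scores, key=scores.get)  # penalty with the lowest mean val loss
```

The point is just that each fold solves the penalty grid in sequence, so each LBFGS run starts from the previous penalty's solution instead of from zero.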
There is a question of whether we want to use `fit_cv` by default in `train.py`. I think we probably should, but it does get us back into the territory where `Classifier` takes up more compute than the actual VINC algorithm. At the moment the code does use `fit_cv` and doesn't actually give you an option to turn this off (which should probably be changed).