
add platt scaling for burns + fix for leace #288

Merged 6 commits into main on Aug 28, 2023

Conversation

lauritowal (Collaborator)

  • Fixes Platt scaling for LEACE in CCS, where the Platt scaling parameters were being multiplied into and added onto the raw_scores even during reporter training.

  • Adds Platt scaling when using Burns normalization (a minimal sketch of what Platt scaling does is shown below).
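
For context, here is a minimal sketch of the Platt-scaling step: a learnable scale and bias applied to the reporter's raw scores. The class and parameter names are illustrative rather than the exact elk implementation.

import torch
import torch.nn as nn

class PlattScaler(nn.Module):
    """Learnable affine recalibration of raw reporter scores (illustrative, not the exact elk API)."""

    def __init__(self):
        super().__init__()
        self.scale = nn.Parameter(torch.ones(1))
        self.bias = nn.Parameter(torch.zeros(1))

    def forward(self, raw_scores: torch.Tensor) -> torch.Tensor:
        # Same affine form as in the CCS reporter: raw_scores * scale + bias,
        # which the caller feeds into a sigmoid / BCE-with-logits loss.
        return raw_scores.mul(self.scale).add(self.bias)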

@lauritowal marked this pull request as ready for review August 23, 2023 15:20

elk/training/train.py (outdated)
rearrange(first_train_h, "n v k d -> (n v k) d"),
)
labels = to_one_hot(train_gt, k)
labels = repeat(train_gt, "n -> n v k", v=v, k=k)
Collaborator

Why do we repeat the labels along the choices dimension? I think this means all entries of the one-hot prediction are expected to have the same value.

CCS only supports k=2, so we can just deal with that case after asserting it to be true.

I think this line should be labels = to_one_hot(repeat(train_gt, "n -> (n v)", v=v), 2).flatten()

lauritowal (Collaborator, Author) Aug 23, 2023

labels = to_one_hot(repeat(train_gt, "n -> (n v)", v=v), 2).flatten()
That wouldn't work, since the dimensions have to be the same for the labels and first_train_h, which is a 3D tensor in the case of CCS. But yeah, let me see what I can do instead.

@@ -88,6 +88,8 @@ def __init__(
num_variants: int = 1,
):
super().__init__()
self._is_training = True
Collaborator

If self._is_training is False during the call to platt_scale, then it won't work, because the Platt scaling parameters need to be used in the forward pass in order to be updated in the backward pass.

Collaborator

Does it not?

if self._is_training:
    return raw_scores
else:
    platt_scaled_scores = raw_scores.mul(self.scale).add(self.bias).squeeze(-1)
    return platt_scaled_scores

Anyway, this seems simple to test by comparing the results with and without the Platt scaling params added.
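
To make the concern concrete, here is a small self-contained check (illustrative names, not the actual elk code): a parameter that is bypassed in the forward pass receives no gradient, so it can never be updated.

import torch
import torch.nn as nn

scale = nn.Parameter(torch.ones(1))
bias = nn.Parameter(torch.zeros(1))
raw_scores = torch.randn(8, requires_grad=True)
labels = torch.randint(0, 2, (8,)).float()

def loss_fn(scores):
    return nn.functional.binary_cross_entropy_with_logits(scores, labels)

# Case 1: the loss only sees the raw scores, so scale/bias get no gradients.
loss_fn(raw_scores).backward()
print(scale.grad, bias.grad)  # None None

# Case 2: the loss sees the Platt-scaled scores, so both parameters get gradients.
loss_fn(raw_scores.mul(scale).add(bias)).backward()
print(scale.grad, bias.grad)  # two non-None tensors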

elk/training/train.py (outdated)
rearrange(first_train_h, "n v k d -> (n v k) d"),
)
labels = to_one_hot(train_gt, k)
labels = repeat(train_gt, "n -> n v k", v=v, k=k)
derpyplops (Collaborator) Aug 23, 2023

  • 🔴 This implementation is incorrect.

    Suggested change:
    - labels = repeat(train_gt, "n -> n v k", v=v, k=k)
    + labels = to_one_hot(repeat(train_gt, "n -> n v", v=v), k)

  • Yours replicates the label values in both the second and third dimensions.

  • The corrected one first replicates the label values in the second dimension, then converts them into a one-hot representation in the third dimension (see the shape check below).
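
A quick shape check of the two constructions, with torch.nn.functional.one_hot standing in for elk's to_one_hot and made-up tensor values:

import torch
from einops import repeat

n, v, k = 3, 2, 2
train_gt = torch.tensor([0, 1, 1])  # hypothetical ground-truth labels

# Original line: the label value is copied along the k (choice) dimension,
# so every position along k holds the same value instead of a one-hot vector.
wrong = repeat(train_gt, "n -> n v k", v=v, k=k)
print(wrong[1, 0])  # tensor([1, 1]) -- not one-hot

# Suggested change: repeat across variants first, then one-hot over choices.
right = torch.nn.functional.one_hot(repeat(train_gt, "n -> n v", v=v), k)
print(right[1, 0])  # tensor([0, 1]) -- one-hot along the last dimension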

@@ -88,6 +88,8 @@ def __init__(
num_variants: int = 1,
):
super().__init__()
self._is_training = True
Collaborator

🟡 I think this approach basically works, but perhaps the wrapper approach that the EigenFitter/Reporter uses is cleaner.

Collaborator

In fact, maybe we should just do that and make CcsReporter.fit() return a Reporter. We might need to add a param that disables the eraser.

lauritowal (Collaborator, Author)

Yeah, I removed _is_training again.

> In fact, maybe we should just do that and make CcsReporter.fit() return a Reporter. We might need to add a param that disables the eraser.

We can take a look at that. Could be a new pull request or the same one...

AlexTMallen (Collaborator) left a comment

lgtm

rearrange(first_train_h, "n v k d -> (n v k) d"),
)
labels = to_one_hot(train_gt, k)
labels = repeat(labels, "n k -> n v k", v=v)
Collaborator

🟢 nitpick: bugs like the one made here are the reason I dislike reassigning things to the same variable. Either the name should be changed:
one_hotted_labels = to_one_hot(train_gt, k)
or it should just be inlined
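
For illustration, here are the two refactors the nitpick suggests, with a local stand-in for elk's to_one_hot and hypothetical shapes; both produce the same (n, v, k) labels tensor as the merged code.

import torch
from einops import repeat

n, v, k = 3, 2, 2
train_gt = torch.tensor([0, 1, 1])  # hypothetical ground-truth labels

def to_one_hot(labels: torch.Tensor, n_classes: int) -> torch.Tensor:
    # Local stand-in for elk's helper of the same name.
    return torch.nn.functional.one_hot(labels, n_classes)

# Option 1: give the intermediate result its own name.
one_hotted_labels = to_one_hot(train_gt, k)               # (n, k)
labels = repeat(one_hotted_labels, "n k -> n v k", v=v)   # (n, v, k)

# Option 2: inline the call so `labels` is assigned exactly once.
labels = repeat(to_one_hot(train_gt, k), "n k -> n v k", v=v)
assert labels.shape == (n, v, k)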

@lauritowal requested review from norabelrose and removed the request for norabelrose August 27, 2023 22:19
@lauritowal merged commit 4a6b654 into main Aug 28, 2023
6 checks passed
@lauritowal deleted the fix-platt-scaling-ccs branch August 28, 2023 13:19