Add recursive CCS #57

Closed
wants to merge 7 commits into from

Conversation

FabienRoger (Collaborator)

No description provided.

and data[0].dtype == data[1].dtype == self.dtype
), "Data must be a tuple of two tensors of the same shape and dtype"

def correct_dtypes(
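
The diff truncates the helper here. A plausible completion, inferred from the assertion above (the body is a guess, not the PR's actual code):

def correct_dtypes(self, data):
    # Cast both halves of the contrast pair to the probe's dtype so the
    # linear probe sees matching input and weight dtypes.
    x0, x1 = data
    return x0.to(self.dtype), x1.to(self.dtype)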
Member

Why is this necessary at all? I don't see why we need to cast to a single dtype

Collaborator Author

When removing this cast I get:

Traceback (most recent call last):
  File "/home/ubuntu/elk/elk/extensions/recursive_ccs/train.py", line 110, in <module>
    train(args)
  File "/home/ubuntu/elk/elk/extensions/recursive_ccs/train.py", line 74, in train
    probe, train_loss = rccs.fit_next_probe(
  File "/home/ubuntu/elk/elk/extensions/recursive_ccs/rccs.py", line 33, in fit_next_probe
    train_loss = ccs.fit(data, **train_params)
  File "/home/ubuntu/elk/elk/training/ccs.py", line 142, in fit
    loss = self.train_loop_lbfgs(x0, x1, num_epochs, weight_decay)
  File "/home/ubuntu/elk/elk/training/ccs.py", line 238, in train_loop_lbfgs
    optimizer.step(closure)
  File "/home/ubuntu/miniconda3/envs/elk/lib/python3.9/site-packages/torch/optim/optimizer.py", line 140, in wrapper
    out = func(*args, **kwargs)
  File "/home/ubuntu/miniconda3/envs/elk/lib/python3.9/site-packages/torch/autograd/grad_mode.py", line 27, in decorate_context
    return func(*args, **kwargs)
  File "/home/ubuntu/miniconda3/envs/elk/lib/python3.9/site-packages/torch/optim/lbfgs.py", line 312, in step
    orig_loss = closure()
  File "/home/ubuntu/miniconda3/envs/elk/lib/python3.9/site-packages/torch/autograd/grad_mode.py", line 27, in decorate_context
    return func(*args, **kwargs)
  File "/home/ubuntu/elk/elk/training/ccs.py", line 224, in closure
    logit0, logit1 = self(x0), self(x1)
  File "/home/ubuntu/miniconda3/envs/elk/lib/python3.9/site-packages/torch/nn/modules/module.py", line 1194, in _call_impl
    return forward_call(*input, **kwargs)
  File "/home/ubuntu/elk/elk/training/ccs.py", line 103, in forward
    return self.probe(x)
  File "/home/ubuntu/miniconda3/envs/elk/lib/python3.9/site-packages/torch/nn/modules/module.py", line 1194, in _call_impl
    return forward_call(*input, **kwargs)
  File "/home/ubuntu/miniconda3/envs/elk/lib/python3.9/site-packages/torch/nn/modules/container.py", line 204, in forward
    input = module(input)
  File "/home/ubuntu/miniconda3/envs/elk/lib/python3.9/site-packages/torch/nn/modules/module.py", line 1194, in _call_impl
    return forward_call(*input, **kwargs)
  File "/home/ubuntu/miniconda3/envs/elk/lib/python3.9/site-packages/torch/nn/modules/linear.py", line 114, in forward
    return F.linear(input, self.weight, self.bias)
RuntimeError: mat1 and mat2 must have the same dtype
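
For context, this error comes from torch.nn.functional.linear when the input and weight dtypes differ; a minimal repro (shapes are illustrative, not from the PR):

import torch
from torch import nn

probe = nn.Linear(4, 1)                     # weights are float32 by default
x = torch.randn(8, 4, dtype=torch.float16)  # e.g. fp16 hidden states
probe(x)  # RuntimeError: mat1 and mat2 must have the same dtype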

Member

If you just call .float() on the inputs before calling fit, this should work.
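
A minimal sketch of that suggestion (variable names follow the traceback above; this is not code from the PR):

x0, x1 = data
data = (x0.float(), x1.float())  # .float() casts to torch.float32
train_loss = ccs.fit(data, **train_params)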

Resolved review threads (outdated):
elk/extensions/recursive_ccs/train.py
elk/training/ccs.py
elk/extensions/recursive_ccs/parser.py

@@ -0,0 +1,110 @@
import csv
Member

It's not clear why this entire file is necessary. It seems to be mostly copied over from the primary train.py. Could we add a flag or a subparser to the main command instead?
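
A rough sketch of the subparser approach (command and flag names are hypothetical, not from this PR):

import argparse

parser = argparse.ArgumentParser(prog="elk")
subparsers = parser.add_subparsers(dest="command", required=True)

subparsers.add_parser("train", help="standard CCS training")
rccs = subparsers.add_parser("rccs", help="recursive CCS training")
rccs.add_argument("--num-probes", type=int, default=10)  # hypothetical flag

args = parser.parse_args()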

Collaborator Author

That would make the main training command more complicated. I think this change should not happen before the first release.
