Fix gated forward functions #295

callummcdougall · 2024-09-18T11:08:18Z

2 fixes:

In the forward method, when self.use_error_term is True, for non-standard architectures things like centering and input activation functions weren't being applied correctly. I think this was just a mistake, and is now fixed.
In the encode_gated method, the hook hook_sae_acts_post pointed to the activations post-ReLU (or whatever activation function is), but not to the actual output of this function, i.e. post-ReLU activations multiplied by masking values. I think a long-term solution has 2 separate hooks for each of these (e.g. hook_sae_acts_post and hook_sae_mag_post), but if we just have a single hook called hook_sae_acts_post then I think it makes a lot more sense for it to refer to the output of the encoder.

jbloomAus · 2024-09-20T08:53:56Z

Thanks!

callummcdougall · 2024-09-20T09:34:22Z

np! also let me know if there are things I can do to not require formatting PRs in the future - would this involve something like using the same workspace config files as this repo does?

jbloomAus · 2024-09-20T11:49:55Z

@callummcdougall run make format and make check-ci nothing more to it!

* support seqpos slicing * fix forward functions for gated * remove seqpos changes * fix formatting (remove my changes) * format --------- Co-authored-by: jbloomAus <[email protected]>

callummcdougall added 4 commits September 18, 2024 09:13

support seqpos slicing

0a57e47

fix forward functions for gated

89714c8

remove seqpos changes

d38622e

fix formatting (remove my changes)

d9ea96a

format

c16366c

jbloomAus merged commit a708220 into jbloomAus:main Sep 20, 2024
5 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fix gated forward functions #295

Fix gated forward functions #295

callummcdougall commented Sep 18, 2024

jbloomAus commented Sep 20, 2024

callummcdougall commented Sep 20, 2024

jbloomAus commented Sep 20, 2024

Fix gated forward functions #295

Fix gated forward functions #295

Conversation

callummcdougall commented Sep 18, 2024

jbloomAus commented Sep 20, 2024

callummcdougall commented Sep 20, 2024

jbloomAus commented Sep 20, 2024