Smoke tests with tiny gpt2, fix CCSReporter #149

thejaminator · 2023-03-23T15:03:25Z

This MR adds a smoke test for CCSReporter with tinygpt2

Subsequent issues:

Fix EigenReporter for tinygpt2
Make the tests work for tiny deberta (or some other MLM?)

elk/training/ccs_reporter.py

thejaminator · 2023-03-24T06:01:45Z

tests/test_smoke_elicit.py

+but you'll need to make deberta fp32 instead of fp16
+because pytorch cpu doesn't support fp16
+"""
+


basically the deberta model itself does some fp16 operations that aren't supported. but tiny-gpt doesn't. so that why tiny-gpt works.

i'll try to figure that out in another MR. i think we could convert fp16 models to fp32 if its CPU, but seems hacky?

I actually don't view this as hacky. In general float16 isn't supported on CPUs. We could check the whether we're running on CPU right when we call AutoModel.from_pretrained and tell HF to always load the model as float32 in that case, and use the dtype of the checkpoint otherwise.

^ just made this change, so if you want to add a deberta test you can. I think tiny gpt2 is probably sufficient for now though

thejaminator · 2023-03-24T06:02:21Z

tests/test_smoke_elicit.py

+ u[:] = torch.einsum("...ij,...j->...i", A, V[..., k, :])
+
+ RuntimeError: select(): index 1 out of range for tensor of size [1, 2]
+ at dimension 0


EigenReporterConfig is broken for tiny-gpt2. didn't investigate, would like to push the fix for CCS first

Thanks for noticing this, I just pushed a fix

norabelrose

After my changes it LGTM

thejaminator added 8 commits March 23, 2023 21:41

add smoke tests

01f3a2f

fix test

73bd2b0

patch CCS

9a824bc

remove comments

d46bf2a

undo changes

c124d40

add eigenreporter test for tiny-gpt2

b233517

use repeat_interleave like auc calculation

c1cf9a0

make flake happy

e43d2c5

thejaminator changed the title ~~WIP: Smoke tests with tiny models~~ Smoke tests with tiny gpt2 Mar 24, 2023

thejaminator commented Mar 24, 2023

View reviewed changes

elk/training/ccs_reporter.py Show resolved Hide resolved

thejaminator commented Mar 24, 2023

View reviewed changes

thejaminator changed the title ~~Smoke tests with tiny gpt2~~ Smoke tests with tiny gpt2, fix CCSReporter Mar 24, 2023

norabelrose added 2 commits March 24, 2023 10:41

Fix lanczos_eigsh for small matrices

89fbee5

Always use float32 on CPU

d876f88

norabelrose self-requested a review March 24, 2023 10:45

norabelrose approved these changes Mar 24, 2023

View reviewed changes

norabelrose merged commit ad2a088 into main Mar 24, 2023

norabelrose deleted the smoke-test branch March 24, 2023 11:11

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Smoke tests with tiny gpt2, fix CCSReporter #149

Smoke tests with tiny gpt2, fix CCSReporter #149

thejaminator commented Mar 23, 2023 •

edited

Loading

thejaminator Mar 24, 2023

norabelrose Mar 24, 2023

norabelrose Mar 24, 2023

thejaminator Mar 24, 2023

norabelrose Mar 24, 2023

norabelrose left a comment

Smoke tests with tiny gpt2, fix CCSReporter #149

Smoke tests with tiny gpt2, fix CCSReporter #149

Conversation

thejaminator commented Mar 23, 2023 • edited Loading

thejaminator Mar 24, 2023

Choose a reason for hiding this comment

norabelrose Mar 24, 2023

Choose a reason for hiding this comment

norabelrose Mar 24, 2023

Choose a reason for hiding this comment

thejaminator Mar 24, 2023

Choose a reason for hiding this comment

norabelrose Mar 24, 2023

Choose a reason for hiding this comment

norabelrose left a comment

Choose a reason for hiding this comment

thejaminator commented Mar 23, 2023 •

edited

Loading