Early September Improvements + Fixes #95
base: main
Conversation
This looks great, thanks also for adding in some more comments / docstrings. Could you write a quick check that multi-GPU LLC estimation works? It doesn't need to converge to any known value; I'd just like to have some indication when someone inevitably breaks this in the future.
Sure, I'll write a test!
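A minimal sketch of what such a check could look like, reusing the `get_stats` helper from this test file (the two-GPU skip guard and the finiteness assertion are assumptions about the intent, not code from this PR):

```python
import numpy as np
import pytest
import torch


@pytest.mark.gpu
def test_multi_gpu_llc_estimation_runs():
    # Sketch: get_stats is the helper defined in this test file.
    # How chains are spread across devices is assumed, not specified here.
    if torch.cuda.device_count() < 2:
        pytest.skip("needs at least 2 GPUs")
    stats = get_stats("cuda", seed=100)
    trace = np.asarray(stats["llc/trace"])
    # No convergence target: just assert the sampler produces a finite trace,
    # so a future multi-GPU regression fails loudly instead of silently.
    assert np.all(np.isfinite(trace))
```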
tests/integration/test_sampling.py
Outdated
```python
def distance(s1, s2):
    assert s1.keys() == s2.keys(), f"Expected the same keys in both stats, got {s1.keys()} and {s2.keys()}."
    assert s1["llc/trace"].shape == s2["llc/trace"].shape, f"Expected the same shape for llc/trace, got {s1['llc/trace'].shape} and {s2['llc/trace'].shape}."
    return np.mean((s1["llc/trace"] - s2["llc/trace"])**2)
```
I'd rather you use standard numpy or (preferred for tensors) torch closeness checks, such as torch.isclose or numpy.isclose. Also, this checks a distance averaged across every step of the chain; why choose that over a per-step distance? (The latter is more sensitive to small changes, so it might be preferred for this test IMO, though I don't feel strongly about that.)
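For concreteness, a per-step version might look something like this (a sketch only; the tolerances are placeholders):

```python
import numpy as np


def assert_traces_close(s1, s2, rtol=1e-5, atol=1e-8):
    # Compare the chains step by step instead of averaging an error over the
    # whole trace, so a late divergence can't be washed out by early agreement.
    t1, t2 = np.asarray(s1["llc/trace"]), np.asarray(s2["llc/trace"])
    assert t1.shape == t2.shape, f"Shape mismatch: {t1.shape} vs {t2.shape}"
    mismatch = ~np.isclose(t1, t2, rtol=rtol, atol=atol)
    assert not mismatch.any(), f"Traces diverge at steps {np.argwhere(mismatch)}"
```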
You're right, isclose or allclose would be better, especially for a per-step distance. My rationale for looking at the sampling chain is to check whether the chains diverge, but averaging the errors across steps makes that less effective. Let me fix that real quick:
tests/integration/test_sampling.py
Outdated
```python
sampling_method=SGLD,
optimizer_kwargs=dict(lr=4e-4, localization=100.0),
num_chains=chains,  # How many independent chains to run
num_draws=200,  # How many samples to draw per chain
```
I don't think 200 draws is necessary; using 10 draws should still allow the same checks (at lower distances) while being much more computationally efficient.
```python
def gpu_default():
    return get_stats("cuda", seed=100)


def test_cpu_consistent(cpu_default):
```
Please add @pytest.mark.gpu or @pytest.mark.slow where appropriate; this allows our (monorepo) GitHub CI/CD to ignore slow / inapplicable tests.
My criteria for slowness are vague; let's use >20s as slow, unless we think a test is crucial, in which case I'd be happy to go up to 1m. My criteria for GPU-ness are pretty clear :)
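For example (standard pytest markers; the test names below are placeholders, and the marker names would still need to be registered in the monorepo's pytest configuration):

```python
import pytest


@pytest.mark.gpu  # deselected on CI runners without a GPU
def test_gpu_consistent(gpu_default):
    ...


@pytest.mark.slow  # >20s wall time; deselect with `pytest -m "not slow"`
def test_cpu_consistent_long_chain(cpu_default):
    ...
```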
Thanks for the feedback! Let me know if any more issues with this PR need to be resolved. Do our GitHub Actions runners have more than one core?
…a returning a different nbeta depending on the dataloader's batch size.
Additions: