feat: batched sampling for MCMC #1176
base: main
Conversation
…rs' into amortizedsample
…from-different-posteriors' into amortizedsample
… reshapes in rejection
This reverts commit 17c5343.
Co-authored-by: Jan <[email protected]>
Codecov Report
Attention: Patch coverage is
Additional details and impacted files
@@ Coverage Diff @@
## main #1176 +/- ##
==========================================
- Coverage 84.55% 75.65% -8.91%
==========================================
Files 96 96
Lines 7603 7685 +82
==========================================
- Hits 6429 5814 -615
- Misses 1174 1871 +697
Flags with carried forward coverage won't be shown.
I've made some progress now towards this PR, and would like some feedback before I continue.
Great, it looks good. I like that the choice on iid or not can now be made at the level of the potential function. I would also opt for your suggested option. The question arises because we squeeze the batch_shape into a single dimension, right? For PyTorch broadcasting, one would expect something like (1, batch_x_dim, x_dim) and (batch_theta_dim, batch_x_dim, theta_dim) -> (batch_theta_dim, batch_x_dim), so by squeezing the xs and thetas into 2d one would always get a dimension that is a multiple of batch_x_dim (otherwise it cannot be represented by a fixed-size tensor). For (1, batch_x_dim, x_dim) and (batch_theta_dim, 1, theta_dim), PyTorch broadcasting semantics would compute all combinations. Unfortunately, after squeezing, these distinctions between the cases can no longer be fully preserved.
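To make the broadcasting point above concrete, here is a small PyTorch sketch (the shape values and variable names are purely illustrative, not sbi internals):

```python
import torch

# Illustrative sizes only.
batch_x_dim, batch_theta_dim, x_dim, theta_dim = 3, 5, 2, 4

xs = torch.randn(1, batch_x_dim, x_dim)              # (1, batch_x_dim, x_dim)
thetas = torch.randn(batch_theta_dim, 1, theta_dim)  # (batch_theta_dim, 1, theta_dim)

# With standard PyTorch broadcasting, the leading dims expand pairwise, so a
# potential evaluated on these shapes would cover all (theta, x) combinations:
print(torch.broadcast_shapes(xs.shape[:-1], thetas.shape[:-1]))  # torch.Size([5, 3])

# After squeezing the batch dims into a single leading dimension, both inputs
# are plain (N, event_dim) tensors and the pairing information is gone:
xs_flat = xs.expand(batch_theta_dim, -1, -1).reshape(-1, x_dim)          # (15, 2)
thetas_flat = thetas.expand(-1, batch_x_dim, -1).reshape(-1, theta_dim)  # (15, 4)
print(xs_flat.shape, thetas_flat.shape)
```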
Great effort, thanks a lot for tackling this 👏
I do have a couple of comments and questions. Happy to discuss in person if needed.
|
||
x_ = x.repeat_interleave(num_chains, dim=0) | ||
|
||
self.potential_fn.set_x(x_, interpret_as_iid=False) |
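As a standalone illustration of what the first of these two lines does (shapes here are hypothetical):

```python
import torch

num_chains = 4
x = torch.arange(6.0).reshape(2, 3)  # a batch of 2 observations, x_dim = 3

# Each observation is repeated num_chains times along dim 0, so every chain
# belonging to a given observation conditions on the same x:
x_ = x.repeat_interleave(num_chains, dim=0)
print(x_.shape)  # torch.Size([8, 3])
print(x_[:4])    # the first 4 rows are all copies of x[0]
```

The subsequent `set_x(x_, interpret_as_iid=False)` then tells the potential to treat these rows as separate conditioning values rather than iid trials.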
I don't understand why `interpret_as_iid=False` is hardcoded here, but maybe it will become clear below.
Makes sense now. I am just wondering: what if `x` was already set before using `set_default_x`, and it was set with iid samples? Maybe we should add a warning then? Effectively, `sample_batched` then overwrites the default `x` by default. It should all be clear from the API of course, especially because one has to pass a new `x` here, but for users it might not be clear that they cannot mix iid and batched evaluation. What do you think?
I like adding a warning here. We can check if `self._potential_fn` already has `x_is_iid` set as True, and then raise a warning in this case that the user is mixing iid and batched evaluation.
In this case, we should also raise a warning for `MCMCPosterior.sample()` if `x_is_iid` was previously set as False.
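A minimal sketch of the kind of check discussed here; the helper name is hypothetical and the `x_is_iid` attribute is taken from the discussion above, so the real implementation may differ:

```python
from warnings import warn


def _warn_if_mixing_iid_and_batched(potential_fn, now_iid: bool) -> None:
    # Hypothetical helper: compare how the default x was set against how the
    # current call interprets its batch dimension.
    previously_iid = getattr(potential_fn, "x_is_iid", None)
    if previously_iid is None:
        return  # no default x was set, nothing to warn about
    if previously_iid and not now_iid:
        warn(
            "The default `x` was set as iid, but `sample_batched` interprets the "
            "batch dimension as independent observations; do not mix iid and "
            "batched evaluation.",
            stacklevel=2,
        )
    elif not previously_iid and now_iid:
        warn(
            "The default `x` was set as non-iid (batched), but `sample` "
            "interprets the batch dimension as iid trials.",
            stacklevel=2,
        )
```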
Thanks for the review! I implemented your suggestions. An additional point - For
Looks great! I added just a couple of last questions.
if not x_o_is_iid:
    warn(
        "The default `x_o` has `x_is_iid = False`, but you are now sampling "
        "with a batch `x` with `x_is_iid = True`. If you want to sample non-iid "
        "`x`, please reset `x_is_iid = False` in the potential function.",
        stacklevel=2,
    )
When does this happen? Does the user have to explicitly use the `posterior.potential_fn` object to make it happen?
If not, i.e., if it can happen to a user who constructed the posterior using `build_posterior` (maybe without knowing anything about potentials), then we should add more details to this warning, e.g., "by setting `posterior.potential_fn.x_is_iid=False`".
Or am I missing something here?
Now that we support both `sample_batched` and `sample` when a batch of `x` is passed, we want to warn the user to use the correct one. That is, if `x` is batched iid, the user should use `sample`, and if `x` is batched and NOT iid, the user should use `sample_batched`. I think instead of warning the user to change the definition of `x` in the potential, it's sufficient to just warn them to use the correct one of `sample` and `sample_batched`. But this is already covered in `sbi.utils.sbiutils`, where we already raise a warning if the batch dimension of `x` is greater than 1, telling the user to choose `sample` or `sample_batched` so that `x` is interpreted correctly. So maybe here we remove the warning altogether?
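For illustration, a self-contained sketch of the kind of check described; this mimics the idea, not the actual `sbi.utils.sbiutils` code:

```python
from warnings import warn

import torch


def warn_on_batched_x(x: torch.Tensor) -> None:
    # If x carries a batch dimension > 1, the user must pick the right method:
    # `sample` treats the batch as iid trials of one observation, while
    # `sample_batched` treats it as independent observations.
    if x.ndim > 1 and x.shape[0] > 1:
        warn(
            "The passed `x` has a batch dimension larger than 1. Use `sample` "
            "for iid trials and `sample_batched` for independent observations.",
            stacklevel=2,
        )


warn_on_batched_x(torch.randn(10, 3))  # triggers the warning
```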
@@ -321,6 +334,7 @@ def sample(
    thin=thin,  # type: ignore
    warmup_steps=warmup_steps,  # type: ignore
    vectorized=(method == "slice_np_vectorized"),
    interchangeable_chains=True,
This is just true for `slice_np` sampling? Or should we expose it via the `kwargs`?
`interchangeable_chains` corresponds to whether the batch of `x` provided to the sampler is iid or not. Right now, we only support `sample_batched` with `slice_np_vectorized`. In the future, we might want to support doing this with one of the other MCMC samplers we have available, in which case we can expose `interchangeable_chains` in the `kwargs` - although I would suggest this can be a future PR?
theta = ensure_theta_batched(torch.as_tensor(theta)).to(self.device)
Why is this not needed anymore? Especially the `.to(self.device)`.
You're right, it's still needed. I will add this back.
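For context, a small sketch of what the removed line accomplishes; `ensure_theta_batched_sketch` below only mimics sbi's `ensure_theta_batched` for illustration:

```python
import torch


def ensure_theta_batched_sketch(theta: torch.Tensor) -> torch.Tensor:
    # A single parameter vector gets a leading batch dimension of 1, so the
    # potential can always assume shape (batch, theta_dim).
    if theta.ndim == 1:
        theta = theta.unsqueeze(0)
    return theta


theta = torch.tensor([0.1, 0.2, 0.3])                        # shape (3,)
theta = ensure_theta_batched_sketch(torch.as_tensor(theta))  # shape (1, 3)
theta = theta.to("cpu")  # stand-in for `.to(self.device)` in the original line
print(theta.shape)
```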
What does this implement/fix? Explain your changes
This pull request aims to implement the `sample_batched` method for MCMC.

Current problem
- `BasePotential` can either "allow_iid" or not. Hence, each batch dimension will be interpreted as IID samples.
- Suggested change: replace `allow_iid` with a mutable attribute (or optional input argument) `interpret_as_iid`.
- The current implementation will let you sample the correct shape, BUT will output the wrong solution. This is because the potential function will broadcast, repeat, and finally sum up the first dimension, which is incorrect.