Keras 3: Streamlined Backend #159

LarsKue · 2024-04-15T10:00:56Z

Work In Progress

This PR is still in progress. Please discuss open issues and raise possible concerns below.

Summary

Streamline the definition of Prior and other distributions
When sampling parameters, they are now always passed by name in a dictionary
Add decorators for easy construction and batching of backend-conforming distribution objects

@bf.distribution
def prior():
    return {"theta": np.random.normal(size=2)}

batch_of_samples = prior.sample((32,))
assert batch_of_samples["theta"].shape == (32, 2)

Amortizers are now Keras 3 Models, which allows backend-agnostic training

# set backend to "tensorflow", "jax", or "torch" before importing bayesflow
import os
os.environ["KERAS_BACKEND"] = "torch"

import bayesflow as bf

amortizer = bf.AmortizedPosterior(...)
amortizer.fit()

Move behavior of Configurator into Amortizer
Add default configuration behavior for edge cases, like no summary network
General overhaul of the data flow inside Amortizer
Add a Dataset object that takes care of data loading in multiple worker processes
Move Training Strategy from Trainer into Dataset

# online
dataset = bf.datasets.OnlineDataset(generative_model=..., workers=12, use_multiprocessing=True)
amortizer.fit(dataset, steps_per_epoch=1000)

# offline, in memory
data = {...}  # some dictionary
dataset = OfflineDataset(data, workers=12, use_multiprocessing=True)
amortizer.fit(dataset)

# offline, on disk
import keras

class MyDataset(keras.utils.PyDataset):
    ...  # user-implemented data loading from disk

dataset = MyDataset(workers=12, use_multiprocessing=True)
amortizer.fit(dataset)

In Progress

Allow the user to specify what variables are observed vs. inferred vs. conditioned on by name

# two moons
posterior = Amortizer(
    inference_network,
    observed_variables=["x1", "x2"],
    inferred_variables=["theta1", "theta2"],
    inference_conditions=["r", "alpha"]
)

Named arguments with distribution decorators (@LarsKue)

@bf.Prior(is_conditional=True)
def prior(alpha, beta):
    return {"a": 2 * alpha, "b": alpha + beta}

Update Documentation (@LarsKue, mostly done)
Implement hierarchical amortizers for multi-level models (@daniel-habermann)
Port existing networks to Keras 3 (@Chase-Grajeda)
Add support for non-batchable context and make this the default (@LarsKue)

Postponed

We should probably do these things after merging with dev and before merging with main:

Add support for graph structured priors
Split sampled data by parameter names and return as a dict (@jerrymhuang)
Constrain predicted parameters to user-defined subspaces (@Kucharssim)
Add coverage statistics to README.md with a workflow
Expand test coverage to at least 75%
Update example notebooks
- Add smart defaults (@stefanradev93)

settings = bf.settings.propose(
    training="offline",
    dataset_size=600_000,
    data_shape=(200, 2),
    data_type="time series",
    parameters_shape=(8,)
)

In Discussion

Add a WorkFlow (name wip) object that encapsulates both amortizer and dataset for easier post-processing and model sharing
Rename batchable / non-batchable context

Dropped

Allow the user to pass a dictionary of distributions instead of defining a prior

import tensorflow_probability as tfp
D = tfp.distributions
prior = {"theta": D.Normal([0.0, 0.0], [1.0, 1.0])}

Reason: We should enforce a single way to do things. Also poor support with pure jax.

Return sampled data as a DataFrame

Reason: Too restrictive for data structure.

paul-buerkner · 2024-04-15T10:08:38Z

@LarsKue Thank you so much! This already looks amazing!

Could you perhaps add a simple fully runnable example here for people to get started playing around with it? It is kind of there above, but I think it would make things easier to have one chunk of example code to copy and edit from there.

Everyone, please try out the new interface and tell us what you think!

LarsKue · 2024-04-15T10:12:02Z

@paul-buerkner Yes, I am working on it. I hope I have one ready today.

review-notebook-app · 2024-04-17T11:21:42Z

Check out this pull request on

See visual diffs & provide feedback on Jupyter Notebooks.

Powered by ReviewNB

marvinschmitt · 2024-04-18T07:51:18Z

Great work!! 👏

I'll write down some thoughts on the installation process. Those don't need any changes in the streamlined codebase but are just reminders for our future selves shortly before the release.

In addition to keras (which will replace the current tensorflow dependencies), the user has to install their favorite backend. How do we approach this for users who aren't proficient in Python-ecosystem stuff? A few initial thoughts:
- Extras like pip install bayesflow[torch]. Advantage: Easy interface. Disadvantage: Restricted to pip, no mamba equivalent. I don't like that option.
- Message during the bayesflow installation. That's annoying (if possible at all?) and I wouldn't do that either.
- After installation, upon running bayesflow: Catch any errors that relate to missing backend packages and provide a comprehensive error message with concrete pointers on how to install the necessary backend to fix the issue. Currently my favorite option.
Python >=3.11 is required for typing.Self

…lined-backend

…r outside parallelization this is technically faster, since environment setup can be parallelized as well, but requires to pass the `--parallel auto` flag to tox

Fixed bug that attempted to reshape over None dimension when training offline. Updated build() for both modules. Condensed comments in lstnet.

Added new kwargs retrieval method in lstnet.py and skip_gru.py

…3/BayesFlow into streamlined-backend

Tuned default arguments for both LSTNet and SkipGRU. Condensed arithmetic in SkipGRU.

…3/BayesFlow into streamlined-backend

…lined-backend

Also rename FunctionalSimulator to LambdaSimulator

this makes the implementation more explicit, which is easier to debug but also means a little bit more code clutter all in all I think this is better

…mi-dynamic type checkers

LarsKue added refactoring Some code shall be redesigned unit tests A new set of tests needs to be added. labels Apr 15, 2024

LarsKue self-assigned this Apr 15, 2024

vpratz mentioned this pull request Apr 19, 2024

bayesflow breaks existing tensorflow installation #162

Open

stefanradev93 and others added 22 commits June 12, 2024 04:27

Add concatenate from dict utility

9998fbf

Clean up overly long lines

8396e41

Clean up

047204d

Bugfix concatenation axis

42d20ab

Better semantics for keys_list

60935ac

Fix package reference

2a7f4cf

Add docstring

5152b5c

Slight semantic change

8f195c2

Implement base compute metrics

84ed906

Simplify import structure

7280f40

Simplify import structure

060fd10

Add output type hints

ab983d9

add scipy to dependencies

5435a1e

Merge remote-tracking branch 'origin/streamlined-backend' into stream…

67ef313

…lined-backend

add loss tracking to tensorflow approximator

967f268

clean up data configuration

f90ddab

remove within-process test parallelization for tox to allow for bette…

635c6e4

…r outside parallelization this is technically faster, since environment setup can be parallelized as well, but requires to pass the `--parallel auto` flag to tox

fix imports

73602de

Fixed reshape bug

c70bc43

Fixed bug that attempted to reshape over None dimension when training offline. Updated build() for both modules. Condensed comments in lstnet.

Added new kwargs method

af0e1fb

Added new kwargs retrieval method in lstnet.py and skip_gru.py

Merge branch 'streamlined-backend' of https://github.com/stefanradev9…

b9849c3

…3/BayesFlow into streamlined-backend

Tuned default args

360874e

Tuned default arguments for both LSTNet and SkipGRU. Condensed arithmetic in SkipGRU.

LarsKue and others added 30 commits July 10, 2024 19:11

update keras dependency in pyproject.toml

0fbd131

remove coverage from tox tests

75e6cbb

Propagate kwargs

ad54660

Propagate kwargs to velocity

78074bf

Merge branch 'streamlined-backend' of https://github.com/stefanradev9…

00a123a

…3/BayesFlow into streamlined-backend

Equalize defaults

d8428c8

add compute_output_shape to mlp

4b629e4

Add compute_shape

85073f1

Retire resnet

9ed43e8

Change order of methods

994b984

fix squeeze for sinkhorn log

caf5b0f

Merge remote-tracking branch 'origin/streamlined-backend' into stream…

3f08880

…lined-backend

Remove autodetection of is_batched and is_numpy

5213c6b

Also rename FunctionalSimulator to LambdaSimulator

revert to building with self.call

0919762

improve filter_kwargs

215b12e

potential fix for circular import

3267cd3

make implementation of lambda simulator more procedural

df2dc94

this makes the implementation more explicit, which is easier to debug but also means a little bit more code clutter all in all I think this is better

use keras.ops.cast

a196c7f

remove passing networks by class (not serializable)

6a91227

re-add convert_args and convert_kwargs to utils imports

01493aa

adjust tests to use changes for sequential and lambda simulator

373fffc

clean up tests

970679e

globalize more fixtures

f3e093a

Add minimal docs, fix base class

942044d

Add docs

4968d75

Fix base class and build methods

c8b76e3

fix summary networks

b908c91

allow and change default base distribution to None

ab635f7

simplify Tensor type definition to allow better type inference for se…

a14cfff

…mi-dynamic type checkers

hacky fix for approximator.sample

b4afea0

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Keras 3: Streamlined Backend #159

Keras 3: Streamlined Backend #159

LarsKue commented Apr 15, 2024 •

edited

Loading

paul-buerkner commented Apr 15, 2024 •

edited

Loading

LarsKue commented Apr 15, 2024

review-notebook-app bot commented Apr 17, 2024

marvinschmitt commented Apr 18, 2024

Keras 3: Streamlined Backend #159

Are you sure you want to change the base?

Keras 3: Streamlined Backend #159

Conversation

LarsKue commented Apr 15, 2024 • edited Loading

Work In Progress

Summary

In Progress

Postponed

In Discussion

Dropped

paul-buerkner commented Apr 15, 2024 • edited Loading

LarsKue commented Apr 15, 2024

review-notebook-app bot commented Apr 17, 2024

marvinschmitt commented Apr 18, 2024

LarsKue commented Apr 15, 2024 •

edited

Loading

paul-buerkner commented Apr 15, 2024 •

edited

Loading