
[WIP] SamTov measure memory scaling #476

Open
wants to merge 52 commits into base: main
Conversation

@SamTov (Member) commented Jan 25, 2022

Test this PR with 2 GB on Binder


Fix #464 and improve the memory safety of MDSuite, which is currently not robust enough for small-memory machines (< 8 GB or so) working on large data sets (> 10 GB).

Memory scaling measurement methodology

It seems one can use the pytest-monitor plugin to measure the memory usage of specific tests. The measurements are stored in an SQLite database that can then be queried for information about the specific test that was run. The workflow is outlined below:

  1. Turn off memory management in a calculator
  2. Add several experiments to a project, each with a different data size in steps of [1, 10, 50, 100] MB. For species-wise computations, separate experiments are not needed; for flux data, however, we cannot enforce computations over different groups, so several experiments will be required.
  3. Run the calculator for different databases and measure memory scaling w.r.t. the dominant scaling axis
    • In the case of atom-wise calculators this may be the data range or the number of atoms, depending on the operation.
  4. Fit a function to the relationship and try to evaluate the scaling function exactly (see the curve-fitting sketch below).
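
As a rough illustration of step 4, measurements collected as (data size, peak memory) pairs could be fitted with SciPy. The measurement values and model form below are purely illustrative placeholders, not results from this PR:

```python
import numpy as np
from scipy.optimize import curve_fit

# Hypothetical measurements: dataset size in MB vs. peak memory usage in MB.
data_size = np.array([1.0, 10.0, 50.0, 100.0])
peak_memory = np.array([12.0, 95.0, 470.0, 940.0])

def linear_model(size, slope, offset):
    """One candidate scaling function; power laws etc. can be tried the same way."""
    return slope * size + offset

params, _ = curve_fit(linear_model, data_size, peak_memory)
print(f"memory ~= {params[0]:.2f} * size + {params[1]:.2f} MB")
```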

This requires the person running the experiment to install pytest-monitor locally. In theory it could be added to the CI on GitHub and run every time, but we would then have to make the SQL reading more robust.
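
For reference, a minimal sketch of that SQL reading, assuming pytest-monitor's default .pymon SQLite file and its TEST_METRICS table (table and column names may differ between plugin versions):

```python
import sqlite3

# Assumes the default .pymon database written by pytest-monitor; the
# TEST_METRICS table and its ITEM / MEM_USAGE columns are plugin details
# that may change between versions.
with sqlite3.connect(".pymon") as connection:
    rows = connection.execute(
        "SELECT ITEM, MEM_USAGE FROM TEST_METRICS WHERE ITEM LIKE ?",
        ("%memory_scaling%",),
    ).fetchall()

for test_name, memory_mb in rows:
    print(f"{test_name}: {memory_mb:.1f} MB")
```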

Tasks

  • Add a general framework for memory testing using pytest-monitor
  • Add functionality for adding an HDF5 database directly into an MDSuite experiment
  • Add functionality for turning off memory management in the case of a test (all batch sizes set to 1 and no minibatching)
  • Add curve fitting to the test to identify the correct scaling relationship
  • Exclude the memory scaling test from the CI
  • Adjust GitHub runners to use a 4 GB memory machine to ensure that memory safety is achieved on a reasonably memory-limited device

Reviewer notes

Any assistance with this is welcome, as it is not the only important PR on MDSuite at the moment; both #475 and this one still need to be fixed before any release.

@SamTov SamTov changed the title Sam tov measure memory scaling SamTov measure memory scaling Jan 25, 2022
@SamTov SamTov changed the title SamTov measure memory scaling [WIP] SamTov measure memory scaling Jan 25, 2022
@PythonFZ (Member)

I have some questions:

Add functionality for adding a hdf5 database directly into an MDSuite experiment

What do you mean by that?

Exclude the memory scaling test from the CI

I did this for MLSuite with @pytest.mark.gap and then ran pytest -m "not gap". I think it makes sense to go with the same approach for MDSuite, but we could also name the files differently, idk.
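
A minimal sketch of that marker-based approach, using a hypothetical memory marker name rather than whatever this PR would actually register:

```python
# pyproject.toml (or pytest.ini) registers the marker so pytest does not warn:
# [tool.pytest.ini_options]
# markers = ["memory: memory scaling tests excluded from the regular CI run"]

import pytest

@pytest.mark.memory
def test_memory_scaling_rdf():
    ...  # run the calculator on the generated databases and record memory use

# CI would then run:  pytest -m "not memory"
# Local scaling runs: pytest -m memory
```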

Adjust GitHub runners to use a 4 GB memory machine to ensure that memory safety is achieved on a reasonably memory-limited device.

They do have 7 GB of memory for Linux / Windows and I don't think we can change that.

Comment on lines +55 to +56
memory_scaling_test: bool = False
memory_fraction: float = 0.5
Member

Maybe we could expand the config to be config.memory.scaling_test = True instead of config.memory_scaling_test = True with additional dataclasses. This way it could be more structured.
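
A minimal sketch of that nested-dataclass idea; the class and field names are illustrative, not the current MDSuite config API:

```python
from dataclasses import dataclass, field

@dataclass
class MemoryConfig:
    scaling_test: bool = False  # would replace config.memory_scaling_test
    fraction: float = 0.5       # would replace config.memory_fraction

@dataclass
class Config:
    memory: MemoryConfig = field(default_factory=MemoryConfig)

config = Config()
config.memory.scaling_test = True
```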

Member Author

Yeah, that's an interesting idea. How configuration is managed in general is a nice thing to discuss, as it can be quite involved. I think having dataclasses for the different settings, like you mention here, would be very nice.

@SamTov (Member Author) commented Jan 25, 2022

I have some questions:

Add functionality for adding a hdf5 database directly into an MDSuite experiment

What do you mean by that?

So what I want to do, and what I have started in the test module, is to generate several HDF5 databases with data of different exact sizes, e.g. 1 MB, 10 MB, and so on. Rather than generating a NumPy array, saving it to a readable file, and then reading it with MDSuite, I want to make an HDF5 database, create an experiment, and then add the database as data to that experiment.

In practice, this would be the equivalent of saving your simulation data into the H5MD database format and then using it in MDSuite.
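
A minimal sketch of generating such fixed-size files with h5py; the dataset layout is illustrative and not the exact group structure MDSuite or H5MD expects:

```python
import h5py
import numpy as np

def make_test_database(path: str, target_mb: float, n_atoms: int = 100) -> None:
    """Write roughly target_mb MB of float64 position data to an HDF5 file."""
    # float64 positions: n_configs x n_atoms x 3 values at 8 bytes each
    n_configs = int(target_mb * 1024 ** 2 / (n_atoms * 3 * 8))
    with h5py.File(path, "w") as database:
        database.create_dataset(
            "positions", data=np.random.rand(n_configs, n_atoms, 3)
        )

for size in (1, 10, 50, 100):
    make_test_database(f"test_{size}mb.h5", target_mb=size)
```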

Exclude the memory scaling test from the CI

I did this for MLSuite with @pytest.mark.gap and then ran pytest -m "not gap". I think it makes sense to go with the same approach for MDSuite, but we could also name the files differently, idk.

So the only factor here is whether we want to have it run each time? I think it is unnecessary to have the scaling tests run every time, but I am open to differing opinions.

Adjust GitHub runners to use a 4 GB memory machine to ensure that memory safety is achieved on a reasonably memory-limited device.

They do have 7 GB of memory for Linux / Windows and I don't think we can change that.

I thought somewhere you could set a memory limit, but maybe I am mistaken. In that case, we can set up local runners and just keep it as a test for, say, releases that we perform locally.

@SamTov SamTov mentioned this pull request Jan 27, 2022
@github-actions (bot) commented Feb 2, 2022

a262669

Memory Scaling

Raw data


Labels: None yet
Projects: None yet
Development

Successfully merging this pull request may close these issues.

Automatic/standardized scale function computation
2 participants