mini-batch training #642

brucefan1983 · 2024-06-13T14:36:51Z

This PR aims to improve the training results when using mini-batches, based on the following tricks:

Increase the default regularization by a factor of $\sqrt{10}$.
Set up an upper limit of the learning rate: $\sigma_{\rm max}=0.01$, which is also the initial learning rate $\sigma_0$.
Prepare the mini-batches according to energy/atom values, maximizing the diversity for each batch.

brucefan1983 added 4 commits June 13, 2024 22:09

try sigma decay

ca9f07a

merge master

3ff01a0

set up upper bound of learning rate

fb3d3b0

simga_min and sigma_max

0409f30

brucefan1983 changed the title ~~try sigma decay~~ sigma_max and sigma_min Jun 15, 2024

better mini-batches

4531e41

brucefan1983 changed the title ~~sigma_max and sigma_min~~ mini-batch training Jun 15, 2024

brucefan1983 marked this pull request as ready for review June 15, 2024 10:28

brucefan1983 added 5 commits June 15, 2024 19:37

good to have lower limit

b26f535

no lower limit

a7c6919

clean up

54ab65c

clean up again

15bcc2c

allow for tunning simga0

cbef518

shdchen approved these changes Jun 15, 2024

View reviewed changes

brucefan1983 merged commit db325ae into master Jun 15, 2024

brucefan1983 deleted the sigma_decay branch June 15, 2024 15:17

Provide feedback