
Adagrad Optimizer Implementation #154

Merged
milancurcic merged 7 commits into modern-fortran:main on Aug 6, 2023
Conversation

@Spnetic-5 (Collaborator) commented on Jul 28, 2023:

Reference: PyTorch Docs
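
For context, the Adagrad update as described in the referenced PyTorch documentation, written here as a paraphrase (not text from this PR), with learning rate $\gamma$, learning-rate decay $\eta$, weight decay $\lambda$, and a small $\epsilon$ for numerical stability:

$$
\begin{aligned}
\tilde{\gamma}_t &= \frac{\gamma}{1 + (t-1)\,\eta} \\
g_t &\leftarrow g_t + \lambda\,\theta_{t-1} \\
G_t &= G_{t-1} + g_t \odot g_t \\
\theta_t &= \theta_{t-1} - \tilde{\gamma}_t\,\frac{g_t}{\sqrt{G_t} + \epsilon}
\end{aligned}
$$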

@Spnetic-5 marked this pull request as ready for review on July 28, 2023, 05:14
Review comments on src/nf/nf_optimizers.f90 (all resolved)
@milancurcic (Member) commented:

Thanks @Spnetic-5. I believe it's correct now. In your original implementation, the L2 regularization was not accounted for in the accumulation of the squared gradients because you applied it later in the param update. The learning rate decay was also doubly accounted for because in each step the learning rate should be amortized relative to the original learning rate, not the one from the previous step. Subtle differences that weren't caught in the tests.

I'll go ahead and merge; please release v0.15.0 when you get a chance.
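
For readers without the diff in front of them, here is a minimal sketch of a single Adagrad step that reflects both points above. The names (`adagrad_step`, `sum_squares`, `lr_t`, etc.) are illustrative and may not match the actual code in src/nf/nf_optimizers.f90.

```fortran
! Sketch of one Adagrad parameter update (illustrative, not the merged code).
pure subroutine adagrad_step(params, gradient, sum_squares, t, &
    learning_rate, learning_rate_decay, weight_decay, epsilon)
  real, intent(in out) :: params(:), sum_squares(:)
  real, intent(in) :: gradient(:)
  integer, intent(in) :: t
  real, intent(in) :: learning_rate, learning_rate_decay, weight_decay, epsilon
  real :: g(size(gradient)), lr_t

  ! Fold the L2 penalty into the gradient first, so that it also
  ! contributes to the accumulated squared gradients.
  g = gradient + weight_decay * params

  ! Amortize the learning rate relative to the original learning rate,
  ! not the rate from the previous step.
  lr_t = learning_rate / (1 + (t - 1) * learning_rate_decay)

  sum_squares = sum_squares + g**2
  params = params - lr_t * g / (sqrt(sum_squares) + epsilon)

end subroutine adagrad_step
```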

@milancurcic merged commit b119194 into modern-fortran:main on Aug 6, 2023
2 checks passed