Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[Bugfix] Fix AdamG instability #308

Merged
merged 2 commits into from
Dec 6, 2024
Merged

Conversation

Vectorrent
Copy link
Contributor

@Vectorrent Vectorrent commented Dec 6, 2024

Problem (Why?)

Close #305

AdamG is unstable, and always leads to gradient explosion.

Solution (What/How?)

We had accidentally duplicated the minus-one operation in the bias correction steps. After removing that, the problem was fixed.

Also changed the default hyperparameters, to better align with the research.

Copy link
Owner

@kozistr kozistr left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

wow you nailed it. I have never guessed. thank you!

@kozistr kozistr merged commit d1db791 into kozistr:main Dec 6, 2024
1 of 2 checks passed
@kozistr kozistr mentioned this pull request Dec 6, 2024
1 task
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Projects
None yet
Development

Successfully merging this pull request may close these issues.

AdamG has serious numerical instabilities
2 participants