CSC413: ADAM-Add: Enhancing ADAM with Adaptive Decay Rates
Abstract
We propose a modification to the Adam optimizer. In addition to the maximum of past squared gradients used by AMSGrad, we add an update to the decay rates $\beta_1$ and $\beta_2$. Following these modifications, we visualize the convergence behaviour on several artificial landscapes and test our optimizer empirically on logistic regression, a multi-layer neural network, and CNNs. Initially, we believed that an adaptive decay rate would stabilize the updates and thereby improve the speed of convergence. In practice, however, as our experiments show, the results were not always positive. Moreover, in cases where our optimizer converges, it is stable but sometimes too cautious, which results in slow convergence. Nonetheless, in some experiments we do find that our additional update to $\beta$ improves on Adam in both the rate of convergence and the loss at convergence.
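
To make the idea concrete, below is a minimal NumPy sketch of a single update step that combines the AMSGrad-style maximum of past squared gradients with per-step adaptation of $\beta_1$ and $\beta_2$. The particular schedule used to adapt the decay rates here (damping the betas when the gradient norm is large) is a hypothetical placeholder for illustration only, not the rule proposed in this project; see the report for the actual update.

```python
import numpy as np

def amsgrad_adaptive_beta_step(theta, grad, state, lr=1e-3, eps=1e-8):
    """One AMSGrad-style Adam step with per-iteration adaptation of beta1/beta2.

    The beta schedule below is a hypothetical placeholder, not the rule from
    the report. `state` is a dict that persists optimizer buffers across calls.
    """
    t = state["t"] = state.get("t", 0) + 1
    m = state.get("m", np.zeros_like(theta))
    v = state.get("v", np.zeros_like(theta))
    v_max = state.get("v_max", np.zeros_like(theta))

    # Hypothetical adaptive decay rates: shrink the betas as gradients grow,
    # so the moving averages react faster in steep regions.
    g_norm = np.linalg.norm(grad)
    beta1 = 0.9 / (1.0 + 0.1 * g_norm)
    beta2 = 0.999 / (1.0 + 0.01 * g_norm)

    # Standard Adam moment estimates, using the adapted betas.
    m = beta1 * m + (1.0 - beta1) * grad
    v = beta2 * v + (1.0 - beta2) * grad ** 2

    # Bias correction as in Adam (approximate once the betas vary over time).
    m_hat = m / (1.0 - beta1 ** t)
    v_hat = v / (1.0 - beta2 ** t)

    # AMSGrad: keep the elementwise maximum of past second-moment estimates.
    v_max = np.maximum(v_max, v_hat)

    theta = theta - lr * m_hat / (np.sqrt(v_max) + eps)

    state.update(m=m, v=v, v_max=v_max)
    return theta
```

As a usage sketch, this step function can be called inside an ordinary training loop (e.g., on a toy quadratic or the artificial landscapes above), passing the same `state` dict on every iteration so the moment buffers and the running maximum carry over between steps.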