zero2SGD

Stochastic gradient descent (SGD), in contrast to batch gradient descent, performs a parameter update for each training example $x^{(i)}$ and label $y^{(i)}$: $\theta = \theta - \eta \cdot \nabla_\theta J(\theta; x^{(i)}; y^{(i)})$.
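To make the update rule above concrete, here is a minimal sketch of a single SGD step. The README does not fix a model or loss, so this assumes a linear model with squared-error loss $J = \frac{1}{2}(\theta \cdot x^{(i)} - y^{(i)})^2$; the function name `sgd_step` and the sample values are hypothetical and chosen only for illustration.

```python
import numpy as np

def sgd_step(theta, x_i, y_i, eta):
    """One SGD update, assuming J = 0.5 * (theta . x_i - y_i)**2 (an assumption)."""
    residual = theta @ x_i - y_i   # prediction error on this single example
    grad = residual * x_i          # gradient of J with respect to theta
    return theta - eta * grad      # theta = theta - eta * grad

theta = np.array([0.0, 0.0])       # initial parameters
x_i = np.array([1.0, 2.0])         # one training example
y_i = 3.0                          # its label
theta = sgd_step(theta, x_i, y_i, eta=0.1)
print(theta)
```

Starting from zero parameters, the residual is -3, so the gradient is (-3, -6) and the step moves the parameters to roughly (0.3, 0.6).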

Batch gradient descent performs redundant computations for large datasets, as it recomputes gradients for similar examples before each parameter update. SGD does away with this redundancy by performing one update at a time. It is therefore usually much faster and can also be used to learn online. Because each update is computed from a single example, SGD's frequent updates have high variance, which causes the objective function to fluctuate heavily, as in Image 1.
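The variance described above can be seen numerically. The following sketch assumes a linear model with squared-error loss and synthetic data (neither comes from the README). It compares the per-example gradients that SGD uses with the averaged gradient that batch gradient descent uses. All variable names are hypothetical.

```python
import numpy as np

rng = np.random.default_rng(0)

# Synthetic regression data (an assumption made for illustration).
X = rng.normal(size=(200, 2))
y = X @ np.array([2.0, -1.0]) + 0.1 * rng.normal(size=200)
theta = np.zeros(2)

residuals = X @ theta - y                     # one residual per example
per_example_grads = residuals[:, None] * X    # gradient SGD would use, per example
batch_grad = X.T @ residuals / len(y)         # gradient batch GD would use

# On average the per-example gradients agree with the batch gradient,
# but each individual one can differ from it considerably.
mean_of_per_example = per_example_grads.mean(axis=0)
spread = per_example_grads.std(axis=0)
print(batch_grad, mean_of_per_example, spread)
```

The mean of the per-example gradients matches the batch gradient, while the nonzero spread is the source of the fluctuation in the objective.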

While batch gradient descent converges to the minimum of the basin the parameters are placed in, SGD's fluctuation, on the one hand, enables it to jump to new and potentially better local minima. On the other hand, this fluctuation complicates convergence to the exact minimum, as SGD will keep overshooting. However, it has been shown that when the learning rate is decreased slowly, SGD shows the same convergence behaviour as batch gradient descent, almost certainly converging to a local minimum for non-convex optimization and to the global minimum for convex optimization.

The code fragment for SGD simply adds a loop over the training examples and evaluates the gradient with respect to each example. Note that we shuffle the training data at every epoch as explained in this section.

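import numpy as np

for epoch in range(nb_epochs):
  # Reshuffle so the examples are visited in a new order each epoch.
  np.random.shuffle(data)
  for example in data:
    # Gradient of the loss with respect to params for this single example.
    params_grad = evaluate_gradient(loss_function, example, params)
    params = params - learning_rate * params_grad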
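The fragment above leaves `evaluate_gradient`, `loss_function`, `data` and `params` undefined. As a runnable sketch under stated assumptions, the version below fills them in with a linear model, squared-error loss and noiseless synthetic data, none of which is specified by the README. It also decreases the learning rate with the schedule `eta0 / (1 + decay * step)`, which is one common choice and not necessarily the one the text has in mind. For brevity, the hypothetical `evaluate_gradient` here drops the `loss_function` argument.

```python
import numpy as np

def evaluate_gradient(example, params):
    """Gradient of 0.5 * (params . x - y)**2 for one example (assumed loss)."""
    x, y = example[:-1], example[-1]   # last column holds the label
    return (params @ x - y) * x

def sgd(data, params, nb_epochs=50, eta0=0.1, decay=0.01, seed=0):
    """Plain SGD with a slowly decreasing learning rate (illustrative schedule)."""
    rng = np.random.default_rng(seed)
    data = data.copy()                 # avoid reordering the caller's array
    step = 0
    for epoch in range(nb_epochs):
        rng.shuffle(data)              # shuffle rows in place at every epoch
        for example in data:
            learning_rate = eta0 / (1.0 + decay * step)
            params = params - learning_rate * evaluate_gradient(example, params)
            step += 1
    return params

# Synthetic, noiseless data: y = 2 * x0 - 1 * x1 (an assumption for the demo).
rng = np.random.default_rng(42)
X = rng.normal(size=(100, 2))
true_params = np.array([2.0, -1.0])
data = np.column_stack([X, X @ true_params])

estimate = sgd(data, np.zeros(2))
print(estimate)
```

On this convex toy problem the estimate should end up close to `true_params`. On noisy or non-convex problems, the same loop settles near a minimum only as the learning rate shrinks.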

LICENSE

MIT

Lornatang/zero2SGD

Folders and files

Latest commit

History

Repository files navigation

zero2SGD

LICENSE

About

Resources

License

Stars

Watchers

Forks

Languages