A common feature when building training frameworks is saving the optimizer state along with the network state.
There's already a way to convert a D::Vec<E> into a Rust Vec<E>, which should make it easy to save a Gradients object. However, a pain point is that gradients are currently keyed by UniqueId, which may not be stable across saving/loading depending on how the model is initialized.
Options:
- Move optimizers away from Gradients and towards HashMap<String, D::Vec<E>>, where the String is the full path to the tensor. This would make it easy to use with TensorCollection, and would require minimal changes to implement serialization.
- The API for serializing optimizers could go through a model, so the optimizer gets full paths to tensors on save.
It would also be good to have a way to clone the optimizer state in memory. This is useful if you want to pit two networks against each other to see whether the trained network performs better than the one before training (which is part of the AlphaGo algorithm).