
Add docs note about saving/loading models with anonymous functions #2263

Open
darsnack opened this issue Jun 9, 2023 · 9 comments
Comments

@darsnack
Member

darsnack commented Jun 9, 2023

The new save/load docs promote JLD2.jl which does not support saving/loading anonymous functions reliably. This most commonly occurs for activation functions. The solution is to use Flux.state + Flux.loadmodel! and set the desired anonymous function in the destination of loadmodel!. This avoids needing the serialization library to correctly handle the anonymous function.
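A minimal sketch of this workflow (layer sizes and the activation are illustrative):

```julia
using Flux, JLD2

# Model with an anonymous activation function, which JLD2 cannot
# reliably round-trip on its own.
model = Chain(Dense(2 => 4, x -> leakyrelu(x, 0.1)), Dense(4 => 1))

# Save only the parameter tree, not the model object itself.
jldsave("checkpoint.jld2"; state = Flux.state(model))

# Later, in a fresh session: rebuild the model in code (re-declaring
# the anonymous activation) and load the saved parameters into it.
model2 = Chain(Dense(2 => 4, x -> leakyrelu(x, 0.1)), Dense(4 => 1))
Flux.loadmodel!(model2, JLD2.load("checkpoint.jld2", "state"))
```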

This will be problematic when the anonymous function contains data (state) that actually should be restored. A possible solution here is to make the closure an explicit struct. Maybe there are better solutions.
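As a sketch of the explicit-struct approach (the `Scale` type and its field are hypothetical):

```julia
using Flux

# Explicit struct replacing a closure like `x -> x .* scale`.
# The captured data becomes a field, visible to Flux.state.
struct Scale{T}
    scale::T
end
(s::Scale)(x) = x .* s.scale

Flux.@functor Scale  # make the field part of the model tree

model = Chain(Dense(2 => 4), Scale(rand(Float32, 4)))
```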

Regardless, new users are unlikely to realize these edge cases. We should expand the saving/loading documentation to explain how to handle these cases with code examples.

@theabhirath
Member

Activation functions have always been handled a bit awkwardly, so even though it goes slightly against the Flux style, it might not be a bad idea to have an Activation layer that does this explicitly. We've had this discussion before (FluxML/NNlib.jl#423 (comment) and FluxML/NNlib.jl#423 (comment)), so perhaps it's just time to do it.

@darsnack
Member Author

An Activation layer won't help if it wraps an anonymous function. It's a wrapper, so it just pushes the issue one node deeper in the tree.

A cleaner and correct solution is to simply name the function (e.g. myact(x) = ...). If you are closing over some data that needs to be serialized, then define a callable struct.
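Concretely (names and sizes are illustrative):

```julia
using Flux

# Named function: its type is reconstructible by name, so it
# survives serialization, unlike an anonymous closure.
myact(x) = leakyrelu(x, 0.2)

# Closing over data? Define a callable struct instead:
struct Crop
    n::Int
end
(c::Crop)(x) = x[begin:c.n, :]

model = Chain(Crop(2), Dense(2 => 1, myact))
```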

@ToucheSir
Member

I wonder if we could create a helper function that searches the model for these closures and warns the user when it finds them?
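A rough sketch of such a helper (`warn_closures` is hypothetical, not an existing Flux API):

```julia
using Flux, Functors

# Walk every leaf of the model. Anonymous functions have gensym'd
# type names starting with "#", which cannot be looked up by name
# when deserializing.
function warn_closures(model)
    Functors.fmap(model) do leaf
        if leaf isa Function && startswith(String(nameof(typeof(leaf))), "#")
            @warn "Anonymous function $leaf may not save/load reliably"
        end
        leaf
    end
    return nothing
end

warn_closures(Chain(Dense(2 => 2, x -> abs(x))))  # emits a warning
```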

@tom-plaa

Might my issue at #2339 be related to this? The model contains anonymous functions that slice the input arrays, for example x->x[begin:inputpoints, 1, :]. How would one go about correctly saving a model like this, according to your advice?

@ToucheSir
Member

Just as mentioned up top: extract the parameters with Flux.state and only save those. I suspect we'll soon be scrubbing from the docs any examples that use BSON to save the whole model, because it's just too error-prone.
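Applied to the slicing example above (a sketch, with `inputpoints` and the layer sizes chosen for illustration):

```julia
using Flux, JLD2

inputpoints = 10
model = Chain(x -> x[begin:inputpoints, 1, :], Dense(inputpoints => 1))

# The closure holds no parameters, so Flux.state skips past it and
# only the Dense layer's weight/bias arrays are written to disk.
jldsave("mymodel.jld2"; state = Flux.state(model))

# In a new session: run the same model-building code, then restore:
Flux.loadmodel!(model, JLD2.load("mymodel.jld2", "state"))
```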

@tom-plaa

After checking the docs, this implies that the model definition must be available in the session, right? Is it necessary to create a custom struct and apply the Flux.@functor macro to it before saving (like in the docs)? Must we also repeat these steps before loading (creating the same struct and applying the macro)? I'm asking because of this line in the docs:
model = MyModel(); # MyModel definition must be available

@ToucheSir
Member

state strips out any custom container types and gives you a tree of plain old Julia objects (tuples, named tuples, arrays), which should be easier to save and mostly immune to type-related breakage down the line. Any layer types with parameters or non-trainable state do need to support functor for this to work, but you'll need those declarations anyhow, because loadmodel! takes an already-constructed model to stuff the aforementioned tree of plain objects back into.
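For example, what state returns for a small model (the structure shown in the comment is approximate, based on Flux 0.14-era behaviour):

```julia
using Flux

model = Chain(Dense(2 => 3, relu))
s = Flux.state(model)
# `s` is a plain NamedTuple tree, roughly:
#   (layers = ((weight = Float32[...], bias = Float32[...], σ = ()),),)
# Functions and other unsupported leaves are replaced by `()`.
```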

@tom-plaa

tom-plaa commented Sep 20, 2023

Thank you, I managed to make it work with the loadmodel! function. I will update my other issue accordingly. It might be clearer to expand the documentation to mention that you need to rebuild the custom struct and apply the functor macro when loading as well.

@ToucheSir
Member

The reason we don't mention that in the docs is the same reason PyTorch doesn't mention that you need to define all of a model's layer types before calling model.load_state_dict(...): if you already have a model to load into, then all of the custom layer structs, @functor definitions, etc. must already be present! That said, this issue exists in the first place because the saving and loading docs could use some work, so any suggestions (ideally in the form of PRs) are appreciated :)
