
[Enhancement] Support for fallbacks when loading cloudpickle items fail #172

Closed
Miffyli opened this issue Oct 1, 2020 · 5 comments
Labels
enhancement New feature or request

Comments

@Miffyli
Collaborator

Miffyli commented Oct 1, 2020

Related issue #171

As seen in #171, cloudpickle/pickle can easily fail when transferring models between Python versions (and in some other edge cases). There is currently no way to work around this, because loading a model tries to un-pickle every pickled object and does not catch errors.

Two suggestions to deal with this, both of which would be nice to have:

  1. Provide a custom_objects dictionary (similar to what we had in SB2) that overrides any objects read from the file. This could even be used to prevent loading an item from the file at all, in which case all JSON-ed items can be used as before.
  2. Update the set_parameters function to load only the parameters from the file, not the data objects (which include the pickled items). This was included in SB2.
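The first suggestion could work roughly as sketched below. This is a minimal illustration, not SB3's actual loading code: the function name and the dict-of-pickled-blobs layout are hypothetical, but the core idea matches the proposal: an entry present in custom_objects is never un-pickled at all, so a corrupted or version-incompatible blob can be bypassed.

```python
import pickle

def load_with_fallbacks(raw_items, custom_objects=None):
    """Deserialize a dict of pickled byte strings, skipping unpickling
    entirely for any key the caller overrides via custom_objects."""
    custom_objects = custom_objects or {}
    result = {}
    for name, blob in raw_items.items():
        if name in custom_objects:
            # The caller's value wins; the stored blob is never touched,
            # so it cannot raise an unpickling error.
            result[name] = custom_objects[name]
            continue
        try:
            result[name] = pickle.loads(blob)
        except Exception as exc:
            raise RuntimeError(
                f"Could not unpickle '{name}' and no fallback was given"
            ) from exc
    return result

# Usage: override a broken schedule entry instead of unpickling it.
items = {"gamma": pickle.dumps(0.99), "lr_schedule": b"\x80corrupted"}
loaded = load_with_fallbacks(
    items, custom_objects={"lr_schedule": lambda _: 3e-4}
)
print(loaded["gamma"])  # 0.99
```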

@ManifoldFR If you had other suggestions please feel free to throw them here :)

@Miffyli Miffyli added the enhancement New feature or request label Oct 1, 2020
@ManifoldFR
Contributor

ManifoldFR commented Oct 1, 2020

From what I saw, the problem I encountered is with the learning-rate and clip-ratio schedulers (for PPO), in the particular case where a constant scheduler is created using constant_fn (I don't know about other cases), because pickling/unpickling functions is brittle, especially when they are locals of a higher-order function. I think we can look at how PyTorch supports saving state dicts for schedulers. From the source I see they use classes and the __dict__ attribute to save/load state dicts.
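The PyTorch pattern mentioned above can be sketched like this. The ConstantSchedule class is hypothetical (not SB3's or PyTorch's actual API); the point is that only plain attribute data is serialized, so no closure ever needs to be pickled:

```python
class ConstantSchedule:
    """Hypothetical callable schedule that serializes via plain
    attributes instead of a pickled closure, mirroring how PyTorch's
    _LRScheduler saves and loads self.__dict__."""

    def __init__(self, value: float):
        self.value = value

    def __call__(self, progress_remaining: float) -> float:
        # Constant schedule: same value regardless of progress.
        return self.value

    def state_dict(self) -> dict:
        # Only plain data ends up here, so it survives
        # serialization across Python versions.
        return dict(self.__dict__)

    def load_state_dict(self, state: dict) -> None:
        self.__dict__.update(state)

# Round-trip: state from one instance restores another.
sched = ConstantSchedule(3e-4)
state = sched.state_dict()
restored = ConstantSchedule(0.0)
restored.load_state_dict(state)
print(restored(1.0))  # 0.0003
```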

In general I think the ideas for other fallbacks are good because it allows the user more flexibility.

@ManifoldFR
Contributor

ManifoldFR commented Oct 3, 2020

I started a branch for schedulers. I don't think we should subclass PyTorch's _LRScheduler directly to get the learning rate, because those are designed to step with the optimizer.
I think we can take inspiration from their design though:

  • use a class with state_dict/load_state_dict methods
  • update the scheduler parameters (namely timesteps) using their step() method, and update the param groups from there when applicable (i.e. not for the clip ratio)

The second point would require slightly modifying OnPolicyAlgorithm and OffPolicyAlgorithm. We'll still make the class Callable so as to not modify other classes.
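A minimal sketch of such a steppable, Callable schedule, assuming hypothetical names (this is not the branch's actual code): the algorithm would call step() once per update, and the mutable counter lives in state_dict so it can be saved and restored without pickling any function:

```python
class LinearSchedule:
    """Hypothetical schedule that interpolates linearly from `start`
    to `end`. It stays a plain Callable, so the rest of the code base
    needs no changes; only the training loop gains a step() call."""

    def __init__(self, start: float, end: float, total_steps: int):
        self.start = start
        self.end = end
        self.total_steps = total_steps
        self.num_steps = 0  # advanced by step(), saved in state_dict

    def step(self) -> None:
        self.num_steps += 1

    def __call__(self) -> float:
        frac = min(self.num_steps / self.total_steps, 1.0)
        return self.start + frac * (self.end - self.start)

    def state_dict(self) -> dict:
        return dict(self.__dict__)

    def load_state_dict(self, state: dict) -> None:
        self.__dict__.update(state)

# Usage: two steps out of four puts us halfway along the ramp.
lr = LinearSchedule(1e-3, 0.0, total_steps=4)
for _ in range(2):
    lr.step()
print(lr())  # 0.0005
```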

@Miffyli
Collaborator Author

Miffyli commented Oct 4, 2020

I wouldn't focus only on schedules, though, as other objects are also stored as pickled objects (with no easy alternative). I am not sure having non-pickled versions of all objects is possible, and it would not be easy at the very least. I'd focus on the fallback tools to help reconstruct models if loading fails (part of the reason the saved model currently contains some description of each pickled object). However, if you can find something that achieves less brittle serialization, I am all ears!

@aliamiri1380

aliamiri1380 commented Feb 12, 2021

this can create a file from the model, but it's too large:

import dill  # dill can serialize objects (e.g. local functions) that pickle cannot

with open('nnn.pkl', 'wb') as f:
    dill.dump(model, f)

then you can save and load the dumped model without any error:

import dill
from stable_baselines3 import PPO

with open('nnn.pkl', 'rb') as f:
    model = dill.load(f)
model.save('nnn')
model = PPO.load('nnn')

this sometimes gives a CUDA error; if you see that, modify serialization.py as described here:
https://stackoverflow.com/questions/56369030/runtimeerror-attempting-to-deserialize-object-on-a-cuda-device

@araffin
Member

araffin commented Mar 6, 2021

As a follow-up, I added support for custom objects here: #336
and included it in the RL Zoo: DLR-RM/rl-baselines3-zoo#69
(so models trained with Python 3.6/3.7 can be loaded with Python 3.8+, but not retrained yet)
