[RLlib] Implement TorchPolicy.export_model #13989

Conversation
dummy_inputs = self._lazy_tensor_dict(self._dummy_batch.data)
# Provide dummy state inputs if not an RNN (torch cannot jit with
# returned empty internal states list).
if "state_in_0" not in dummy_inputs:
Why does torch jit require state_in even if the model is not an RNN?
Agree, this is a total hack. However, torch requires the output to be some tensor (or nested struct of tensors), but NOT an empty list :/ That's why we need to fake it here. One more reason to keep thinking about a possible new ModelV3 API :)
torch jit doesn't require state_in, it's the self.model that requires it.
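The constraint described in this exchange can be illustrated with a minimal sketch. This is a toy module, not RLlib's actual ModelV2 (all names here are hypothetical): `torch.jit.trace` only supports tensors and nested tuples/lists *of tensors* as outputs, so a `forward()` that returns an empty state list cannot be traced, while echoing a fake state tensor back keeps the module traceable.

```python
import torch
import torch.nn as nn


class TraceableModel(nn.Module):
    """Toy stand-in for a non-RNN policy model (hypothetical).

    Returning ``out, []`` from forward() would break torch.jit.trace,
    since traced outputs must be tensors or nested tuples/lists of
    tensors. Passing a dummy state tensor through instead works.
    """

    def __init__(self):
        super().__init__()
        self.fc = nn.Linear(4, 2)

    def forward(self, obs, state):
        # Echo the (fake) state back so the output contains only tensors.
        return self.fc(obs), state


model = TraceableModel()
dummy_obs = torch.randn(1, 4)
dummy_state = torch.zeros(1, 1)  # fake state tensor; any shape is fine
traced = torch.jit.trace(model, (dummy_obs, dummy_state))
out, state_out = traced(dummy_obs, dummy_state)
print(out.shape)  # torch.Size([1, 2])
```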
(dummy_inputs, state_ins, seq_lens))
if not os.path.exists(export_dir):
    os.makedirs(export_dir)
file_name = os.path.join(export_dir, "model.pt")
Also enable the user to customize the model name here.
Yeah, was thinking about this, too. The problem is: I didn't want to change the method's signature, which is export_model(self, export_dir). For TF, this is ok b/c a TF model export will produce many files (inside the export_dir). For Torch, it's just a single file.
But yes, we could add an optional arg (filename=None) to the torch method (and then make the base's signature: export_dir, **kwargs).
Once the conflict is resolved, we can merge it ^^
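The signature change discussed above could look roughly like this (a hypothetical sketch, not the actual RLlib code; the real TorchPolicy method would perform the TorchScript export, whereas this sketch only computes the target path):

```python
import os
import tempfile


class Policy:
    # Base class keeps a generic signature; torch-specific extras
    # (like a filename) pass through **kwargs.
    def export_model(self, export_dir, **kwargs):
        raise NotImplementedError


class TorchPolicy(Policy):
    # Torch override adds an optional filename, defaulting to "model.pt".
    def export_model(self, export_dir, filename=None):
        os.makedirs(export_dir, exist_ok=True)
        # The real method would torch.jit.save() the traced model here;
        # this sketch only returns where the file would go.
        return os.path.join(export_dir, filename or "model.pt")


export_dir = tempfile.mkdtemp()
default_path = TorchPolicy().export_model(export_dir)
custom_path = TorchPolicy().export_model(export_dir, filename="policy.pt")
print(os.path.basename(default_path))  # model.pt
print(os.path.basename(custom_path))  # policy.pt
```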
Implement TorchPolicy.export_model. This method is currently missing, and users get a NotImplementedError when trying to call it on any TorchPolicy.
This PR provides a solution that stores the Policy's model as a TorchScript.
The only remaining problem is for non-RNNs: a fake state-in tensor (any tensor is fine) must be provided when calling the TorchScript model. This is because torch.jit cannot handle the otherwise empty internal-states list returned by RLlib's ModelV2s.
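An end-to-end round trip of what this PR enables might look like the following sketch. The toy model, shapes, and paths here are assumptions for illustration, not RLlib's actual export code: trace with a fake state tensor, save to export_dir/model.pt, reload, and call the loaded model with the same fake state.

```python
import os
import tempfile

import torch
import torch.nn as nn


class ToyPolicyModel(nn.Module):
    """Hypothetical stand-in for a non-RNN RLlib model."""

    def __init__(self):
        super().__init__()
        self.fc = nn.Linear(4, 2)

    def forward(self, obs, state):
        # Pass the fake state through so outputs are tensors only.
        return self.fc(obs), state


export_dir = tempfile.mkdtemp()
dummy_obs = torch.randn(1, 4)
fake_state = torch.zeros(1, 1)  # fake "state_in_0" for a non-RNN model

# Trace and export, mirroring the model.pt naming used in the PR.
traced = torch.jit.trace(ToyPolicyModel(), (dummy_obs, fake_state))
file_name = os.path.join(export_dir, "model.pt")
torch.jit.save(traced, file_name)

# Reload and call; the fake state must be supplied again.
loaded = torch.jit.load(file_name)
logits, _ = loaded(dummy_obs, fake_state)
print(logits.shape)  # torch.Size([1, 2])
```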
Why are these changes needed?
Related issue number
Checks
I've run scripts/format.sh to lint the changes in this PR.