Fully support from/to numpy/pytorch for Batch #62

duburcqa · 2020-05-29T08:47:20Z

The current implementation of PPO and other policy algorithms does not support action dicts because of this line.

It could be solved by adding a new method to the Batch class that converts the relevant fields back to torch.Tensor.

I'm opening a PR to fix that.
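To illustrate the kind of conversion being proposed, here is a minimal sketch that walks a nested action dict and turns every NumPy leaf into a torch.Tensor. The helper name `to_torch_nested` and the example keys are hypothetical and are not the Batch API; a plain dict stands in for Batch.

```python
import numpy as np
import torch


def to_torch_nested(data, dtype=None, device="cpu"):
    """Hypothetical helper: recursively convert ndarray leaves to torch.Tensor."""
    if isinstance(data, dict):
        # Recurse into nested dicts (e.g. a dict-valued action).
        return {k: to_torch_nested(v, dtype, device) for k, v in data.items()}
    if isinstance(data, np.ndarray):
        # from_numpy avoids a copy on CPU.
        data = torch.from_numpy(data)
    if isinstance(data, torch.Tensor):
        if dtype is not None:
            data = data.to(dtype)
        return data.to(device)
    # Leave any other leaf type untouched.
    return data


# Example keys are made up for illustration.
act = {
    "move": np.array([0.1, 0.2], dtype=np.float32),
    "grip": {"force": np.array([1.0], dtype=np.float32)},
}
act_t = to_torch_nested(act)
```

Recursing over the container keeps the dtype and device arguments flowing down to every nested field, which is the part that tends to be forgotten when the conversion is written inline.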


duburcqa · 2020-05-29T09:15:58Z

Also, the use of torch.tensor should be prohibited for converting a numpy array to a torch tensor, since it is less efficient and breaks memory sharing on CPU.

Trinkle23897 · 2020-05-29T09:39:47Z

> Also, the use of torch.tensor should be prohibited for converting a numpy array to a torch tensor, since it is less efficient and breaks memory sharing on CPU.

In most scenarios, the agent's action contains only a few elements, so the conversion cost can be considered negligible.
Currently, the replay buffer is stored as np.ndarray. If you want to prohibit the conversion, the underlying data structure would have to change to torch.tensor. But I don't think that is a good approach, since GPU memory is far smaller than RAM, and some basic operations (e.g. computing the returns) are more efficient on the CPU side.

duburcqa · 2020-05-29T09:42:34Z

I'm not saying to use only torch.tensor, but to convert the arrays using torch.from_numpy. Sorry for the lack of clarity :/
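The difference between the two calls can be checked with a small sketch (the variable names are illustrative only): torch.tensor copies the array, while torch.from_numpy returns a tensor that shares the CPU buffer with it.

```python
import numpy as np
import torch

arr = np.zeros(3, dtype=np.float32)

copied = torch.tensor(arr)      # allocates new memory and copies the data
shared = torch.from_numpy(arr)  # shares the underlying CPU buffer with arr

arr[0] = 1.0  # mutate the NumPy array in place

copied_first = copied[0].item()  # unaffected by the mutation
shared_first = shared[0].item()  # reflects the mutation
```

Because the buffer is shared, in-place writes through either object are visible in the other, so this conversion is only safe when the caller does not expect an independent copy.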

duburcqa · 2020-05-29T12:12:22Z

@Trinkle23897 The PR should be ready now. I tried to make the minimal modifications needed to fully support Batch from/to numpy/pytorch.

duburcqa mentioned this issue May 29, 2020

Robust conversion from/to numpy/pytorch #63

Merged

duburcqa changed the title ~~Add support of action dict~~ Fully support from/to numpy/pytorch for Batch May 29, 2020

duburcqa closed this as completed May 29, 2020

Trinkle23897 added the enhancement Feature that is not a new algorithm or an algorithm enhancement label Jun 1, 2020

Trinkle23897 added this to TODO in Issue/PR Categories via automation Jun 1, 2020

Trinkle23897 moved this from TODO to Other feature requests in Issue/PR Categories Jun 1, 2020

This was linked to pull requests Jun 29, 2020

Robust conversion from/to numpy/pytorch #63

Merged

Fix 'to_tensor' dtype/device forwarding for Batch over Batch. #68

Merged

Fix to_numpy. #73

Merged
