Describe the solution you'd like
When using fit_online, it would be very useful to save all the experiences into an MDPDataset so that we can later use them for offline RL to improve the policy.
Does it make sense to reuse the Buffers already implemented for online learning? Or should we think about another mechanism, such as the OpenAI Gym Monitor wrapper, to do this?
Perhaps not all online learning algorithms use a Buffer; maybe a new parameter to fit_online could save every transition into a history, and at every save interval also save the corresponding MDPDataset?
@jamartinh Hello, sorry for the late response... I've implemented a to_mdp_dataset method on ReplayBuffer. Do you think this is good enough for your requirement? 5936131
# convert online buffer to static dataset by tracing Transition objects in the buffer.
dataset = replay_buffer.to_mdp_dataset()
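To illustrate what a to_mdp_dataset-style conversion does under the hood, here is a minimal, self-contained sketch. The SimpleReplayBuffer and Transition classes are hypothetical stand-ins (not d3rlpy's actual implementation); the point is just the core step of slicing a flat transition log into episodes by terminal flags, which is what a static dataset needs.

```python
from dataclasses import dataclass, field


@dataclass
class Transition:
    # Minimal transition record; a real buffer would also store
    # next observations and other metadata.
    observation: float
    action: int
    reward: float
    terminal: bool


@dataclass
class SimpleReplayBuffer:
    # Hypothetical stand-in for an online replay buffer.
    transitions: list = field(default_factory=list)

    def append(self, observation, action, reward, terminal):
        self.transitions.append(
            Transition(observation, action, reward, terminal)
        )

    def to_episodes(self):
        # Group the flat transition log into episodes using terminal
        # flags -- the core of converting a buffer to a static dataset.
        episodes, current = [], []
        for t in self.transitions:
            current.append(t)
            if t.terminal:
                episodes.append(current)
                current = []
        if current:
            # Keep a trailing partial episode, if any.
            episodes.append(current)
        return episodes
```

The resulting list of episodes maps naturally onto an episodic dataset structure like MDPDataset, which expects transitions grouped per episode rather than one flat stream.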