Implement "Real" RL Baseline Method like A2C #12

lebrice · 2020-09-27T21:26:49Z

We need an actual "baseline" method for RL. It should probably be an on-policy method such as A2C, given how data generation currently works (there are no replay buffers yet).
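For reference, here is a rough sketch of the kind of loss an A2C-style on-policy baseline would compute from a single fresh rollout, with no replay buffer involved. It is only an illustration in plain Python: the function names (`discounted_returns`, `a2c_loss`) and the `value_coef` parameter are hypothetical and not part of Sequoia, and a real implementation would use an autodiff framework and treat the advantages as constants in the policy term.

```python
def discounted_returns(rewards, gamma=0.99, bootstrap_value=0.0):
    """Discounted return at every step of one rollout (hypothetical helper).

    `bootstrap_value` is the critic's estimate for the state after the last
    step; use 0.0 when the rollout ended in a terminal state.
    """
    returns = []
    running = bootstrap_value
    # Walk the rollout backwards so each step reuses the return of the next one.
    for reward in reversed(rewards):
        running = reward + gamma * running
        returns.append(running)
    returns.reverse()
    return returns


def a2c_loss(log_probs, values, rewards, gamma=0.99, value_coef=0.5,
             bootstrap_value=0.0):
    """Scalar A2C-style loss for one rollout (illustrative sketch only).

    log_probs: log-probability of each action that was actually taken.
    values:    critic estimate V(s_t) for each visited state.
    rewards:   reward received at each step.
    """
    returns = discounted_returns(rewards, gamma, bootstrap_value)
    # Advantage: how much better the observed return was than the critic expected.
    advantages = [g - v for g, v in zip(returns, values)]
    n = len(rewards)
    # Actor term: raise the log-probability of actions with positive advantage.
    policy_loss = -sum(lp * adv for lp, adv in zip(log_probs, advantages)) / n
    # Critic term: mean squared error between the returns and the value estimates.
    value_loss = sum(adv ** 2 for adv in advantages) / n
    return policy_loss + value_coef * value_loss
```

Because the loss only uses transitions collected by the current policy, the rollout can be discarded after each update, which fits the current data generation setup.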

lebrice created this issue from a note in Sequoia - Initial Release (To do) Sep 27, 2020

lebrice mentioned this issue Oct 30, 2020

Add the Continual RL Branch to the Tree #21

Merged

lebrice linked a pull request Oct 30, 2020 that will close this issue

Add the Continual RL Branch to the Tree #21

Merged

lebrice mentioned this issue Oct 30, 2020

Validate the RL "baseline" Methods #23

Closed

lebrice closed this as completed Dec 9, 2020

Sequoia - Initial Release automation moved this from To do to Done Dec 9, 2020
