Skip to content

Latest commit

 

History

History
7 lines (6 loc) · 284 Bytes

File metadata and controls

7 lines (6 loc) · 284 Bytes

Meta-Learning-for-Reinforcement-Learning

Reptile algorithm (Meta) for PPO (RL) on 'Reacher' environment.

The Reacher environment: