7/27 |
Deep Recurrent Q-Learning for Partially Observable MDPs, M. Hausknecht et al, 2015. |
Minsuk Sung |
[paper] [review] |
8/3 |
Hierarchical Visuomotor Control of Humanoids, J. Merel et al, 2018. |
Seonghyeon Moon |
[paper] [review] |
8/10 |
Learning Dexterous In-Hand Manipulation, M. Andrychowicz et al, 2020. |
Ingyun Ahn |
[paper] [review] |
8/10 |
Deep Reinforcement Learning with a Natural Language Action Space, J. He et al, 2015. |
Jihun Kim |
[paper] [review] |
8/10 |
Program Guided Agent, SH. Sun et al, 2020. |
Haneul Choi |
[paper] [review] |
8/24 |
Trust Region Policy Optimization, J. Schulman et al, 2015. |
Chris Ohk |
[paper] [review] |
8/31 |
Proximal Policy Optimization Algorithms, J. Schulman et al, 2017. |
Chris Ohk |
[paper] [review] |
8/31 |
Implementation Matters in Deep RL: A Case Study on PPO and TRPO, L. Engstrom et al, 2020. |
Yunhyeok Kwak |
[paper] [review] |
9/7 |
Generative Adversarial Imitation Learning, J. Ho et al, 2016. |
Hoesung Ryu |
[paper] [review] |
9/14 |
Efficient Reductions for Imitation Learning, S. Ross et al, 2010. |
Hyecheol (Jerry) Jang |
[paper] [review] |
9/14 |
Variational Discriminator Bottleneck: Improving Imitation Learning, Inverse RL, and GANs by Constraining Information Flow, XB. Peng et al, 2018. |
Do-Hoon Kim |
[paper] [review] |
9/14 |
Grandmaster Level in StarCraft II using Multi-agent Reinforcement Learning, O. Vinyals et al, 2019. |
Donggu Kang |
[paper] [review] |