Skip to content

JiwenJ/Awesome-RL

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 
 
 
 
 

Repository files navigation

Awesome RL

Awesome Static Badge Static Badge Static Badge

Contents


Books


Courses


RL Research Topics

  • Approximate Dynamic Programming and Offline RL

    Approximate Dynamic Programming (ADP) concerns obtaining approximate solutions to large planning problems, often with the help of sampling and function approximation. Many ADP methods can be considered as prototype algorithms for popular value-based RL algorithms used today, especially in the offline setting, so it is important to understand their behaviors and guarantees.

    • Online + Offline (Hybrid)

      • Policy Finetuning: Bridging Sample-Efficient Offline and Online Reinforcement Learning
      • Policy Finetuning in Reinforcement Learning via Design of Experiments using Offline Data
      • Hybrid rl: Using both offline and online data can make rl efficient
  • Multi-agent RL

  • Off-policy Evaluation

    How to estimate the performance of a policy using data collected from a different policy? This question has important implications in safety and real-world applications of RL.


GitHub Repo


Website


Activity


Application


Community


Conference & Journal

Conference: NIPS, ICML, ICLR, AAAI, IJCAI, AAMAS, IROS, etc.

Journal: JMLR, JAIR, JAAMAS, etc.


Research Group

Other outer link

⬆ back to top


Industry Group

⬆ back to top


Misc

⬆ back to top


Discussion

  1. Policy-based vs. Value-based [ZhiHu]
  2. Philosophy of Reinforcement Learning

⬆ back to top


Contributing

This is an active repository and it is time-consuming to maintain the content. So your contributions really matter!

If you find it helpful, please vote for it by adding 👍.

If you have any question about this list, do not hesitate to contact me at 1546631808@qq.com.

⬆ back to top


Reference

⬆ back to top

About

A curated list of RL resources

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published