You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Brief Discussion about the Intergroup Discussions planned for around November
Some points brought up (in addition to the slides)
People need prior knowledge to understand some of the papers, prereqs should be listed somewhere
Most people don't know how to read papers, we'll need a good split on the people that can actually read vs. people that can.
RL Group Covered:
Value Function and on-policy/off-policy review.
Covered Q-Learning, SARSA, Double Q-Learning.
Advantages and Disadvantages of on/off policy
Some intuition on how to actually program Q-Learning. (Small introduction on epsilon greedy, mentioned Experience Replay, Hyperparameter Selection, all the gross stuff)