forked from Alfredvc/paac
-
Notifications
You must be signed in to change notification settings - Fork 3
TODOs and Ideas
dh edited this page Jul 21, 2017
·
3 revisions
-
July 5th Wed, 6:10-7:10pm:
- play Montezuma's Revenge yourself
- train any algorithm of your choice on Montezuma's Revenge and share results (testing environment shared here)
- (optional) read NLP-based solution to MZ
- (optional) read why MZ is hard
-
By July 15th Sat, 2-4pm:
- Hoyeop Kim: Count-Based Exploration 2017
- Sangjin Park: Hierarchical RL
- DH: FeUdal Networks
- HG: [UNREAL](https://arxiv.org/abs/1611.05397]
-
By July 20th: 9:30pm:
- DH: modularize paac.py
- DH: integrate CTS to paac.py
- Sangjin: try MOL bonus
- HG: try feature control bonus
-
By July 27th:
-
By Aug 3rd:
-
By Aug 10th
- publish results (play video, blogpost)
- reward function learning (inverse RL)
- bonus for surviving
- objects detection
- attention model
- transfer learning
- life -> give larger penalty for losing a life
- semantic segmentation (edge) difference as bonus