RL-ninja / beating-montezuma Public

forked from Alfredvc/paac

TODOs and Ideas

dh edited this page Jul 21, 2017 · 3 revisions

Todos

By Aug 10th:
- publish results (play video, blogpost)
By Aug 3rd:
By July 27th:
By July 20th, 9:30pm:
- DH: modularize paac.py
- DH: integrate CTS into paac.py
- [?] Sangjin: try MOL bonus
- [?] HG: try feature control bonus
July 5th Wed, 6:10-7:10pm:
- play Montezuma's Revenge yourself
- train any algorithm of your choice on Montezuma's Revenge and share results (testing environment shared here)
- (optional) read NLP-based solution to MZ
- (optional) read why MZ is hard
By July 15th Sat, 2-4pm:
- Hoyeop Kim: Count-Based Exploration 2017
- Sangjin Park: Hierarchical RL
- DH: FeUdal Networks
- HG: [UNREAL](https://arxiv.org/abs/1611.05397)

Ideas

- reward function learning (inverse RL)
- bonus for surviving
- object detection
- attention model
- transfer learning
- lives: give a larger penalty for losing a life
- semantic segmentation (edge) difference as bonus
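
Two of the ideas above, the survival bonus and the larger penalty for losing a life, amount to simple reward shaping. Below is a minimal Python sketch, assuming the training loop can read the number of remaining lives before and after each step. The function name `shape_reward` and the default values of `survival_bonus` and `life_loss_penalty` are hypothetical, chosen only for illustration; they do not come from paac.py or from any experiment.

```python
def shape_reward(env_reward, lives_before, lives_after,
                 survival_bonus=0.01, life_loss_penalty=1.0):
    """Return a shaped reward for one environment step.

    env_reward        -- raw reward returned by the game for this step
    lives_before      -- remaining lives before the step
    lives_after       -- remaining lives after the step
    survival_bonus    -- small bonus for a step without a death (assumed value)
    life_loss_penalty -- penalty subtracted when a life is lost (assumed value)
    """
    if lives_after < lives_before:
        # A life was lost on this step: no survival bonus, apply the penalty.
        return env_reward - life_loss_penalty
    # The agent survived this step: add the small bonus.
    return env_reward + survival_bonus


if __name__ == "__main__":
    print(shape_reward(0.0, lives_before=5, lives_after=5))  # survived
    print(shape_reward(0.0, lives_before=5, lives_after=4))  # lost a life
```

The bonus is kept much smaller than the penalty so that surviving alone cannot outweigh a death. Whether either term actually helps on Montezuma's Revenge would need to be checked experimentally.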
