Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Safety gridworld needs #92

Open
4 tasks
jjfiv opened this issue Apr 5, 2019 · 3 comments
Open
4 tasks

Safety gridworld needs #92

jjfiv opened this issue Apr 5, 2019 · 3 comments
Labels
gridworld rust This should probably be solved in Rust.

Comments

@jjfiv
Copy link
Collaborator

jjfiv commented Apr 5, 2019

with @kclary

  • "stone" status: once you touch this tile, you can no longer move.
  • gridworld time-limits configurable (episode end when time runs out)
  • single-push button: change another tile's behavior
  • interruption tile: with p=.5 set stone status
@jjfiv
Copy link
Collaborator Author

jjfiv commented Apr 5, 2019

  • interruption tile immediately end episode instead with probability
  • button could lower probability of "stone" status

@jjfiv jjfiv added gridworld rust This should probably be solved in Rust. labels Apr 5, 2019
@jjfiv
Copy link
Collaborator Author

jjfiv commented Apr 5, 2019

  • "sokoban" push boxes
  • supervisor "tiles" - generalized is "negative rewards only occur during periodic time intervals with graphical representation"
  • directional reward: only give reward when entering tile from certain direction
  • whisky/confusion tile: adds random walk behavior (every future step is random with p=.9)
  • colored background tiles
  • monty-hall boxes (not from paper), related but not as complex

@etosch
Copy link
Collaborator

etosch commented Apr 11, 2020

@kclary @jjfiv one of you found a good gridworlds library. Do you think it's worth it to try to integrate?

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
gridworld rust This should probably be solved in Rust.
Projects
None yet
Development

No branches or pull requests

2 participants