dh edited this page Jul 21, 2017 · 3 revisions

Todos

ideas

  • reward function learning (inverse RL)
  • bonus for surviving
  • object detection
  • attention model
  • transfer learning
  • lives: give a larger penalty for losing a life
  • semantic segmentation (edge) difference as a bonus
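
Two of the ideas above (bonus for surviving, larger penalty for losing a life) can be sketched as simple reward shaping. This is only a rough illustration, not code from this project: the function name `shaped_reward`, the parameters `survival_bonus` and `life_penalty`, and their default values are all hypothetical, and the life counts are assumed to be supplied by the environment.

```python
def shaped_reward(env_reward, lives_before, lives_after, done,
                  survival_bonus=0.01, life_penalty=1.0):
    """Adjust the raw environment reward for one step.

    Hypothetical helper: subtracts `life_penalty` per life lost on this
    step; otherwise adds `survival_bonus` if the episode is still running.
    The default values are placeholders, not tuned settings.
    """
    shaped = float(env_reward)
    lives_lost = max(lives_before - lives_after, 0)
    if lives_lost > 0:
        # Losing a life is penalised more heavily than the raw reward alone.
        shaped -= life_penalty * lives_lost
    elif not done:
        # Small bonus for every step the agent stays alive.
        shaped += survival_bonus
    return shaped
```

The bonus is skipped on the terminal step so that ending an episode never earns a survival bonus; whether that is the right choice would need to be checked experimentally.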