Add `DrivingGridworld` subclass that records each experience as it's played. #15
Labels: enhancement
Override both the `its_showtime` and `play` methods to record a copy of each `Road`, reward, and discount factor before returning the `Observation`, reward, and discount from `super`.
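A minimal sketch of how this could look, written as a mixin so it can be shown without the real class. It assumes that `its_showtime` and `play` each return an `(observation, reward, discount)` tuple and that the game exposes its current `Road` as a `road` attribute; the attribute name, `ExperienceRecorderMixin`, `recorded_experiences`, and `StubGridworld` are all hypothetical names introduced for illustration.

```python
import copy


class ExperienceRecorderMixin:
    """Records (road, reward, discount) every time the game advances.

    Assumption: the base class stores the current road in `self.road` and
    both `its_showtime` and `play` return (observation, reward, discount).
    """

    def _record(self, reward, discount):
        # Create the log lazily so the base class __init__ is left untouched.
        if not hasattr(self, "recorded_experiences"):
            self.recorded_experiences = []
        # Deep-copy the road so later steps cannot mutate the stored record.
        self.recorded_experiences.append(
            (copy.deepcopy(self.road), reward, discount)
        )

    def its_showtime(self):
        observation, reward, discount = super().its_showtime()
        self._record(reward, discount)
        return observation, reward, discount

    def play(self, actions):
        observation, reward, discount = super().play(actions)
        self._record(reward, discount)
        return observation, reward, discount


class StubGridworld:
    """Hypothetical stand-in for DrivingGridworld, used only for this demo."""

    def __init__(self):
        self.road = {"car_col": 0}

    def its_showtime(self):
        return "obs0", None, 1.0

    def play(self, actions):
        self.road["car_col"] += actions
        return "obs{}".format(self.road["car_col"]), -1.0, 0.9


class RecordingStubGridworld(ExperienceRecorderMixin, StubGridworld):
    pass
```

If those assumptions hold for the real class, the same mixin could be combined with it as `class RecordingDrivingGridworld(ExperienceRecorderMixin, DrivingGridworld): pass`. Listing the mixin first matters: it puts the recording overrides ahead of the base class in the method resolution order, so `super()` reaches the original `its_showtime` and `play`.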