Updated 1.dqn for compatibility with PyTorch 0.4 and 1.0 #24

joleeson · 2019-02-08T03:09:54Z

Updated for compatibility with the latest PyTorch versions (more thorough than the recommendations in "Update to run on torch 0.4" #20):

- no longer uses the deprecated "Variable" class
- uses appropriate dtypes
- CPU/GPU-agnostic code
- uses tensor.item() to convert 0-dimensional tensors to ordinary Python numbers
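
A minimal sketch of what these four points tend to look like in PyTorch 0.4 and later. The `Linear` layer is a hypothetical stand-in for the notebook's Q-network, and the tensor values are made up for illustration; this is not the code from the PR.

```python
import torch

# Pick the device once; tensors and modules are then created on or moved to it.
device = torch.device("cuda" if torch.cuda.is_available() else "cpu")

# Hypothetical stand-in for the Q-network: 4 state features, 2 actions.
model = torch.nn.Linear(4, 2).to(device)

# Explicit dtypes: float32 for observations, int64 for action indices.
state = torch.tensor([[0.1, -0.2, 0.3, 0.0]], dtype=torch.float32, device=device)
action = torch.tensor([1], dtype=torch.int64, device=device)

# torch.no_grad() replaces the old Variable(..., volatile=True) idiom.
with torch.no_grad():
    q_values = model(state)  # shape (1, 2)

# gather() picks Q(s, a) for the chosen action.
q_value = q_values.gather(1, action.unsqueeze(1)).squeeze(1)  # shape (1,)

# .item() turns a one-element tensor into a plain Python number.
greedy_action = q_values.argmax(dim=1).item()  # int
q_scalar = q_value.sum().item()                # float
```

Plain tensors carry autograd state themselves since 0.4, so no `Variable` wrapper is needed anywhere above.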

Made changes such that the algorithm more closely matches that in Mnih et al. (2015) and other DQN literature:

- linear epsilon decay
- frame stacking
- training now happens once every 4 environment steps for Atari envs
- option of using Huber loss instead of mean squared error loss in compute_td_loss()
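
A rough sketch of three of the items above in plain Python, to show the arithmetic only. The function names are hypothetical, and the defaults (epsilon from 1.0 to 0.1 over one million steps, an update every 4 steps) follow the schedule reported in Mnih et al. (2015); the values used in this PR may differ. In PyTorch, the Huber loss with threshold 1 is available as `F.smooth_l1_loss`.

```python
def linear_epsilon(step, eps_start=1.0, eps_final=0.1, decay_steps=1000000):
    """Linearly anneal epsilon from eps_start to eps_final, then hold it."""
    fraction = min(max(step, 0) / decay_steps, 1.0)
    return eps_start + fraction * (eps_final - eps_start)


def huber(td_error, delta=1.0):
    """Quadratic for small errors, linear for large ones (limits gradient size)."""
    abs_err = abs(td_error)
    if abs_err <= delta:
        return 0.5 * td_error ** 2
    return delta * (abs_err - 0.5 * delta)


def squared(td_error):
    """Per-sample squared error, for comparison with huber()."""
    return td_error ** 2


# Training frequency: update the network only on every 4th environment step.
train_freq = 4
update_steps = [step for step in range(1, 13) if step % train_freq == 0]
```

For a TD error of 3.0 the squared error is 9.0 while the Huber loss is 2.5, which is why the Huber variant is less sensitive to outlier transitions.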

Borrowed monitoring wrapper from OpenAI's Baselines to log progress of training.
Modified the wrappers so that they now accommodate stacked frames (see "frame_stack default to False" #9) and output them as a LazyFrames object. The axes of the data are swapped appropriately for PyTorch, i.e. (no. of channels) x (breadth) x (height).
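
A small NumPy sketch of the stacking and axis swap described above. It assumes the swap exchanges the first and last axes (as `np.swapaxes(obs, 2, 0)` does), which matches the stated channels x breadth x height order; the frame size and `k` are made up, and this is not the wrapper code itself. Baselines' LazyFrames additionally defers the concatenation to save replay-buffer memory, which this sketch does not attempt.

```python
from collections import deque

import numpy as np

k = 2                     # number of frames to stack (hypothetical value)
frames = deque(maxlen=k)  # the oldest frame is dropped automatically

# Feed three dummy grayscale frames of shape (height=4, breadth=3, channels=1).
for i in range(3):
    frames.append(np.full((4, 3, 1), i, dtype=np.uint8))

# Stack the last k frames along the channel axis: (4, 3, 2).
stacked = np.concatenate(list(frames), axis=-1)

# Swap first and last axes for PyTorch: (channels, breadth, height) = (2, 3, 4).
channels_first = np.swapaxes(stacked, 2, 0)
```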

Updated for PyTorch 0.4.
Made changes such that the algorithm more closely matches that in Mnih et al. (2015) and other DQN literature:
- linear epsilon decay
- frame stacking
- training frequency is now once every 4 steps in the environment for Atari env
- option of using Huber loss instead of RMS loss in def compute_td_loss()
Also borrowed logging facility from OpenAI's Baselines

- Borrowed monitoring wrapper from OpenAI's Baselines to log progress of training.
- Modified the wrappers so that they now accommodate stacked frames and output them as a LazyFrames object. The axes of the data are swapped appropriately for PyTorch, i.e. (no. of channels) x (breadth) x (height)

colin-leu · 2019-05-21T13:28:16Z

1.dqn.ipynb

+    "import torch.nn.functional as F\n",
+    "\n",
+    "import os\n",
+    "import logger\n",


Which specific module is this? (I can't find a module named logger.)

joleeson added 4 commits February 8, 2019 10:19

Add files via upload

1b0cedd

Update wrappers.py

04e897d

colin-leu reviewed May 21, 2019
