Skip to content
This repository has been archived by the owner on Aug 25, 2021. It is now read-only.

facebookresearch/td-delta

master
Switch branches/tags

Name already in use

A tag already exists with the provided branch name. Many Git commands accept both tag and branch names, so creating this branch may cause unexpected behavior. Are you sure you want to create this branch?
Code

Latest commit

 

Git stats

Files

Permalink
Failed to load latest commit information.
Type
Name
Latest commit message
Commit time
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Separating Value Functions Across Time-Scales

Read the full paper: https://arxiv.org/abs/1902.01883

@article{separatingvalues2019,
  title={Separating value functions across time-scales},
  author={Romoff, Joshua and Henderson, Peter and Touati, Ahmed and Olliver, Yann and Brunskill, Emma and Pineau, Joelle},
  journal={arXiv preprint arXiv:1902.01883},
  year={2019}
}

We based our code off of ikostrikov's pytorch-rl repo.

@misc{pytorchrl,
  author = {Kostrikov, Ilya},
  title = {PyTorch Implementations of Reinforcement Learning Algorithms},
  year = {2018},
  publisher = {GitHub},
  journal = {GitHub repository},
  howpublished = {\url{https://github.com/ikostrikov/pytorch-a2c-ppo-acktr}},
}

Installation

PyTorch

without cuda:

conda install pytorch=0.4.1 -c pytorch

with cuda:

conda install pytorch=0.4.1 cuda90 -c pytorch

(or cuda92, cuda80, cuda 75. depending on what you have installed)

Baselines for Atari preprocessing

git clone https://github.com/openai/baselines.git cd baselines pip install -e .

Other requirements

pip install -r requirements.txt

Replicating results

To replicate our atari experiments run

python main.py --run-index [0-720]

Visualization

To visualize performance (requires Visdom) first create a visdom server:

python -m visdom.server

Then run:

python visualize.py

License

This repo is CC-BY-NC licensed, as found in the LICENSE file.

About

Separating value functions across time-scales.

Resources

License

Code of conduct

Security policy

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages