RL-Experiments aims to make it easy to modify and compare deep RL algorithms on a single machine. For distributed training, I highly recommend ray.
The code largely follows openai/baselines but is implemented in PyTorch. We also mark the differences between the implementation and the original papers, which can be found by searching for `highlight` in the code. Evaluated on 4 Atari games, our implementation is on average 15% faster than baselines on a single machine, with similar performance.
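The marked lines can be listed with a plain text search from the repository root, for example:

```bash
# Find the places where the implementation deviates from the papers
grep -rn "highlight" src/
```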
Dependencies:
- Python: 3.5+
- PyTorch: 1.0.1+
With the same default parameters as openai/baselines and random seed 0, the FPS and performance on four environments are shown below (one table per environment).
Devices:
- 1 NVIDIA GTX 1070
- Intel(R) Core(TM) i7-7700K CPU @ 4.20GHz (8 logical cores)
|           | A2C  | DQN | PPO  | TRPO |
|-----------|------|-----|------|------|
| Ours      | 1667 | 277 | 1515 | 513  |
| Baselines | 1596 | 246 | 1089 | 501  |

|           | A2C  | DQN | PPO  | TRPO |
|-----------|------|-----|------|------|
| Ours      | 1667 | 278 | 1550 | 501  |
| Baselines | 1572 | 247 | 1186 | 440  |

|           | A2C  | DQN | PPO  | TRPO |
|-----------|------|-----|------|------|
| Ours      | 1667 | 272 | 1515 | 494  |
| Baselines | 1543 | 243 | 1062 | 451  |

|           | A2C  | DQN | PPO  | TRPO |
|-----------|------|-----|------|------|
| Ours      | 1667 | 275 | 1515 | 501  |
| Baselines | 1572 | 236 | 1203 | 481  |
Quick start:

```bash
git clone https://github.com/Officium/RL-Experiments.git
cd RL-Experiments/src
python run.py --env=CartPole-v1 --env_type=classic_control --algorithm=dqn --number_timesteps=1e5
```
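The same entry point should also cover the Atari benchmarks. As a sketch, a PPO run might look like the following; the environment id and the `--env_type=atari` value are assumptions based on openai/baselines conventions and are not taken from this README:

```bash
# Hypothetical Atari example; adjust the flag values to the actual CLI
python run.py --env=BreakoutNoFrameskip-v4 --env_type=atari --algorithm=ppo --number_timesteps=1e7
```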
Implemented algorithms:
- A2C
- DQN (with double Q-learning, C51, dueling, noisy networks, and prioritized replay; a dueling-head sketch follows the list below)
- PPO
- REINFORCE
- TRPO
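As an illustration of one of the DQN extensions listed above, here is a minimal sketch of a dueling value/advantage head in PyTorch. It is not the module used in this repository; the class name and layer sizes are made up for the example.

```python
import torch.nn as nn


class DuelingHead(nn.Module):
    """Dueling head sketch: Q(s, a) = V(s) + A(s, a) - mean_a A(s, a)."""

    def __init__(self, feature_dim, num_actions):
        super().__init__()
        self.value = nn.Linear(feature_dim, 1)                 # state value V(s)
        self.advantage = nn.Linear(feature_dim, num_actions)   # advantages A(s, a)

    def forward(self, features):
        v = self.value(features)
        a = self.advantage(features)
        # Subtract the mean advantage so V and A are identifiable
        return v + a - a.mean(dim=1, keepdim=True)
```

In a dueling DQN this head replaces the final linear layer of the Q-network; the rest of the training loop is unchanged.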