My coursework for the CS294 Deep Reinforcement Learning course, taught by Sergey Levine at UC Berkeley.
Updated Jan 15, 2018 - Python
An RL agent that learns to play Doom's Deadly Corridor scenario using DDQN and PER.
TensorFlow implementation of "Sample-efficient Imitation Learning via Generative Adversarial Nets"
PyTorch implementation of "Sample-efficient Imitation Learning via Generative Adversarial Nets"
PyTorch implementation of DICE algorithms
CURL: Contrastive Unsupervised Representation Learning for Sample-Efficient Reinforcement Learning
SUNRISE: A Simple Unified Framework for Ensemble Learning in Deep Reinforcement Learning
RAD: Reinforcement Learning with Augmented Data
Ensemble and Auxiliary Tasks for Data-Efficient Deep Reinforcement Learning
Contains PyTorch implementations of the following off-policy actor-critic algorithms
This repository contains all of the Reinforcement Learning-related projects I've worked on. The projects are part of the graduate course at the University of Tehran.
Sample Policy Gradient
A PyTorch implementation of Hindsight Experience Replay (HER), with experiments on all Fetch robotics environments.
ExORL: Exploratory Data for Offline Reinforcement Learning
Temporal Difference Method - Q-Learning Implementation for FrozenLake Grid Problem
PyTorch implementation of our work: "Lipschitzness Is All You Need To Tame Off-policy Generative Adversarial Imitation Learning"
An Optimistic Approach to the Q-Network Error in Actor-Critic Methods