off-policy

This repository contains all of the Reinforcement Learning-related projects I've worked on. The projects are part of the graduate course at the University of Tehran.

monte-carlo epsilon-greedy policy-gradient sarsa dynamic-programming policy-iteration model-based-rl n-armed-bandit-problem on-policy off-policy double-q-learning model-free-rl n-step-bootstrapping n-step-expected-sarsa n-step-tree-backup ucb-algorithm

Updated Oct 2, 2021
HTML

DjAzDeck / SPG

Sample Policy Gradient

learning algorithm control optimization deep policy continuous action reinforcement deterministic actor-critic model-free off-policy

Updated Oct 31, 2021
Python

TianhongDai / hindsight-experience-replay

This is the PyTorch implementation of Hindsight Experience Replay (HER), with experiments on all Fetch robotic environments.

reinforcement-learning exploration ddpg her pytorch-implmention off-policy hindsight-experience-replay

Updated Dec 11, 2021
Python

denisyarats / exorl

ExORL: Exploratory Data for Offline Reinforcement Learning

python control reinforcement-learning deep-learning pytorch datasets mujoco model-free off-policy offline-rl unsupevised exporation

Updated Feb 8, 2022
Python

Kalyani011 / RL-Q_Learning_Implementation

Temporal Difference Method - Q-Learning Implementation for FrozenLake Grid Problem

reinforcement-learning q-learning temporal-differencing-learning off-policy value-based

Updated Apr 5, 2022
Jupyter Notebook

lionelblonde / liayn-pytorch

PyTorch implementation of our work: "Lipschitzness Is All You Need To Tame Off-policy Generative Adversarial Imitation Learning"

reinforcement-learning pytorch gan imitation-learning gail off-policy

Updated Apr 19, 2022
Python

lionelblonde / liayn-pytorch-complete-history

PyTorch implementation of our work: "Lipschitzness Is All You Need To Tame Off-policy Generative Adversarial Imitation Learning"

reinforcement-learning pytorch gan imitation-learning gail off-policy

Updated Apr 19, 2022
Python

baturaysaglam / Q-Error-Exploration

An Optimistic Approach to the Q-Network Error in Actor-Critic Methods

deep-reinforcement-learning actor-critic off-policy experience-replay exploration-exploitation

Updated Jun 23, 2022
Python

Here are 40 public repositories matching this topic...

mabirck / CS294-DeepRL

Puneet2000 / Agent-DOoM

lionelblonde / sam-tf-complete-history

lionelblonde / sam-pytorch

SaminYeasar / PyTorch-implementation-DICE-algorithms

MishaLaskin / curl

pokaxpoka / sunrise

MishaLaskin / rad

NUS-LID / RENAULT

SaminYeasar / off_policy_ac

lionelblonde / sam-pytorch-complete-history

Rosefintech / Rosefintech-RosefinAIEngine

narjesno / Reinforcement-Learning

DjAzDeck / SPG

TianhongDai / hindsight-experience-replay

denisyarats / exorl

Kalyani011 / RL-Q_Learning_Implementation

lionelblonde / liayn-pytorch

lionelblonde / liayn-pytorch-complete-history

baturaysaglam / Q-Error-Exploration
