Classic papers and resources on recommendation (Python, updated Jun 13, 2020)
OpenDILab Decision AI Engine
Python implementations of contextual bandits algorithms
Code to reproduce the experiments in Sample Efficient Reinforcement Learning via Model-Ensemble Exploration and Exploitation (MEEE).
PyTorch implementation of the ICML 2018 paper "Self-Imitation Learning".
Code for the NeurIPS 2022 paper "Exploiting Reward Shifting in Value-Based Deep RL".
Comparison of two methods for handling the exploration-exploitation dilemma in multi-armed bandits.
Official implementation of LECO (NeurIPS'22)
Deep Intrinsically Motivated Exploration in Continuous Control
Short implementations of bandit algorithms: ETC, UCB, MOSS, and KL-UCB.
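For readers unfamiliar with these algorithms, a minimal sketch of UCB1 (the basic upper-confidence-bound strategy the UCB family builds on) may help. This is an illustrative example on simulated Bernoulli arms, not code from the repository above; the arm means and horizon are arbitrary.

```python
import math
import random

def ucb1(means, horizon, seed=0):
    """Run UCB1 on simulated Bernoulli arms with the given success means.

    Returns the number of pulls per arm. Illustrative sketch only; the
    inputs are assumptions, not taken from any repository listed here.
    """
    rng = random.Random(seed)
    k = len(means)
    pulls = [0] * k
    rewards = [0.0] * k
    for t in range(1, horizon + 1):
        if t <= k:
            arm = t - 1  # initialize: pull each arm once
        else:
            # pick the arm maximizing empirical mean + confidence radius
            arm = max(
                range(k),
                key=lambda i: rewards[i] / pulls[i]
                + math.sqrt(2 * math.log(t) / pulls[i]),
            )
        reward = 1.0 if rng.random() < means[arm] else 0.0
        pulls[arm] += 1
        rewards[arm] += reward
    return pulls
```

The confidence radius shrinks as an arm is pulled more often, so under-explored arms are revisited while clearly inferior arms are pulled only logarithmically often.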
The GitHub repository for "Accelerating Approximate Thompson Sampling with Underdamped Langevin Monte Carlo", AISTATS 2024.
The official code release for Provable and Practical: Efficient Exploration in Reinforcement Learning via Langevin Monte Carlo, ICLR 2024.
Research Thesis - Reinforcement Learning
This project compares different reinforcement learning algorithms, including Monte Carlo methods, Q-learning, Q(lambda), and epsilon-greedy variations.
An Optimistic Approach to the Q-Network Error in Actor-Critic Methods
Over-parameterization = exploration?
Repository for our paper: "Improving Reinforcement Learning Exploration with Causal Models of Core Environment Dynamics". (submitted to ECAI 2024)
An implementation of the classic reinforcement learning multi-armed bandit experiment, comparing different exploration techniques.
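The simplest exploration technique such bandit experiments compare is epsilon-greedy: explore a random arm with probability epsilon, otherwise exploit the empirically best one. A minimal sketch, again on simulated Bernoulli arms with assumed parameters (not code from the repository above):

```python
import random

def epsilon_greedy(means, horizon, epsilon=0.1, seed=0):
    """Epsilon-greedy on simulated Bernoulli arms.

    With probability epsilon, pull a uniformly random arm (explore);
    otherwise pull the arm with the highest running-average reward
    (exploit). Returns the average reward over the horizon.
    Illustrative sketch only; all inputs are assumptions.
    """
    rng = random.Random(seed)
    k = len(means)
    pulls = [0] * k
    values = [0.0] * k  # running average reward per arm
    total = 0.0
    for _ in range(horizon):
        if rng.random() < epsilon:
            arm = rng.randrange(k)  # explore
        else:
            arm = max(range(k), key=lambda i: values[i])  # exploit
        reward = 1.0 if rng.random() < means[arm] else 0.0
        pulls[arm] += 1
        # incremental update of the running average
        values[arm] += (reward - values[arm]) / pulls[arm]
        total += reward
    return total / horizon
```

A constant epsilon keeps exploring forever, so the average reward plateaus below the best arm's mean; decaying epsilon over time is the usual refinement.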