This repo contains my completed pre-screening exercise for the Reinforcement Learning Open Source Fest 2021.

MoniFarsang/RL-Open-Source-Fest-Application-Exercise

RL-Open-Source-Fest-Application-Exercise

Python / Data Science Exercise

Analysis of how non-stationarity affects different contextual bandit algorithms

Changing the reward distribution over time and adding varying noise

Comparing the results of different exploration algorithms
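The setup described above can be sketched roughly as follows. This is a minimal, self-contained illustration and not the code in this repo: it does not use Vowpal Wabbit, and the preference tables, the `change_point` and `noise_std` parameters, and the tabular epsilon-greedy learner are all assumptions introduced for the example. It only follows the tutorial's convention that a cost is the negative of a reward.

```python
import random

# Assumed cost convention: a liked recommendation costs -1.0, a disliked one 0.0.
LIKED = -1.0
DISLIKED = 0.0

ACTIONS = ["politics", "sports", "music"]

# Hypothetical preferences per (user, time_of_day) context, before and after the shift.
PREFS_BEFORE = {
    ("Tom", "morning"): "politics", ("Tom", "afternoon"): "music",
    ("Anna", "morning"): "sports", ("Anna", "afternoon"): "politics",
}
PREFS_AFTER = {
    ("Tom", "morning"): "politics", ("Tom", "afternoon"): "sports",
    ("Anna", "morning"): "sports", ("Anna", "afternoon"): "sports",
}
CONTEXTS = list(PREFS_BEFORE)


def get_cost(context, action, step, change_point=1000, noise_std=0.0, rng=random):
    """Non-stationary cost: preferences switch at change_point, plus Gaussian noise."""
    prefs = PREFS_BEFORE if step < change_point else PREFS_AFTER
    base = LIKED if prefs[context] == action else DISLIKED
    return base + rng.gauss(0.0, noise_std)


def run_epsilon_greedy(epsilon, steps=2000, change_point=1000, noise_std=0.1, seed=0):
    """Run a tabular epsilon-greedy learner and return its average reward."""
    rng = random.Random(seed)
    totals = {(c, a): 0.0 for c in CONTEXTS for a in ACTIONS}
    counts = {(c, a): 0 for c in CONTEXTS for a in ACTIONS}

    def mean_cost(context, action):
        # Untried actions are treated as having an average cost of 0.0.
        n = counts[(context, action)]
        return totals[(context, action)] / n if n else 0.0

    cumulative_cost = 0.0
    for step in range(steps):
        context = rng.choice(CONTEXTS)
        if rng.random() < epsilon:
            action = rng.choice(ACTIONS)  # explore uniformly at random
        else:
            action = min(ACTIONS, key=lambda a: mean_cost(context, a))  # exploit
        cost = get_cost(context, action, step, change_point, noise_std, rng)
        totals[(context, action)] += cost
        counts[(context, action)] += 1
        cumulative_cost += cost
    return -cumulative_cost / steps


if __name__ == "__main__":
    # Compare a few exploration rates under the same shift and noise level.
    for eps in (0.0, 0.1, 0.3):
        print(eps, run_epsilon_greedy(eps))
```

Because the learner averages over its whole history, it adapts slowly after the change point; comparing exploration rates under different `noise_std` values is one simple way to see how much exploration helps recovery.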

My code is based on the Vowpal Wabbit tutorial "Simulating Content Personalization with Contextual Bandits": https://vowpalwabbit.org/tutorials/cb_simulation.html
