RL-Open-Source-Fest-Application-Exercise

This repo contains my completed pre-screening exercise for the Reinforcement Learning Open Source Fest 2021.

Python / Data Science Exercise

Analyzing how non-stationarity affects different contextual bandit algorithms

Changing the reward distribution over time and adding varying noise

Comparing the results of different exploration algorithms

My code is based on the Vowpal Wabbit tutorial "Simulating Content Personalization with Contextual Bandits": https://vowpalwabbit.org/tutorials/cb_simulation.html
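To illustrate what "changing the reward distribution over time and adding varying noise" can look like, here is a minimal sketch in plain Python. It is not taken from the notebook. The preference tables, the `change_point` parameter, and the linear noise schedule in `noise_std` are assumptions made for this example; only the general setup (a user and time-of-day context, and a negative cost when the user likes the article) is loosely modeled on the tutorial.

```python
import random

# Costs follow the Vowpal Wabbit convention: lower is better.
LIKED_COST = -1.0
DISLIKED_COST = 0.0

# Hypothetical preference tables: (user, time_of_day) -> liked action.
PREFS_BEFORE = {
    ("Tom", "morning"): "politics",
    ("Tom", "afternoon"): "music",
    ("Anna", "morning"): "sports",
    ("Anna", "afternoon"): "politics",
}
PREFS_AFTER = {
    ("Tom", "morning"): "politics",
    ("Tom", "afternoon"): "sports",
    ("Anna", "morning"): "sports",
    ("Anna", "afternoon"): "sports",
}


def noise_std(step, total_steps, max_std=0.5):
    """Noise level growing linearly from 0 to max_std (an assumed schedule)."""
    return max_std * step / max(total_steps - 1, 1)


def get_cost(context, action, step, total_steps, change_point, rng):
    """Return a noisy cost whose underlying preferences switch at change_point."""
    # Non-stationarity: the preference table changes once at change_point.
    prefs = PREFS_BEFORE if step < change_point else PREFS_AFTER
    liked = prefs[(context["user"], context["time_of_day"])]
    base = LIKED_COST if action == liked else DISLIKED_COST
    # Varying noise: zero-mean Gaussian with a step-dependent standard deviation.
    return base + rng.gauss(0.0, noise_std(step, total_steps))


if __name__ == "__main__":
    rng = random.Random(0)  # seeded for reproducibility
    ctx = {"user": "Tom", "time_of_day": "afternoon"}
    for step in (0, 49, 50, 99):
        cost = get_cost(ctx, "music", step, 100, 50, rng)
        print(step, round(cost, 3))
```

In a simulation loop, a function like this would replace a stationary cost function, so that a learner has to keep exploring to notice when a previously good action stops paying off.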

MoniFarsang/RL-Open-Source-Fest-Application-Exercise
