multi-armed-bandit
Here are 116 public repositories matching this topic...
Demo for NAIST Spring Seminar 2017
Updated May 14, 2017 - JavaScript
A simple implementation of the multi-armed bandit problem, which can also be used in OpenAI Gym.
Updated Aug 28, 2017 - Python
Reinforcement Learning
Updated Nov 30, 2017 - Python
Implementations of the bandit algorithms with unordered and ordered slates described in the paper "Non-Stochastic Bandit Slate Problems" by Kale et al., 2010.
Updated May 30, 2018 - Python
In probability theory, the multi-armed bandit problem is a problem in which a fixed, limited set of resources must be allocated between competing (alternative) choices in a way that maximizes their expected gain, when each choice's properties are only partially known at the time of allocation, and may become better understood as time passes or by…
Updated Jun 1, 2018 - Jupyter Notebook
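To make the problem definition above concrete, here is a minimal sketch of a Bernoulli bandit environment: each arm pays out 1 with an unknown probability, and the learner's task is to allocate pulls so as to maximize total reward. The class name and payout probabilities are illustrative, not taken from any repository listed here.

```python
import random


class BernoulliBandit:
    """K arms; pulling arm i yields reward 1 with (unknown) probability probs[i]."""

    def __init__(self, probs, seed=0):
        self.probs = probs
        self.rng = random.Random(seed)

    def pull(self, arm):
        # Bernoulli reward: 1 with probability probs[arm], else 0.
        return 1 if self.rng.random() < self.probs[arm] else 0


# Hypothetical three-armed instance; a learner would not know these probabilities.
bandit = BernoulliBandit([0.2, 0.5, 0.8])
# An oracle that always pulls the best arm averages close to 0.8 reward per step;
# the bandit problem is to approach this without knowing the probabilities upfront.
mean_reward = sum(bandit.pull(2) for _ in range(1000)) / 1000
print(mean_reward)
```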
A bandit server that implements the multi-armed bandit for running a single experiment
Updated Jun 6, 2018 - Go
Reinforcement Learning in Scala
Updated Jun 18, 2018 - Scala
Implementation of the X-armed Bandits algorithm, as detailed in the paper "X-armed Bandits" (Bubeck et al., 2011).
Updated Jul 12, 2018 - Python
Python implementation of a multi-armed bandit using epsilon-greedy exploration and sample-average reward estimation
Updated Sep 25, 2018 - Jupyter Notebook
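The combination named in the entry above, epsilon-greedy exploration with sample-average reward estimates, can be sketched in a few lines. This is a generic illustration of the technique, not code from the listed repository; the function name and parameters are assumptions.

```python
import random


def epsilon_greedy(true_probs, steps=5000, epsilon=0.1, seed=1):
    """Epsilon-greedy on a Bernoulli bandit with incremental sample-average estimates."""
    rng = random.Random(seed)
    k = len(true_probs)
    counts = [0] * k        # number of pulls per arm
    estimates = [0.0] * k   # sample-average reward estimate per arm
    total = 0

    for _ in range(steps):
        if rng.random() < epsilon:
            arm = rng.randrange(k)                          # explore: random arm
        else:
            arm = max(range(k), key=lambda a: estimates[a])  # exploit: best estimate
        reward = 1 if rng.random() < true_probs[arm] else 0
        counts[arm] += 1
        # Incremental sample average: Q <- Q + (r - Q) / n
        estimates[arm] += (reward - estimates[arm]) / counts[arm]
        total += reward
    return estimates, total


estimates, total = epsilon_greedy([0.1, 0.4, 0.7])
print(estimates, total)
```

With a small epsilon, most pulls exploit the current best estimate while occasional random pulls keep every arm's estimate converging toward its true payout probability.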
Il bandito bracciuto: a light, but not-too-light, introduction to reinforcement learning (in Italian).
Updated Jan 20, 2019 - Python
A Go library for solving the multi-armed bandit problem, which can optimize your business choices on the fly without A/B testing
Updated Apr 15, 2019 - Go
An implementation of solvers for the multi-armed bandit problem in JavaScript.
Updated Apr 25, 2019 - JavaScript
More about the exploration-exploitation tradeoff with harder bandits
Updated May 12, 2019 - Jupyter Notebook
Software for the experiments reported in the RecSys 2019 paper "Multi-Armed Recommender System Bandit Ensembles"
Updated Aug 16, 2019 - Java
A/B testing framework for Python (with an optional multi-armed bandit implementation)
Updated Aug 18, 2019 - Python
Using the Airspeed Velocity tool (https://asv.readthedocs.io/) to benchmark SMPyBandits (https://github.com/SMPyBandits/SMPyBandits/)
Updated Nov 6, 2019 - Python