multiarm-bandit

Here are 18 public repositories matching this topic...

niffler92 / Bandit

Bandit algorithms

simulation thompson-sampling multiarm-bandit contextual-bandit bandit-algorithms linucb

Updated Oct 12, 2017
Python

akshaykhadse / reinforcement-learning

Implementations of basic concepts under the Reinforcement Learning umbrella. This project is a collection of assignments for CS747: Foundations of Intelligent and Learning Agents (Autumn 2017) at IIT Bombay.

reinforcement-learning linear-programming thompson-sampling epsilon-greedy ucb policy-evaluation mdps multi-armed-bandits policy-iteration randomised-algorithms reinforcement-learning-excercises kl-divergence markovian-epidemic-processes reinforcement-learning-analysis multiarm-bandit ucb1 howards-pi batch-switching randomized-policy-iteration

Updated May 21, 2018
Python

sourcecode369 / ml-algorithms-on-scikit-and-keras

Implementation scripts of Machine Learning algorithms in Scikit-learn and Keras for complete novices.

Updated Jul 22, 2018
Jupyter Notebook

viswanath57 / Bandit-Algorithms

algorithms epsilon-greedy multiarm-bandit softmax-algorithm ucb1

Updated Apr 5, 2021
Jupyter Notebook

Nth-iteration-labs / streamingbandit-ui

Client that handles the administration of StreamingBandit online, or straight from your desktop. Set up and run streaming (contextual) bandit experiments in your browser.

react javascript client machine-learning webapp bandit-learning contextual-bandits multiarm-bandit bandit-algorithm streamingbandit-client

Updated Dec 7, 2022
JavaScript

Xinjie-Lan / Multi-Armed_Bandit

Python implementation of e-Greedy, UCB, LinUCB, LinThompson, and an offline evaluator

multiarm-bandit linucb

Updated Oct 17, 2019
Jupyter Notebook

MassimoGennaro / DIA_Project_PoliMi

Data Intelligence Application project

reinforcement-learning pricing advertising multiarm-bandit

Updated Aug 2, 2020
Jupyter Notebook

CavenaghiEmanuele / Multi-armed-bandit

Library on Multi-armed bandit

thompson-sampling multiarm-bandit multiarmed-bandits thompson-algorithm

Updated Jan 30, 2023
Python

Shahul-Rahman / MABSearch-Learning-the-learning-rate

MABSearch: The Bandit Way of Learning the Learning Rate - A Harmony Between Reinforcement Learning and Gradient Descent

python machine-learning reinforcement-learning optimization global-optimization gradient-descent learning-rate multi-armed-bandit global-optimization-algorithms metaheuristics multiarm-bandit multiarmed-bandits global-minimum

Updated Oct 28, 2023
Jupyter Notebook

niazangels / bandits

An introduction to multi-armed bandits

reinforcement-learning multiarm-bandit bandit-algorithms multiarmed-bandits

Updated Aug 23, 2022
Jupyter Notebook

cormac-rynne / bandits

Variety of Multi-Arm Bandit (MAB) algorithms using classic and advanced strategies, including tools for experiments and simulations in stationary and nonstationary environments

reinforcement-learning thompson-sampling ucb multiarm-bandit exp3-algorithm multi-arm

Updated Feb 10, 2024
Jupyter Notebook

FanchenBao / reinforcement_learning

Code examples for simple reinforcement learning projects

reinforcement-learning actor-critic multiarm-bandit tabular-methods

Updated Oct 5, 2023
Jupyter Notebook

duoan / OpenMultiarmedBandits

An open-source multi-armed bandit framework for optimizing your website quickly. You'll benefit from several simple algorithms, including the epsilon-Greedy, Softmax, and Upper Confidence Bound (UCB) algorithms, by working through this framework written in Java, which you can easily adapt for deployment on your own website.