#

mab

Here are 9 public repositories matching this topic...

alison-carrera / onn

Online Deep Learning: Learning Deep Neural Networks on the Fly / Non-linear Contextual Bandit Algorithm (ONN_THS)

reinforcement-learning neural-network pytorch thompson-sampling reinforcement-learning-algorithms machine-learning-library neural-architecture-search contextual-bandits mab pytorch-implemention multiarmed-bandits pytorch-implementation thompson-algorithm

Updated Dec 11, 2019
Python

alison-carrera / mabalgs

👤 Multi-Armed Bandit Algorithms Library (MAB) 👮

arm algorithm reinforcement-learning simulation monte-carlo rank thompson-sampling reinforcement-learning-algorithms ucb reward multi-armed-bandit montecarlo-simulation contextual-bandits ranking-algorithm mab ranked-mab

Updated Sep 6, 2022
Python

Nth-iteration-labs / streamingbandit

Python application to setup and run streaming (contextual) bandit experiments.

streaming online sequential multi-armed-bandit bandit mab contextual cmab multi-armed

Updated Mar 31, 2023
Python

vmam

MatteoGuadrini / vmam

VLAN Mac-address Authentication Manager

ldap radius python3 ldap-authentication ldap-server vlan mac-address network-architecture ldap-manager radius-server nac mab 80211 pywinrm 8021x ldap3 rfc-3579 ldap-group ieee8021x

Updated Apr 5, 2021
Python

juliennonin / multiplayer-bandits

Multi-Player Bandits Revisited [L. Besson & É. Kaufmann]

reinforcement-learning multi-armed-bandit mab

Updated Jan 21, 2021
Python

duchuyle108 / SDN-EgressNode-Selection

The work in paper "A Reinforcement Learning-Based Solution for Intra-Domain Egress Selection" - Duc-Huy LE, Hai Anh TRAN

Updated Sep 11, 2022
Python

jiseongHAN / reinforcement

My Little Reinforcement Learning

reinforcement-learning pytorch dqn reinforce ddqn mab ppo-pytorch

Updated Jul 13, 2021
Python

DURUII / Replica-AUCB

🐯REPLICA of "Auction-based combinatorial multi-armed bandit mechanisms with strategic arms"

multi-armed-bandit bandits mab cmab bandit-algorithms aution aucb

Updated Dec 17, 2023
Python

JoelJa835 / MAB_Algorithms

Implementation of Multi-Armed Bandit (MAB) algorithms UCB and Epsilon-Greedy. MAB is a class of problems in reinforcement learning where an agent learns to choose actions from a set of arms, each associated with an unknown reward distribution. UCB and Epsilon-Greedy are popular algorithms for solving MAB problems.

reinforcement-learning-algorithms ucb bandits mab e-greedy

Updated Mar 26, 2023
Python

Improve this page

Add a description, image, and links to the mab topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the mab topic, visit your repo's landing page and select "manage topics."