Python utilities to compute a lower bound of the expected sample complexity to identify the best arm in a bandit model
Randomized Greedy Learning Under Full-bandit Feedback
This repository contains code for the course CS780: Deep Reinforcement Learning.
🎩🤠 Some bandit algorithms in TypeScript.
Official code for an ICML 2024 paper.
An Implementation of the N-Tuple Bandits Evolutionary Algorithm.
Code repository for the paper No-Regret Approximate Inference via Bayesian Optimisation, published at UAI 2021
An implementation of the TME from the Reinforcement Learning course given at Sorbonne University.
Creation of filters using passive electric elements.
Implementation of greedy, ε-greedy and softmax methods for n-armed bandit problem
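The ε-greedy and softmax selection rules mentioned above can be sketched in a few lines. This is a minimal illustration, not taken from any of the listed repositories; the function names and the use of plain lists for the value estimates are my own assumptions.

```python
import math
import random

def epsilon_greedy(q_values, epsilon=0.1):
    """Pick a random arm with probability epsilon, else the greedy arm.
    q_values: current value estimate for each arm (hypothetical layout)."""
    if random.random() < epsilon:
        return random.randrange(len(q_values))
    return max(range(len(q_values)), key=lambda a: q_values[a])

def softmax_select(q_values, tau=1.0):
    """Sample an arm with probability proportional to exp(Q[a] / tau).
    Lower tau concentrates the choice on the best-looking arm."""
    prefs = [math.exp(q / tau) for q in q_values]
    total = sum(prefs)
    r, cum = random.random() * total, 0.0
    for a, p in enumerate(prefs):
        cum += p
        if r < cum:
            return a
    return len(prefs) - 1  # guard against floating-point rounding
```

With ε = 0 the first rule is pure greedy selection; the softmax rule interpolates between uniform exploration (large τ) and greedy exploitation (small τ).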
An implementation of the matching bandit algorithm in http://proceedings.mlr.press/v139/sentenac21a.html.
Today I Learned - Reinforcement Learning
Several multi-armed bandit strategies with an additional holding option for smoother exploration.
A collection of Google Colab notebooks with educational material about bandits and their variations.
An open-source multi-armed bandit framework for optimizing your website. Working through this framework, written in Java, you can quickly apply several simple algorithms—including epsilon-greedy, Softmax, and Upper Confidence Bound (UCB)—and easily adapt them for deployment on your own website.
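For reference, the UCB algorithm that several of these repositories implement can be sketched as the classic UCB1 index rule. This is a generic sketch under my own naming assumptions, not code from the Java framework above.

```python
import math

def ucb1_select(counts, values, t):
    """UCB1: pick the arm maximizing mean reward + sqrt(2 ln t / n_a).
    counts: number of pulls per arm; values: empirical mean per arm;
    t: total number of pulls so far (hypothetical parameter layout)."""
    for a, n in enumerate(counts):
        if n == 0:  # play every arm once before using the index
            return a
    return max(
        range(len(counts)),
        key=lambda a: values[a] + math.sqrt(2.0 * math.log(t) / counts[a]),
    )
```

The bonus term shrinks as an arm accumulates pulls, so rarely tried arms keep getting revisited until their uncertainty is resolved.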
This repo contains code for multi-armed bandit algorithm testing and local multiplayer competition.
Ad click-through rate optimization using Thompson sampling.
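Thompson sampling for click-through rates is usually implemented with a Beta-Bernoulli model: each ad keeps click/no-click counts, and on every round one CTR estimate is drawn per ad from its Beta posterior. A minimal sketch, with names and data layout assumed for illustration:

```python
import random

def thompson_select(clicks, no_clicks):
    """Beta-Bernoulli Thompson sampling over ads.
    Draw a CTR sample from Beta(1 + clicks, 1 + no_clicks) for each ad
    (uniform Beta(1, 1) prior) and show the ad with the largest draw."""
    samples = [
        random.betavariate(1 + c, 1 + n)
        for c, n in zip(clicks, no_clicks)
    ]
    return max(range(len(samples)), key=lambda a: samples[a])
```

Because each arm is chosen with the posterior probability that it is the best, exploration fades automatically as the counts grow.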