#

ucb

Here are 24 public repositories matching this topic...

Murtazali05 / Multi-armed-bandit

Multi Armed Bandits implementation using the Jester Dataset

thompson-sampling ucb multi-armed-bandits e-greedy

Updated Apr 5, 2021
Python

salimandre / Monte-Carlo-Tree-Search-for-checkers-game

We compare different policies for the checkers game using reinforcement learning algorithms.

python reinforcement-learning turtle-graphics ucb monte-carlo-tree-search checkers-game upper-confidence-bound mcts-algorithm

Updated Aug 24, 2020
Python

LittleWat / hyper-parameter-optimization-by-GMRF-GPUCB

R.I.T project

python3 ucb gaussian-processes gmrf markov-random-field gp

Updated Jul 29, 2019
Python

SarCode / ML-Code-Tutorials-Udemy

Complete Tutorial Guide with Code for learning ML

natural-language-processing random-forest svm scikit-learn artificial-neural-networks logistic-regression ucb polynomial-regression kmeans-clustering knearest-neighbor-algorithm apriori-algorithm classification-methods svr kernel-svm kernel-pca heirarchical-clustering decison-trees

Updated Apr 21, 2023
Python

JoelJa835 / Least-Loaded-Server

reinforcement-learning-algorithms ucb multiplicative-weights

Updated Apr 26, 2023
Python

JoelJa835 / MAB_Algorithms

Implementation of Multi-Armed Bandit (MAB) algorithms UCB and Epsilon-Greedy. MAB is a class of problems in reinforcement learning where an agent learns to choose actions from a set of arms, each associated with an unknown reward distribution. UCB and Epsilon-Greedy are popular algorithms for solving MAB problems.

reinforcement-learning-algorithms ucb bandits mab e-greedy

Updated Mar 26, 2023
Python

educup / ucb-python-api

Python package for Unity Cloud Build api

python api unity poetry unity3d python3 ucb typer python-package unity-tool unity-cloud-build ucb-api

Updated Sep 12, 2020
Python

paramrathour / Intelligent-and-Learning-Agents

My programs during CS747 (Foundations of Intelligent and Learning Agents) Autumn 2021-22

linear-programming thompson-sampling epsilon-greedy mountain-car sarsa ucb markov-decision-processes multi-armed-bandit policy-iteration value-iteration tile-coding kl-ucb policy-control

Updated Apr 17, 2022
Python

salimandre / Monte-Carlo-Tree-Search

We implemented a Monte Carlo Tree Search (MCTS) from scratch and we successfully applied it to Tic-Tac-Toe game.

reinforcement-learning graphics mcts ucb monte-carlo-tree-search tic-tac-toe-game upper-confidence-bound

Updated Jul 9, 2020
Python

sarthakmittal92 / multi-armed-bandits

Repository for the course project done as part of CS-747 (Foundations of Intelligent & Learning Agents) course at IIT Bombay in Autumn 2022.

python thompson-sampling reinforcement-learning-algorithms ucb multi-armed-bandits bandits kl-ucb

Updated Oct 14, 2022
Python

erdogant / thompson

Thompson is Python package to evaluate the multi-armed bandit problem. In addition to thompson, Upper Confidence Bound (UCB) algorithm, and randomized results are also implemented.

python machine-learning reinforcement-learning genetic-algorithm bayesian ucb multi-armed-bandit thompson thompson-algorithm

Updated Feb 21, 2023
Python

MaxenceGiraud / ucb-nonstationary

On Upper-Confidence Bound Policies for Non-Stationary Bandit Problems

ucb multi-armed-bandits non-stationary-bandit discounted-ucb sliding-ucb

Updated Oct 7, 2022
Python

amait41 / Hex-Game

Python implementation of the Hex game with AI based on MC and MCTS methods. Interactive mode with pygame.

game python hex reinforcement-learning ai ucb

Updated Mar 11, 2023
Python

Suchetaaa / CS747-Assignments

Foundations Of Intelligent Learning Agents (FILA) Assignments

reinforcement-learning monte-carlo linear-programming thompson-sampling ucb bootstrapping multi-armed-bandits bellman-equation temporal-differencing-learning howards-pi sarsa-learning kl-ucb windy-gridworld intelligent-learning-agents

Updated Nov 8, 2019
Python

woctezuma / puissance4

AI for the game "Connect Four". Available on PyPI.

Updated Mar 14, 2024
Python

rudrajit1729 / Machine-Learning-Codes-And-Templates

Codes and templates for ML algorithms created, modified and optimized in Python and R.

feature-selection datascience feature-extraction thompson-sampling dimensionality-reduction ucb ann regression-models nlp-machine-learning kmeans-clustering apriori-algorithm hierarchical-clustering classification-algorithims parameter-tuning regression-algorithms xgboost-model kfold-cross-validation cnn-classification eclat-algorithm

Updated Mar 28, 2020
Python

annieyan / Bandits-using-UCB-algorithm

Thompson Sampling for Bandits using UCB policy

reinforcement-learning thompson-sampling ucb bandits

Updated Jul 29, 2017
Python

csfive / CS61A

🚧

python cs61a sicp cs ucb

Updated Apr 28, 2024
Python

Correlated-AoI-Bandits

ishank-juneja / Correlated-AoI-Bandits

Author's implementation of the paper Correlated Age-of-Information Bandits.

thompson-sampling ucb multi-armed-bandit aoi age-of-information correlated-multi-armed-bandits correlated-arms aoi-regret

Updated Jun 19, 2021
Python

idanmoradarthas / MutiArmedBandit-DeepLearning

Multi-armed bandit algorithm with tensorflow and 11 policies

tensorflow deep-reinforcement-learning python3 ucb multi-armed-bandit epsilon softmax

Updated Dec 27, 2022
Python

Improve this page

Add a description, image, and links to the ucb topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the ucb topic, visit your repo's landing page and select "manage topics."