#

ucb

Here are 63 public repositories matching this topic...

zjsyhjh / ucb-cs61b

All projects about ucb-61b(2014 spring), http://www.cs.berkeley.edu/~jrs/61b/index.html

data-structures ucb

Updated Apr 1, 2016
Java

zjsyhjh / ucb-cs186

All projects about ucb-cs186(fall 2013), you can get information from the course website(https://sites.google.com/site/cs186fall2013)

Updated May 24, 2016
Java

annieyan / Bandits-using-UCB-algorithm

Thompson Sampling for Bandits using UCB policy

reinforcement-learning thompson-sampling ucb bandits

Updated Jul 29, 2017
Python

Kivy-CN / data8-textbook-zh

📖 [译] UCB DATA8 计算与推断思维

python statistics textbook ucb data8

Updated Mar 4, 2018
HTML

OMerkel / Oware

Oware and Ouril - traditional African Mancala games with computer AI using Monte Carlo Tree Search (MCTS) with UCB (Upper Confidence Bounds) applied to trees (UCT in short)

Updated Mar 30, 2018
HTML

OMerkel / Alquerque

Alquerque - a 2 player abstract strategic perfect information traditional board game with computer AI option.

game board-game mobile ai mobile-app artificial-intelligence mcts mobile-game entertainment ucb uct checkers draughts monte-carlo-tree-search ai-players upper-confidence-bounds perfect-information 2-player-strategy-game deterministic-game

Updated Mar 30, 2018
JavaScript

akshaykhadse / reinforcement-learning

Implementations of basic concepts dealt under the Reinforcement Learning umbrella. This project is collection of assignments in CS747: Foundations of Intelligent and Learning Agents (Autumn 2017) at IIT Bombay

reinforcement-learning linear-programming thompson-sampling epsilon-greedy ucb policy-evaluation mdps multi-armed-bandits policy-iteration randomised-algorithms reinforcement-learning-excercises kl-divergence markovian-epidemic-processes reinforcement-learning-analysis multiarm-bandit ucb1 howards-pi batch-switching randomized-policy-iteration

Updated May 21, 2018
Python

gracedong92 / d3-scatterplot

data analytics mining and ucb 2018

Updated Jun 19, 2018
JavaScript

v-i-s-h / MAB.jl

A Julia Package for providing Multi Armed Bandit Experiments

reinforcement-learning julia julia-language thompson-sampling reinforcement-learning-algorithms multi-arm-bandits ucb julia-package exp julialang mab bandit-experiments

Updated Jul 19, 2018
Julia

LittleWat / hyper-parameter-optimization-by-GMRF-GPUCB

R.I.T project

python3 ucb gaussian-processes gmrf markov-random-field gp

Updated Jul 29, 2019
Python

Suchetaaa / CS747-Assignments

Foundations Of Intelligent Learning Agents (FILA) Assignments

reinforcement-learning monte-carlo linear-programming thompson-sampling ucb bootstrapping multi-armed-bandits bellman-equation temporal-differencing-learning howards-pi sarsa-learning kl-ucb windy-gridworld intelligent-learning-agents

Updated Nov 8, 2019
Python

rudrajit1729 / Machine-Learning-Codes-And-Templates

Codes and templates for ML algorithms created, modified and optimized in Python and R.

feature-selection datascience feature-extraction thompson-sampling dimensionality-reduction ucb ann regression-models nlp-machine-learning kmeans-clustering apriori-algorithm hierarchical-clustering classification-algorithims parameter-tuning regression-algorithms xgboost-model kfold-cross-validation cnn-classification eclat-algorithm

Updated Mar 28, 2020
Python

Ralami1859 / Stochastic-Multi-Armed-Bandit

Implementation of 9 multi-armed bandit algorithm for the stationary stochastic environment

thompson-sampling ucb moss kl-ucb stochastic-bandit-algorithms bayes-ucb

Updated Apr 24, 2020
MATLAB

salimandre / Monte-Carlo-Tree-Search

We implemented a Monte Carlo Tree Search (MCTS) from scratch and we successfully applied it to Tic-Tac-Toe game.

reinforcement-learning graphics mcts ucb monte-carlo-tree-search tic-tac-toe-game upper-confidence-bound

Updated Jul 9, 2020
Python

SanketAgrawal / ReinforcementLearning

Chapter wise implementation & analysis of all the algorithms in RL : An Intoduction by Richard S. Sutton and Andrew G. Barto

reinforcement-learning artificial-intelligence epsilon-greedy python-3 ucb k-armed-bandit gradient-bandit optimistic-inital-values

Updated Jul 18, 2020
Jupyter Notebook

zamburak

mknbv / zamburak

Bandit algorithms in OCaml

trading ocaml ucb bandit-algorithms stochastic-bandit adversarial-bandit exp3

Updated Jul 22, 2020
OCaml

salimandre / Monte-Carlo-Tree-Search-for-checkers-game

We compare different policies for the checkers game using reinforcement learning algorithms.

python reinforcement-learning turtle-graphics ucb monte-carlo-tree-search checkers-game upper-confidence-bound mcts-algorithm

Updated Aug 24, 2020
Python

educup / ucb-python-api

Python package for Unity Cloud Build api

python api unity poetry unity3d python3 ucb typer python-package unity-tool unity-cloud-build ucb-api

Updated Sep 12, 2020
Python

czahie / CS61A

Structure and Interpretation of Computer Programs

python scheme data-structure sqlite ucb

Updated Sep 12, 2020
Python

Bachfischer / COMP90051-StatML-Assignment-2

Source code for Assignment 2 of COMP90051 (Semester 2 2020)

ucb multi-armed-bandit mab

Updated Oct 21, 2020
Jupyter Notebook

Improve this page

Add a description, image, and links to the ucb topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the ucb topic, visit your repo's landing page and select "manage topics."