All projects about ucb-61b(2014 spring), http://www.cs.berkeley.edu/~jrs/61b/index.html
-
Updated
Apr 1, 2016 - Java
All projects about ucb-61b(2014 spring), http://www.cs.berkeley.edu/~jrs/61b/index.html
All projects about ucb-cs186(fall 2013), you can get information from the course website(https://sites.google.com/site/cs186fall2013)
Thompson Sampling for Bandits using UCB policy
Oware and Ouril - traditional African Mancala games with computer AI using Monte Carlo Tree Search (MCTS) with UCB (Upper Confidence Bounds) applied to trees (UCT in short)
Alquerque - a 2 player abstract strategic perfect information traditional board game with computer AI option.
Implementations of basic concepts dealt under the Reinforcement Learning umbrella. This project is collection of assignments in CS747: Foundations of Intelligent and Learning Agents (Autumn 2017) at IIT Bombay
A Julia Package for providing Multi Armed Bandit Experiments
R.I.T project
Foundations Of Intelligent Learning Agents (FILA) Assignments
Codes and templates for ML algorithms created, modified and optimized in Python and R.
Implementation of 9 multi-armed bandit algorithm for the stationary stochastic environment
We implemented a Monte Carlo Tree Search (MCTS) from scratch and we successfully applied it to Tic-Tac-Toe game.
Chapter wise implementation & analysis of all the algorithms in RL : An Intoduction by Richard S. Sutton and Andrew G. Barto
Bandit algorithms in OCaml
We compare different policies for the checkers game using reinforcement learning algorithms.
Python package for Unity Cloud Build api
Structure and Interpretation of Computer Programs
Source code for Assignment 2 of COMP90051 (Semester 2 2020)
Add a description, image, and links to the ucb topic page so that developers can more easily learn about it.
To associate your repository with the ucb topic, visit your repo's landing page and select "manage topics."