Skip to content

It GRABs you, it consumes you. What is GRAB doing you ask? It is learning. With 0 domain knowledge.

Notifications You must be signed in to change notification settings

instance01/GRAB0

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Based on bootleg AlphaZero.

This repository implements GRAB0 with two approaches - GBTS (Gradient Bandit Tree Search), which is MCTS but with Gradient Bandits, and SGBS (Stacked Gradient Bandit Search) with a stack of Gradient Bandits.

Below is SGBS in a nutshell:

Experimental results for GRAB0 can be found in results.md. There are a few more that are currently not pushed yet, more information will follow here.

Below is the result of training 33 runs on Cartpole using the GRAB0 approach in combination with SGBS.

Installation

Run ./setup.sh. The src folder will contain a new binary GRAB0. You can adjust experiments in simulations.json.

About

It GRABs you, it consumes you. What is GRAB doing you ask? It is learning. With 0 domain knowledge.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published