Linear Tic-Tac-Toe

A framework for experimenting with different linear function approximators with gradient-descent Sarsa(lambda) following an epsilon-greedy policy in Tic-Tac-Toe. Each experiment is learning the optimal move vs. a random agent, where afterstates are used instead of Q values. This is an undiscounted (gamma = 1) episodic task.

The methods covered are as follows:

Naive coding
- 3 binary features per cell, one feature for each possible value of (opp, empty, self)
Tile coding: Maximum of [X, O, Empty] in a tile
- Horizontal (3 tiles, 9 features)
- Vertical (3 tiles, 9 features)
- diagonal (10 tiles, 30 features)
- All 3 (16 tiles, 48 features)
- NxM overlapping tiles
  - Each tile is parameterized by (min_x, min_y, width, height)
  - Tile counts explored: 5, 10, 16, 20, 25, 30, 50
  - Randomly generated
Kanerva coding [Each base feature is a board state with 9 ternary features]
- Hamming distance is the number of base features matched exactly
- Distance thresholds explore: 0, 1, 2, 3
- Feature counts explored: 15, 30, 45, 60, 75, 100
- Randomly generated

Created by Wesley Tansey 2/10/2013 Released under the MIT license.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Linear Tic-Tac-Toe

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 3 Commits
all_line_tiles		all_line_tiles
diagonal_tiles		diagonal_tiles
horizontal_tiles		horizontal_tiles
kanerva_100_features_and_0_threshold		kanerva_100_features_and_0_threshold
kanerva_100_features_and_1_threshold		kanerva_100_features_and_1_threshold
kanerva_100_features_and_2_threshold		kanerva_100_features_and_2_threshold
kanerva_100_features_and_3_threshold		kanerva_100_features_and_3_threshold
kanerva_15_features_and_0_threshold		kanerva_15_features_and_0_threshold
kanerva_15_features_and_1_threshold		kanerva_15_features_and_1_threshold
kanerva_15_features_and_2_threshold		kanerva_15_features_and_2_threshold
kanerva_15_features_and_3_threshold		kanerva_15_features_and_3_threshold
kanerva_30_features_and_0_threshold		kanerva_30_features_and_0_threshold
kanerva_30_features_and_1_threshold		kanerva_30_features_and_1_threshold
kanerva_30_features_and_2_threshold		kanerva_30_features_and_2_threshold
kanerva_30_features_and_3_threshold		kanerva_30_features_and_3_threshold
kanerva_45_features_and_0_threshold		kanerva_45_features_and_0_threshold
kanerva_45_features_and_1_threshold		kanerva_45_features_and_1_threshold
kanerva_45_features_and_2_threshold		kanerva_45_features_and_2_threshold
kanerva_45_features_and_3_threshold		kanerva_45_features_and_3_threshold
kanerva_60_features_and_0_threshold		kanerva_60_features_and_0_threshold
kanerva_60_features_and_1_threshold		kanerva_60_features_and_1_threshold
kanerva_60_features_and_2_threshold		kanerva_60_features_and_2_threshold
kanerva_60_features_and_3_threshold		kanerva_60_features_and_3_threshold
kanerva_75_features_and_0_threshold		kanerva_75_features_and_0_threshold
kanerva_75_features_and_1_threshold		kanerva_75_features_and_1_threshold
kanerva_75_features_and_2_threshold		kanerva_75_features_and_2_threshold
kanerva_75_features_and_3_threshold		kanerva_75_features_and_3_threshold
naive		naive
random_10_tiles		random_10_tiles
random_16_tiles		random_16_tiles
random_20_tiles		random_20_tiles
random_25_tiles		random_25_tiles
random_30_tiles		random_30_tiles
random_50_tiles		random_50_tiles
random_5_tiles		random_5_tiles
verticle_tiles		verticle_tiles
.DS_Store		.DS_Store
.gitignore		.gitignore
README.md		README.md
all_line_tiles.png		all_line_tiles.png
all_line_tiles_average.csv		all_line_tiles_average.csv
all_line_tiles_stderr.csv		all_line_tiles_stderr.csv
all_line_tiles_stdev.csv		all_line_tiles_stdev.csv
diagonal_tiles.png		diagonal_tiles.png
diagonal_tiles_average.csv		diagonal_tiles_average.csv
diagonal_tiles_stderr.csv		diagonal_tiles_stderr.csv
diagonal_tiles_stdev.csv		diagonal_tiles_stdev.csv
diagonaltiles.png		diagonaltiles.png
features.py		features.py
horizontal_tiles.png		horizontal_tiles.png
horizontal_tiles_average.csv		horizontal_tiles_average.csv
horizontal_tiles_stderr.csv		horizontal_tiles_stderr.csv
horizontal_tiles_stdev.csv		horizontal_tiles_stdev.csv
horizontaltiles.png		horizontaltiles.png
kanerva15featuresand0threshold.png		kanerva15featuresand0threshold.png
kanerva15featuresand2threshold.png		kanerva15featuresand2threshold.png
kanerva15featuresand3threshold.png		kanerva15featuresand3threshold.png
kanerva30featuresand0threshold.png		kanerva30featuresand0threshold.png
kanerva30featuresand1threshold.png		kanerva30featuresand1threshold.png
kanerva30featuresand2threshold.png		kanerva30featuresand2threshold.png
kanerva30featuresand3threshold.png		kanerva30featuresand3threshold.png
kanerva45featuresand0threshold.png		kanerva45featuresand0threshold.png
kanerva45featuresand1threshold.png		kanerva45featuresand1threshold.png
kanerva45featuresand2threshold.png		kanerva45featuresand2threshold.png
kanerva45featuresand3threshold.png		kanerva45featuresand3threshold.png
kanerva60featuresand0threshold.png		kanerva60featuresand0threshold.png
kanerva60featuresand1threshold.png		kanerva60featuresand1threshold.png
kanerva60featuresand2threshold.png		kanerva60featuresand2threshold.png
kanerva60featuresand3threshold.png		kanerva60featuresand3threshold.png
kanerva75featuresand0threshold.png		kanerva75featuresand0threshold.png
kanerva75featuresand1threshold.png		kanerva75featuresand1threshold.png
kanerva75featuresand2threshold.png		kanerva75featuresand2threshold.png
kanerva75featuresand3threshold.png		kanerva75featuresand3threshold.png
kanerva_100_features_and_0_threshold.png		kanerva_100_features_and_0_threshold.png
kanerva_100_features_and_0_threshold_average.csv		kanerva_100_features_and_0_threshold_average.csv
kanerva_100_features_and_0_threshold_stderr.csv		kanerva_100_features_and_0_threshold_stderr.csv
kanerva_100_features_and_0_threshold_stdev.csv		kanerva_100_features_and_0_threshold_stdev.csv
kanerva_100_features_and_1_threshold.png		kanerva_100_features_and_1_threshold.png
kanerva_100_features_and_1_threshold_average.csv		kanerva_100_features_and_1_threshold_average.csv
kanerva_100_features_and_1_threshold_stderr.csv		kanerva_100_features_and_1_threshold_stderr.csv
kanerva_100_features_and_1_threshold_stdev.csv		kanerva_100_features_and_1_threshold_stdev.csv
kanerva_100_features_and_2_threshold.png		kanerva_100_features_and_2_threshold.png
kanerva_100_features_and_2_threshold_average.csv		kanerva_100_features_and_2_threshold_average.csv
kanerva_100_features_and_2_threshold_stderr.csv		kanerva_100_features_and_2_threshold_stderr.csv
kanerva_100_features_and_2_threshold_stdev.csv		kanerva_100_features_and_2_threshold_stdev.csv
kanerva_100_features_and_3_threshold.png		kanerva_100_features_and_3_threshold.png
kanerva_100_features_and_3_threshold_average.csv		kanerva_100_features_and_3_threshold_average.csv
kanerva_100_features_and_3_threshold_stderr.csv		kanerva_100_features_and_3_threshold_stderr.csv
kanerva_100_features_and_3_threshold_stdev.csv		kanerva_100_features_and_3_threshold_stdev.csv
kanerva_15_features_and_0_threshold.png		kanerva_15_features_and_0_threshold.png
kanerva_15_features_and_0_threshold_average.csv		kanerva_15_features_and_0_threshold_average.csv
kanerva_15_features_and_0_threshold_stderr.csv		kanerva_15_features_and_0_threshold_stderr.csv
kanerva_15_features_and_0_threshold_stdev.csv		kanerva_15_features_and_0_threshold_stdev.csv
kanerva_15_features_and_1_threshold.png		kanerva_15_features_and_1_threshold.png
kanerva_15_features_and_1_threshold_average.csv		kanerva_15_features_and_1_threshold_average.csv
kanerva_15_features_and_1_threshold_stderr.csv		kanerva_15_features_and_1_threshold_stderr.csv
kanerva_15_features_and_1_threshold_stdev.csv		kanerva_15_features_and_1_threshold_stdev.csv
kanerva_15_features_and_2_threshold.png		kanerva_15_features_and_2_threshold.png
kanerva_15_features_and_2_threshold_average.csv		kanerva_15_features_and_2_threshold_average.csv
kanerva_15_features_and_2_threshold_stderr.csv		kanerva_15_features_and_2_threshold_stderr.csv

Folders and files

Latest commit

History

Repository files navigation

Linear Tic-Tac-Toe

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages