## Complexity and Optimal Performance by Maze

The measures below come from the paper by Zatuchna and Bagnall and are reported in each section that presents our results:

> Zhanna V. Zatuchna and Anthony Bagnall. 2009. Learning Mazes with Aliasing States: An LCS Algorithm with Associative Perception. Adaptive Behavior - Animals, Animats, Software Agents, Robots, Adaptive Systems 17, 1 (February 2009), 28-57. DOI=http://dx.doi.org/10.1177/1059712308099230

$\phi$ measures the average distance to the reward in a maze. Its calculation depends mainly on the type of maze: we report $\phi$' for aliased mazes and the original $\phi$ otherwise.

$\psi$ measures the complexity of the maze. It depends on the average distance to the reward and on the average number of steps taken by a trained Q-learning agent.
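
As a rough illustration of these two measures, the sketch below computes the average shortest distance to the reward over the free cells of a small grid by breadth-first search, then forms a complexity ratio. The toy maze, the eight-move neighbourhood, the function names, and the reading of $\psi$ as the ratio of trained Q-learning steps to the average distance are all assumptions made for this example. The original paper remains the reference for the exact definitions, in particular for $\phi$' on aliased mazes.

```python
from collections import deque

# Hypothetical toy maze, not one of the benchmark mazes.
# '#' = wall, '.' = free cell, 'R' = reward cell.
TOY_MAZE = [
    "#####",
    "#..R#",
    "#.#.#",
    "#...#",
    "#####",
]

# Assumption: the agent can move to any of its eight neighbouring cells.
MOVES = [(-1, 0), (-1, 1), (0, 1), (1, 1), (1, 0), (1, -1), (0, -1), (-1, -1)]


def average_distance_to_reward(maze):
    """Mean shortest-path length from every reachable free cell to the reward."""
    start = next(
        (r, c)
        for r, row in enumerate(maze)
        for c, cell in enumerate(row)
        if cell == "R"
    )
    distance = {start: 0}
    queue = deque([start])
    while queue:
        r, c = queue.popleft()
        for dr, dc in MOVES:
            nxt = (r + dr, c + dc)
            # Skip cells outside the grid, walls, and cells already reached.
            if not (0 <= nxt[0] < len(maze) and 0 <= nxt[1] < len(maze[0])):
                continue
            if maze[nxt[0]][nxt[1]] == "#" or nxt in distance:
                continue
            distance[nxt] = distance[(r, c)] + 1
            queue.append(nxt)
    free_cells = [d for cell, d in distance.items() if cell != start]
    return sum(free_cells) / len(free_cells)


def complexity_ratio(q_learning_avg_steps, avg_distance):
    """Assumed form of psi: trained Q-learning steps over average distance."""
    return q_learning_avg_steps / avg_distance


phi = average_distance_to_reward(TOY_MAZE)
print(round(phi, 2))  # 1.86, i.e. 13 / 7 over the seven free cells
```

Under this assumed reading, a ratio close to 1 would mean that the trained Q-learning agent is about as fast as the shortest paths allow.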

The question marks (**?**) highlight some discrepancies present in the original paper.

The asterisks (**\***) indicate values that were not provided.

For further details, please see the original paper.

## Description of the parameters
 
The parameters used are:
 
- no use of action planning
- use of subsumption in the anticipatory learning process
- no use of genetic algorithms
- $\gamma$ = 0.95
- $\theta_i$ = 0.1
- $\theta_r$ = 0.9
- $u_{max}$ = 8 (length of the condition part of classifiers)
- $\theta_{exp}$ = 20
- $\beta$ = 0.05 if all learning modules are used, otherwise $\beta$ = 0.00
- $\epsilon$ = 0.8 in exploration, otherwise $\epsilon$ = 0.0
- `ACS2-NO-RL` refers to ACS2 without the reinforcement module in the exploitation phase.
- `ACS2-RL` refers to ACS2 using the reinforcement module in the exploitation phase.
- `BACS-NO-RL-1` refers to BACS without the reinforcement module in the exploitation phase with $bs_{max} = 1$.
- `BACS-RL-1` refers to BACS using the reinforcement module in the exploitation phase with $bs_{max} = 1$ and $\beta = 0.05$.
- `BACS-NO-RL-2` refers to BACS without the reinforcement module in the exploitation phase with $bs_{max} = 2$.
- `BACS-RL-2` refers to BACS using the reinforcement module in the exploitation phase with $bs_{max} = 2$ and $\beta = 0.05$.
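
The settings listed above can be gathered in a small configuration table. The following is a minimal sketch, not the code used for the experiments: the `AgentConfig` class, its field names, and the `VARIANTS` dictionary are hypothetical, and the `beta` method encodes one reading of the list, namely that $\beta$ drops to 0.00 only in exploitation when the reinforcement module is switched off.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass(frozen=True)
class AgentConfig:
    """Hypothetical container for the parameters listed above."""

    agent: str                     # "ACS2" or "BACS"
    rl_in_exploitation: bool       # reinforcement module active in exploitation?
    bs_max: Optional[int] = None   # only given for the BACS variants
    gamma: float = 0.95
    theta_i: float = 0.1
    theta_r: float = 0.9
    u_max: int = 8
    theta_exp: int = 20
    action_planning: bool = False
    subsumption: bool = True
    genetic_algorithm: bool = False

    def beta(self, exploring: bool) -> float:
        # Assumed reading: all learning modules are active during exploration.
        return 0.05 if exploring or self.rl_in_exploitation else 0.0

    def epsilon(self, exploring: bool) -> float:
        return 0.8 if exploring else 0.0


VARIANTS = {
    "ACS2-NO-RL": AgentConfig("ACS2", rl_in_exploitation=False),
    "ACS2-RL": AgentConfig("ACS2", rl_in_exploitation=True),
    "BACS-NO-RL-1": AgentConfig("BACS", rl_in_exploitation=False, bs_max=1),
    "BACS-RL-1": AgentConfig("BACS", rl_in_exploitation=True, bs_max=1),
    "BACS-NO-RL-2": AgentConfig("BACS", rl_in_exploitation=False, bs_max=2),
    "BACS-RL-2": AgentConfig("BACS", rl_in_exploitation=True, bs_max=2),
}

for name, cfg in VARIANTS.items():
    print(name, cfg.beta(exploring=False), cfg.epsilon(exploring=False))
```

Keeping the six variants in one table makes it easy to check that they differ only in the agent, in $bs_{max}$, and in whether reinforcement stays active during exploitation.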

We do not use the generalization mechanism provided by genetic algorithms in these experiments, because it needs a complete update and a new experimental protocol. This is a work in progress.

The mazes in the following tables are sorted by aliasing type and then by complexity, from highest to lowest.

To compute all averages and results, we repeated each experiment thirty times. One experiment consists of a series of trials in which the agent has at most 100 steps to find the exit. The agent first runs 1000 trials in exploration mode to build its internal representation of the environment, then 500 trials in exploitation mode to find the exit as fast as possible.
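
The protocol above can be sketched as a pair of nested loops. This is only an illustration under stated assumptions: `run_trial` is a hypothetical stand-in that samples a step count instead of running an agent in a maze, the dictionary keys are made-up names, and the use of the sample standard deviation over the thirty per-run averages is a guess at how the statistics could be aggregated.

```python
import random
import statistics

N_RUNS = 30            # repetitions of each experiment
EXPLORE_TRIALS = 1000  # trials in exploration mode
EXPLOIT_TRIALS = 500   # trials in exploitation mode
MAX_STEPS = 100        # step budget per trial


def run_trial(rng, exploring):
    """Hypothetical stand-in for one trial: returns the steps used, capped at MAX_STEPS.

    A real trial would let the agent act in the maze until it finds the exit
    or runs out of steps; here a step count is simply sampled.
    """
    mean_steps = 30 if exploring else 5
    return min(MAX_STEPS, 1 + int(rng.expovariate(1 / mean_steps)))


def run_experiment(seed):
    """One experiment: exploration trials followed by exploitation trials."""
    rng = random.Random(seed)
    explore = [run_trial(rng, exploring=True) for _ in range(EXPLORE_TRIALS)]
    exploit = [run_trial(rng, exploring=False) for _ in range(EXPLOIT_TRIALS)]
    return statistics.mean(explore), statistics.mean(exploit)


def summarize(n_runs=N_RUNS):
    """Aggregate the per-run averages over all repetitions."""
    results = [run_experiment(seed) for seed in range(n_runs)]
    explore_avgs = [explore for explore, _ in results]
    exploit_avgs = [exploit for _, exploit in results]
    return {
        "exploration_avg": statistics.mean(explore_avgs),
        "exploration_std": statistics.stdev(explore_avgs),
        "exploitation_avg": statistics.mean(exploit_avgs),
        "exploitation_std": statistics.stdev(exploit_avgs),
        "best_exploitation": min(exploit_avgs),
        "worst_exploitation": max(exploit_avgs),
    }


summary = summarize()
for key, value in summary.items():
    print(f"{key}: {value:.2f}")
```

Seeding each repetition separately keeps the thirty runs independent and reproducible, which matters when comparing variants run by run.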

## Type III Mazes

|       |MazeE2      |Woods101demi|Maze10      |Woods102    |Woods100    |Woods101    |MazeE1       |
|-------|:----------:|:----------:|:----------:|:----------:|:----------:|:----------:|:-----------:|
|$\phi$'|2.33        |3.1         |5.17        |3.31        |2.33        |2.9         |3.07         |
|$\psi$ |251.2       |251 **?**   |171         |167         |166         |149         |167 **?**    |

| | Maze | Exploration Avg | Exploration Std | Exploitation Avg | Exploitation Std | Best Exploitation | Worst Exploitation | Successful tries | Knowledge Avg | Knowledge Std | Population Avg | Population Std | Reliable Avg | Reliable Std |
|----|------------|:-----:|:-----:|:----:|:----:|:----:|:----:|:----:|:----:|:----:|:----:|:----:|:----:|:----:|
|ACS2-NO-RL|MazeE2-v0|28.14|1.02|59.79|3.44|52.42|71.86|0|35.92|1.97|307.43|19.18|104.33|5.32|
|ACS2-RL|MazeE2-v0|27.95|1.10|30.68|3.82|22.54|38.83|0|36.03|1.90|310.70|16.71|104.70|6.11|
|BACS-NO-RL-1|MazeE2-v0|30.46|4.27|18.83|15.82|3.73|59.30|0|39.07|4.67|1000.33|55.35|86.13|11.78|
|BACS-RL-1|MazeE2-v0|32.82|4.52|5.66|1.03|3.68|8.78|0|40.21|3.42|977.17|79.62|86.70|7.78|
|BACS-NO-RL-2|MazeE2-v0|35.67|5.36|23.42|14.51|3.79|51.14|0|29.70|3.10|1455.87|95.43|60.73|5.22|
|BACS-RL-2|MazeE2-v0|35.80|4.87|5.76|0.91|4.39|8.27|0|28.48|4.02|1468.07|91.32|58.23|6.75|

| | Maze | Exploration Avg | Exploration Std | Exploitation Avg | Exploitation Std | Best Exploitation | Worst Exploitation | Successful tries | Knowledge Avg | Knowledge Std | Population Avg | Population Std | Reliable Avg | Reliable Std |
|----|------------|:-----:|:-----:|:----:|:----:|:----:|:----:|:----:|:----:|:----:|:----:|:----:|:----:|:----:|
|ACS2-NO-RL|Woods101demi-v0|36.26|1.95|68.03|3.69|62.14|75.57|0|52.38|0.00|59.70|4.82|24.83|0.93|
|ACS2-RL|Woods101demi-v0|36.71|1.60|40.21|2.09|36.70|43.98|0|52.38|0.00|59.70|5.46|24.53|0.81|
|BACS-NO-RL-1|Woods101demi-v0|32.33|3.74|18.66|4.69|11.33|25.65|0|90.48|0.00|161.30|9.38|79.63|3.22|
|BACS-RL-1|Woods101demi-v0|32.35|4.35|4.72|0.97|3.16|6.82|0|90.48|0.00|168.37|11.31|79.83|3.38|
|BACS-NO-RL-2|Woods101demi-v0|27.54|3.48|22.36|1.49|19.36|24.71|0|90.40|0.43|208.70|6.90|78.03|2.48|
|BACS-RL-2|Woods101demi-v0|28.32|4.44|10.06|2.01|6.15|13.81|0|90.48|0.00|206.23|12.64|77.47|3.04|

| | Maze | Exploration Avg | Exploration Std | Exploitation Avg | Exploitation Std | Best Exploitation | Worst Exploitation | Successful tries | Knowledge Avg | Knowledge Std | Population Avg | Population Std | Reliable Avg | Reliable Std |
|----|------------|:-----:|:-----:|:----:|:----:|:----:|:----:|:----:|:----:|:----:|:----:|:----:|:----:|:----:|
|ACS2-NO-RL|Maze10-v0|51.69|1.10|75.96|7.04|53.54|94.06|0|64.89|1.72|53.27|4.47|28.87|1.36|
|ACS2-RL|Maze10-v0|52.12|1.23|63.27|4.80|55.85|71.85|0|65.25|2.15|53.87|4.05|28.97|1.38|
|BACS-NO-RL-1|Maze10-v0|55.62|4.47|72.05|16.75|48.32|94.60|0|90.07|1.14|149.07|8.66|53.17|1.67|
|BACS-RL-1|Maze10-v0|52.87|6.20|22.96|15.56|7.22|54.83|0|90.21|1.30|148.47|7.83|53.30|1.70|
|BACS-NO-RL-2|Maze10-v0|44.54|5.89|41.98|24.98|5.68|96.83|0|93.62|1.23|200.63|11.67|56.00|5.99|
|BACS-RL-2|Maze10-v0|44.83|6.80|8.06|1.76|5.82|12.56|0|93.48|1.55|200.47|14.31|56.23|3.31|

| | Maze | Exploration Avg | Exploration Std | Exploitation Avg | Exploitation Std | Best Exploitation | Worst Exploitation | Successful tries | Knowledge Avg | Knowledge Std | Population Avg | Population Std | Reliable Avg | Reliable Std |
|----|------------|:-----:|:-----:|:----:|:----:|:----:|:----:|:----:|:----:|:----:|:----:|:----:|:----:|:----:|
|ACS2-NO-RL|Woods102-v0|39.19|1.23|59.16|9.42|43.92|68.56|0|51.22|0.00|74.27|2.10|38.00|0.00|
|ACS2-RL|Woods102-v0|39.73|1.35|22.22|2.03|18.70|28.23|0|51.22|0.00|74.73|1.81|38.00|0.00|
|BACS-NO-RL-1|Woods102-v0|33.39|4.66|10.85|5.79|3.71|20.79|0|94.76|0.71|254.37|8.26|129.93|2.89|
|BACS-RL-1|Woods102-v0|33.89|3.99|4.24|0.26|3.68|4.88|0|94.72|0.73|255.40|8.28|128.57|3.86|
|BACS-NO-RL-2|Woods102-v0|34.92|4.71|13.42|5.25|3.57|26.09|0|93.46|1.20|390.43|25.71|144.50|7.03|
|BACS-RL-2|Woods102-v0|35.47|3.93|4.38|0.36|3.60|5.06|0|93.41|1.85|382.73|22.09|146.40|9.11|

| | Maze | Exploration Avg | Exploration Std | Exploitation Avg | Exploitation Std | Best Exploitation | Worst Exploitation | Successful tries | Knowledge Avg | Knowledge Std | Population Avg | Population Std | Reliable Avg | Reliable Std |
|----|------------|:-----:|:-----:|:----:|:----:|:----:|:----:|:----:|:----:|:----:|:----:|:----:|:----:|:----:|
|ACS2-NO-RL|Woods100-v0|15.73|0.48|34.29|2.25|29.82|39.49|0|60.00|0.00|10.00|0.00|6.00|0.00|
|ACS2-RL|Woods100-v0|15.92|0.44|9.82|1.05|7.90|12.80|0|60.00|0.00|10.00|0.00|6.00|0.00|
|BACS-NO-RL-1|Woods100-v0|11.43|1.69|2.33|0.06|2.24|2.47|0|100.00|0.00|15.03|0.55|10.00|0.00|
|BACS-RL-1|Woods100-v0|12.12|1.56|2.33|0.05|2.20|2.43|0|100.00|0.00|15.17|0.58|10.00|0.00|
|BACS-NO-RL-2|Woods100-v0|11.43|1.56|2.34|0.05|2.22|2.49|0|100.00|0.00|15.33|0.60|10.00|0.00|
|BACS-RL-2|Woods100-v0|11.38|1.57|2.34|0.05|2.22|2.44|0|100.00|0.00|15.13|0.72|10.00|0.00|

| | Maze | Exploration Avg | Exploration Std | Exploitation Avg | Exploitation Std | Best Exploitation | Worst Exploitation | Successful tries | Knowledge Avg | Knowledge Std | Population Avg | Population Std | Reliable Avg | Reliable Std |
|----|------------|:-----:|:-----:|:----:|:----:|:----:|:----:|:----:|:----:|:----:|:----:|:----:|:----:|:----:|
|ACS2-NO-RL|Woods101-v0|33.26|0.93|42.01|2.04|38.41|46.54|0|63.46|1.58|33.63|2.55|21.13|0.43|
|ACS2-RL|Woods101-v0|33.31|1.08|13.66|1.40|11.85|19.06|0|63.09|0.66|34.17|3.09|21.03|0.18|
|BACS-NO-RL-1|Woods101-v0|26.49|3.40|3.09|0.15|2.93|3.63|0|92.59|0.00|71.73|4.79|50.97|1.25|
|BACS-RL-1|Woods101-v0|26.38|3.15|3.06|0.11|2.91|3.43|0|92.59|0.00|74.20|4.04|51.23|1.09|
|BACS-NO-RL-2|Woods101-v0|25.85|2.77|3.10|0.14|2.91|3.44|0|92.59|0.00|75.37|5.78|53.87|2.46|
|BACS-RL-2|Woods101-v0|26.13|3.00|3.05|0.09|2.92|3.22|0|92.59|0.00|73.63|4.94|52.17|2.35|

| | Maze | Exploration Avg | Exploration Std | Exploitation Avg | Exploitation Std | Best Exploitation | Worst Exploitation | Successful tries | Knowledge Avg | Knowledge Std | Population Avg | Population Std | Reliable Avg | Reliable Std |
|----|------------|:-----:|:-----:|:----:|:----:|:----:|:----:|:----:|:----:|:----:|:----:|:----:|:----:|:----:|
|ACS2-NO-RL|MazeE1-v0|20.75|0.87|35.43|8.37|14.66|48.55|0|53.24|0.30|345.83|17.80|154.20|4.98|
|ACS2-RL|MazeE1-v0|20.89|0.82|5.11|0.80|4.00|6.68|0|53.29|0.35|343.70|14.74|155.23|5.68|
|BACS-NO-RL-1|MazeE1-v0|25.02|3.11|5.79|2.74|3.11|13.00|0|57.92|3.88|1112.27|46.29|206.17|18.64|
|BACS-RL-1|MazeE1-v0|24.90|3.64|3.33|0.16|3.00|3.74|0|57.46|4.13|1108.17|56.83|206.77|25.69|
|BACS-NO-RL-2|MazeE1-v0|28.59|4.37|5.63|2.64|3.21|13.31|0|41.58|5.49|1242.33|77.79|133.80|22.13|
|BACS-RL-2|MazeE1-v0|27.78|2.48|3.56|0.34|3.27|4.94|0|39.76|4.31|1253.77|47.61|131.43|19.24|

## Type II Mazes

|       |MazeF4      |Maze7       |MiyazakiB   |
|-------|:----------:|:----------:|:----------:|
|$\phi$'|4.5 **?**   |4.33        |3.33        |
|$\psi$ |47 **?**    |82 **?**    |1.03        |

| | Maze | Exploration Avg | Exploration Std | Exploitation Avg | Exploitation Std | Best Exploitation | Worst Exploitation | Successful tries | Knowledge Avg | Knowledge Std | Population Avg | Population Std | Reliable Avg | Reliable Std |
|----|------------|:-----:|:-----:|:----:|:----:|:----:|:----:|:----:|:----:|:----:|:----:|:----:|:----:|:----:|
|ACS2-NO-RL|MazeF4-v0|58.08|2.47|53.56|8.58|39.39|88.52|0|88.13|3.18|43.40|3.33|31.60|4.94|
|ACS2-RL|MazeF4-v0|56.90|2.47|46.53|8.96|31.91|66.90|0|88.00|2.73|43.10|3.79|31.13|4.68|
|BACS-NO-RL-1|MazeF4-v0|26.79|2.90|4.53|0.09|4.38|4.74|0|100.00|0.00|51.63|4.29|33.10|0.40|
|BACS-RL-1|MazeF4-v0|26.74|3.08|4.51|0.11|4.27|4.73|0|100.00|0.00|51.77|5.86|33.00|0.45|
|BACS-NO-RL-2|MazeF4-v0|26.92|4.46|4.23|0.40|3.56|4.67|0|100.00|0.00|55.57|5.01|34.30|1.16|
|BACS-RL-2|MazeF4-v0|26.62|5.85|4.22|0.46|3.56|5.05|0|100.00|0.00|59.53|7.42|34.83|1.65|

| | Maze | Exploration Avg | Exploration Std | Exploitation Avg | Exploitation Std | Best Exploitation | Worst Exploitation | Successful tries | Knowledge Avg | Knowledge Std | Population Avg | Population Std | Reliable Avg | Reliable Std |
|----|------------|:-----:|:-----:|:----:|:----:|:----:|:----:|:----:|:----:|:----:|:----:|:----:|:----:|:----:|
|ACS2-NO-RL|Maze7-v0|51.57|1.94|46.23|6.41|25.08|58.08|0|82.38|2.79|35.93|1.88|29.50|1.18|
|ACS2-RL|Maze7-v0|51.47|2.24|45.37|6.27|36.38|60.66|0|82.54|2.84|36.47|1.89|29.87|1.59|
|BACS-NO-RL-1|Maze7-v0|25.12|3.84|4.34|0.09|4.17|4.54|0|100.00|0.00|50.43|3.02|36.93|0.25|
|BACS-RL-1|Maze7-v0|25.27|2.62|4.36|0.11|4.14|4.55|0|100.00|0.00|50.20|3.06|37.00|0.00|
|BACS-NO-RL-2|Maze7-v0|23.18|3.82|3.98|0.42|3.46|4.98|0|100.00|0.00|57.77|6.10|38.87|1.86|
|BACS-RL-2|Maze7-v0|24.59|4.37|4.05|0.41|3.35|4.78|0|100.00|0.00|55.70|5.32|38.30|1.70|

| | Maze | Exploration Avg | Exploration Std | Exploitation Avg | Exploitation Std | Best Exploitation | Worst Exploitation | Successful tries | Knowledge Avg | Knowledge Std | Population Avg | Population Std | Reliable Avg | Reliable Std |
|----|------------|:-----:|:-----:|:----:|:----:|:----:|:----:|:----:|:----:|:----:|:----:|:----:|:----:|:----:|
|ACS2-NO-RL|MiyazakiB-v0|40.58|3.03|5.48|5.52|3.48|28.24|0|77.98|0.32|199.47|5.94|138.60|2.47|
|ACS2-RL|MiyazakiB-v0|40.31|3.01|3.75|0.12|3.54|4.07|0|77.86|0.26|197.77|9.61|138.10|4.34|
|BACS-NO-RL-1|MiyazakiB-v0|42.73|6.33|3.83|0.23|3.37|4.28|0|99.36|0.75|527.97|34.71|294.33|13.01|
|BACS-RL-1|MiyazakiB-v0|42.09|5.63|3.99|0.32|3.33|4.78|0|99.55|0.81|522.97|31.64|299.67|11.84|
|BACS-NO-RL-2|MiyazakiB-v0|42.41|6.53|4.19|1.01|3.50|8.76|0|96.29|2.48|616.90|45.12|292.80|18.37|
|BACS-RL-2|MiyazakiB-v0|43.91|5.09|4.05|0.32|3.44|4.85|0|96.67|2.75|620.23|55.28|298.87|22.20|

## Type I Mazes

|       |MazeB       |Littman89   |MiyazakiA   |MazeD       |Cassandra4x4|Littman57   |
|-------|:----------:|:----------:|:----------:|:----------:|:----------:|:----------:|
|$\phi$'|3.5         |3.77        |3.05        |2.75        |2.27        |3.71        |
|$\psi$ |1.26        |61 **?**    |69 **?**    |1.03        |1           |154 **?**   |

| | Maze | Exploration Avg | Exploration Std | Exploitation Avg | Exploitation Std | Best Exploitation | Worst Exploitation | Successful tries | Knowledge Avg | Knowledge Std | Population Avg | Population Std | Reliable Avg | Reliable Std |
|----|------------|:-----:|:-----:|:----:|:----:|:----:|:----:|:----:|:----:|:----:|:----:|:----:|:----:|:----:|
|ACS2-NO-RL|MazeB-v0|35.75|2.59|15.51|17.40|3.82|57.56|0|80.42|0.18|161.97|7.14|120.50|4.01|
|ACS2-RL|MazeB-v0|36.43|1.46|4.42|0.42|3.94|5.55|0|80.49|0.29|162.33|7.31|119.37|4.32|
|BACS-NO-RL-1|MazeB-v0|33.71|5.48|5.94|3.83|3.68|18.53|0|99.90|0.29|275.50|14.43|191.40|7.01|
|BACS-RL-1|MazeB-v0|32.33|3.82|4.11|0.20|3.78|4.47|0|99.97|0.18|279.93|19.61|192.73|8.72|
|BACS-NO-RL-2|MazeB-v0|37.47|4.55|6.27|3.66|3.66|16.61|0|99.77|0.55|297.93|19.57|191.77|7.85|
|BACS-RL-2|MazeB-v0|35.42|4.31|4.13|0.23|3.76|4.57|0|99.87|0.33|290.93|15.99|191.43|8.70|

| | Maze | Exploration Avg | Exploration Std | Exploitation Avg | Exploitation Std | Best Exploitation | Worst Exploitation | Successful tries | Knowledge Avg | Knowledge Std | Population Avg | Population Std | Reliable Avg | Reliable Std |
|----|------------|:-----:|:-----:|:----:|:----:|:----:|:----:|:----:|:----:|:----:|:----:|:----:|:----:|:----:|
|ACS2-NO-RL|Littman89-v0|33.14|3.67|30.08|28.90|4.45|66.84|0|71.47|0.23|99.97|3.95|71.17|0.64|
|ACS2-RL|Littman89-v0|33.11|3.24|6.08|1.45|4.22|11.04|0|71.56|0.51|99.73|4.43|71.17|0.78|
|BACS-NO-RL-1|Littman89-v0|31.59|4.42|5.23|4.39|4.10|28.84|0|100.00|0.00|205.73|11.21|150.53|5.35|
|BACS-RL-1|Littman89-v0|31.85|4.88|4.44|0.22|4.07|4.87|0|99.91|0.32|201.47|8.91|150.97|4.09|
|BACS-NO-RL-2|Littman89-v0|34.88|4.68|4.55|0.28|3.96|5.19|0|99.83|0.55|217.90|12.01|154.67|4.80|
|BACS-RL-2|Littman89-v0|32.77|4.12|4.44|0.30|3.93|5.06|0|100.00|0.00|218.20|13.03|154.00|5.46|

| | Maze | Exploration Avg | Exploration Std | Exploitation Avg | Exploitation Std | Best Exploitation | Worst Exploitation | Successful tries | Knowledge Avg | Knowledge Std | Population Avg | Population Std | Reliable Avg | Reliable Std |
|----|------------|:-----:|:-----:|:----:|:----:|:----:|:----:|:----:|:----:|:----:|:----:|:----:|:----:|:----:|
|ACS2-NO-RL|MiyazakiA-v0|27.57|1.80|8.13|11.26|3.35|42.08|0|70.64|0.12|213.10|8.95|130.57|4.89|
|ACS2-RL|MiyazakiA-v0|27.92|1.58|3.73|0.24|3.35|4.35|0|70.62|0.17|214.50|10.77|131.17|4.60|
|BACS-NO-RL-1|MiyazakiA-v0|28.73|4.66|4.38|3.85|3.18|24.57|0|93.73|2.93|675.00|47.53|302.63|28.28|
|BACS-RL-1|MiyazakiA-v0|30.11|4.79|3.58|0.28|3.07|4.33|0|93.80|2.70|682.07|34.25|306.07|27.83|
|BACS-NO-RL-2|MiyazakiA-v0|29.94|4.68|4.88|2.34|3.20|11.12|0|83.62|5.42|776.70|56.92|237.50|32.55|
|BACS-RL-2|MiyazakiA-v0|30.48|4.97|3.62|0.31|3.14|4.35|0|84.64|5.30|806.80|61.95|250.63|31.89|

| | Maze | Exploration Avg | Exploration Std | Exploitation Avg | Exploitation Std | Best Exploitation | Worst Exploitation | Successful tries | Knowledge Avg | Knowledge Std | Population Avg | Population Std | Reliable Avg | Reliable Std |
|----|------------|:-----:|:-----:|:----:|:----:|:----:|:----:|:----:|:----:|:----:|:----:|:----:|:----:|:----:|
|ACS2-NO-RL|MazeD-v0|24.64|2.02|2.79|0.07|2.62|2.95|0|87.50|0.00|128.50|5.45|109.10|4.11|
|ACS2-RL|MazeD-v0|24.64|2.05|2.77|0.06|2.64|2.90|0|87.50|0.00|126.63|4.25|106.77|2.84|
|BACS-NO-RL-1|MazeD-v0|26.55|3.78|2.98|0.17|2.64|3.27|0|99.86|0.35|211.37|9.48|180.07|5.18|
|BACS-RL-1|MazeD-v0|29.66|4.94|3.03|0.20|2.71|3.61|0|99.97|0.19|211.17|8.04|181.20|5.43|
|BACS-NO-RL-2|MazeD-v0|28.70|4.25|3.34|1.08|2.78|7.83|0|99.72|1.17|216.43|10.07|181.30|4.57|
|BACS-RL-2|MazeD-v0|28.83|3.65|3.02|0.16|2.79|3.46|0|99.90|0.41|217.43|9.68|180.07|3.54|

| | Maze | Exploration Avg | Exploration Std | Exploitation Avg | Exploitation Std | Best Exploitation | Worst Exploitation | Successful tries | Knowledge Avg | Knowledge Std | Population Avg | Population Std | Reliable Avg | Reliable Std |
|----|------------|:-----:|:-----:|:----:|:----:|:----:|:----:|:----:|:----:|:----:|:----:|:----:|:----:|:----:|
|ACS2-NO-RL|Cassandra4x4-v0|14.58|0.98|3.24|0.44|2.67|4.14|0|50.99|0.79|126.53|8.39|54.53|3.57|
|ACS2-RL|Cassandra4x4-v0|14.51|0.85|3.19|0.44|2.24|4.01|0|50.86|0.74|126.63|7.83|54.80|3.15|
|BACS-NO-RL-1|Cassandra4x4-v0|17.11|2.85|2.85|0.36|2.34|3.75|0|79.96|3.38|354.63|33.91|110.23|6.08|
|BACS-RL-1|Cassandra4x4-v0|17.05|3.34|2.98|0.41|2.39|4.15|0|81.07|2.82|355.80|25.58|110.23|7.93|
|BACS-NO-RL-2|Cassandra4x4-v0|18.56|3.51|3.53|3.20|2.42|20.65|0|79.79|4.46|488.97|34.95|105.03|11.40|
|BACS-RL-2|Cassandra4x4-v0|18.26|3.68|2.89|0.34|2.41|3.86|0|79.71|3.46|487.73|42.14|103.93|8.90|

| | Maze | Exploration Avg | Exploration Std | Exploitation Avg | Exploitation Std | Best Exploitation | Worst Exploitation | Successful tries | Knowledge Avg | Knowledge Std | Population Avg | Population Std | Reliable Avg | Reliable Std |
|----|------------|:-----:|:-----:|:----:|:----:|:----:|:----:|:----:|:----:|:----:|:----:|:----:|:----:|:----:|
|ACS2-NO-RL|Littman57-v0|21.47|0.98|33.09|27.46|3.53|61.65|0|73.82|3.67|34.33|1.01|27.00|1.15|
|ACS2-RL|Littman57-v0|22.20|1.09|6.93|5.87|3.53|28.20|0|73.33|3.02|34.73|0.68|27.27|0.96|
|BACS-NO-RL-1|Littman57-v0|23.48|3.12|10.22|16.51|3.83|59.97|0|90.33|0.44|57.07|2.59|31.13|0.50|
|BACS-RL-1|Littman57-v0|22.19|3.31|4.50|0.38|4.04|5.51|0|90.33|0.44|57.40|2.69|31.20|0.54|
|BACS-NO-RL-2|Littman57-v0|23.84|3.55|9.66|15.85|4.11|61.14|0|90.41|0.61|58.17|3.44|31.13|0.50|
|BACS-RL-2|Littman57-v0|21.31|2.51|4.68|0.84|3.99|8.18|0|90.57|0.83|57.50|2.79|31.50|0.67|

## Not Aliased Mazes

|       |Maze4       |Maze5       |MazeA       |MazeF1      |MazeF2      |MazeF3      |Woods1      |Woods14     |
|-------|:----------:|:----------:|:----------:|:----------:|:----------:|:----------:|:----------:|:----------:|
|$\phi$ |3.5         |4.61        |4.23        |1.8         |2.5         |3.38        |1.63        |9.5         |
|$\psi$ | **\***     | **\***     | **\***     | **\***     | **\***     | **\***     | **\***     | **\***     |

| | Maze | Exploration Avg | Exploration Std | Exploitation Avg | Exploitation Std | Best Exploitation | Worst Exploitation | Successful tries | Knowledge Avg | Knowledge Std | Population Avg | Population Std | Reliable Avg | Reliable Std |
|----|------------|:-----:|:-----:|:----:|:----:|:----:|:----:|:----:|:----:|:----:|:----:|:----:|:----:|:----:|
|ACS2-NO-RL|Maze4-v0|32.41|1.52|3.49|0.05|3.40|3.62|0|100.00|0.00|162.23|4.42|162.23|4.42|
|ACS2-RL|Maze4-v0|32.57|2.19|3.52|0.08|3.40|3.74|0|100.00|0.00|163.40|4.31|163.40|4.31|
|BACS-NO-RL-1|Maze4-v0|32.81|4.28|3.51|0.07|3.34|3.66|0|100.00|0.00|161.87|4.75|161.87|4.75|
|BACS-RL-1|Maze4-v0|32.46|2.62|3.48|0.07|3.36|3.64|0|100.00|0.00|162.97|5.59|162.97|5.59|
|BACS-NO-RL-2|Maze4-v0|32.74|4.44|3.50|0.04|3.42|3.57|0|100.00|0.00|161.47|3.69|161.47|3.69|
|BACS-RL-2|Maze4-v0|33.04|4.77|3.50|0.06|3.34|3.58|0|100.00|0.00|160.30|3.46|160.30|3.46|

| | Maze | Exploration Avg | Exploration Std | Exploitation Avg | Exploitation Std | Best Exploitation | Worst Exploitation | Successful tries | Knowledge Avg | Knowledge Std | Population Avg | Population Std | Reliable Avg | Reliable Std |
|----|------------|:-----:|:-----:|:----:|:----:|:----:|:----:|:----:|:----:|:----:|:----:|:----:|:----:|:----:|
|ACS2-NO-RL|Maze5-v0|48.15|1.89|4.66|0.12|4.47|5.06|0|100.00|0.00|212.90|6.97|212.87|6.96|
|ACS2-RL|Maze5-v0|47.93|2.40|4.62|0.09|4.49|4.86|0|100.00|0.00|214.50|6.06|214.20|5.75|
|BACS-NO-RL-1|Maze5-v0|48.41|5.44|4.65|0.10|4.48|4.96|0|100.00|0.00|212.07|6.16|212.03|6.13|
|BACS-RL-1|Maze5-v0|48.49|5.01|4.64|0.09|4.45|4.86|0|100.00|0.00|214.47|7.85|213.73|5.82|
|BACS-NO-RL-2|Maze5-v0|47.99|5.73|4.62|0.09|4.43|4.80|0|100.00|0.00|214.23|5.70|214.23|5.70|
|BACS-RL-2|Maze5-v0|46.72|4.84|4.62|0.08|4.47|4.81|0|100.00|0.00|214.13|6.65|214.13|6.65|

| | Maze | Exploration Avg | Exploration Std | Exploitation Avg | Exploitation Std | Best Exploitation | Worst Exploitation | Successful tries | Knowledge Avg | Knowledge Std | Population Avg | Population Std | Reliable Avg | Reliable Std |
|----|------------|:-----:|:-----:|:----:|:----:|:----:|:----:|:----:|:----:|:----:|:----:|:----:|:----:|:----:|
|ACS2-NO-RL|MazeA-v0|48.11|1.61|4.24|0.09|4.08|4.60|0|100.00|0.00|101.67|2.61|101.67|2.61|
|ACS2-RL|MazeA-v0|49.03|2.09|4.24|0.09|4.10|4.43|0|100.00|0.00|101.83|2.58|101.83|2.58|
|BACS-NO-RL-1|MazeA-v0|50.53|5.49|4.22|0.08|4.08|4.35|0|100.00|0.00|101.73|3.04|101.73|3.04|
|BACS-RL-1|MazeA-v0|49.03|4.99|4.21|0.07|4.08|4.37|0|100.00|0.00|102.07|2.22|102.07|2.22|
|BACS-NO-RL-2|MazeA-v0|48.44|4.64|4.24|0.06|4.07|4.35|0|100.00|0.00|101.60|2.18|101.60|2.18|
|BACS-RL-2|MazeA-v0|49.83|4.59|4.26|0.10|4.12|4.64|0|100.00|0.00|102.43|2.58|102.43|2.58|

| | Maze | Exploration Avg | Exploration Std | Exploitation Avg | Exploitation Std | Best Exploitation | Worst Exploitation | Successful tries | Knowledge Avg | Knowledge Std | Population Avg | Population Std | Reliable Avg | Reliable Std |
|----|------------|:-----:|:-----:|:----:|:----:|:----:|:----:|:----:|:----:|:----:|:----:|:----:|:----:|:----:|
|ACS2-NO-RL|MazeF1-v0|11.41|0.34|1.80|0.03|1.74|1.86|0|100.00|0.00|14.00|0.00|14.00|0.00|
|ACS2-RL|MazeF1-v0|11.40|0.31|1.81|0.03|1.75|1.87|0|100.00|0.00|14.00|0.00|14.00|0.00|
|BACS-NO-RL-1|MazeF1-v0|11.70|1.28|1.81|0.03|1.74|1.92|0|100.00|0.00|14.00|0.00|14.00|0.00|
|BACS-RL-1|MazeF1-v0|11.20|1.65|1.80|0.04|1.73|1.86|0|100.00|0.00|14.00|0.00|14.00|0.00|
|BACS-NO-RL-2|MazeF1-v0|11.85|1.61|1.81|0.03|1.76|1.87|0|100.00|0.00|14.00|0.00|14.00|0.00|
|BACS-RL-2|MazeF1-v0|11.54|1.29|1.81|0.03|1.74|1.86|0|100.00|0.00|14.00|0.00|14.00|0.00|

| | Maze | Exploration Avg | Exploration Std | Exploitation Avg | Exploitation Std | Best Exploitation | Worst Exploitation | Successful tries | Knowledge Avg | Knowledge Std | Population Avg | Population Std | Reliable Avg | Reliable Std |
|----|------------|:-----:|:-----:|:----:|:----:|:----:|:----:|:----:|:----:|:----:|:----:|:----:|:----:|:----:|
|ACS2-NO-RL|MazeF2-v0|19.83|0.68|2.50|0.04|2.43|2.60|0|100.00|0.00|18.00|0.00|18.00|0.00|
|ACS2-RL|MazeF2-v0|19.60|0.73|2.49|0.04|2.40|2.57|0|100.00|0.00|18.00|0.00|18.00|0.00|
|BACS-NO-RL-1|MazeF2-v0|21.12|2.72|2.50|0.04|2.40|2.57|0|100.00|0.00|18.00|0.00|18.00|0.00|
|BACS-RL-1|MazeF2-v0|21.01|2.47|2.49|0.06|2.37|2.60|0|100.00|0.00|18.00|0.00|18.00|0.00|
|BACS-NO-RL-2|MazeF2-v0|19.79|2.52|2.49|0.04|2.43|2.59|0|100.00|0.00|18.00|0.00|18.00|0.00|
|BACS-RL-2|MazeF2-v0|20.21|2.43|2.49|0.04|2.39|2.57|0|100.00|0.00|18.00|0.00|18.00|0.00|

| | Maze | Exploration Avg | Exploration Std | Exploitation Avg | Exploitation Std | Best Exploitation | Worst Exploitation | Successful tries | Knowledge Avg | Knowledge Std | Population Avg | Population Std | Reliable Avg | Reliable Std |
|----|------------|:-----:|:-----:|:----:|:----:|:----:|:----:|:----:|:----:|:----:|:----:|:----:|:----:|:----:|
|ACS2-NO-RL|MazeF3-v0|29.36|1.04|3.38|0.06|3.22|3.50|0|100.00|0.00|28.00|0.00|28.00|0.00|
|ACS2-RL|MazeF3-v0|29.67|0.95|3.41|0.13|3.25|4.01|0|100.00|0.00|28.00|0.00|28.00|0.00|
|BACS-NO-RL-1|MazeF3-v0|29.94|3.19|3.37|0.06|3.23|3.50|0|100.00|0.00|28.00|0.00|28.00|0.00|
|BACS-RL-1|MazeF3-v0|30.17|3.31|3.38|0.06|3.27|3.50|0|100.00|0.00|28.00|0.00|28.00|0.00|
|BACS-NO-RL-2|MazeF3-v0|30.46|2.49|3.38|0.06|3.28|3.51|0|100.00|0.00|28.00|0.00|28.00|0.00|
|BACS-RL-2|MazeF3-v0|30.10|2.51|3.37|0.04|3.27|3.47|0|100.00|0.00|28.00|0.00|28.00|0.00|

| | Maze | Exploration Avg | Exploration Std | Exploitation Avg | Exploitation Std | Best Exploitation | Worst Exploitation | Successful tries | Knowledge Avg | Knowledge Std | Population Avg | Population Std | Reliable Avg | Reliable Std |
|----|------------|:-----:|:-----:|:----:|:----:|:----:|:----:|:----:|:----:|:----:|:----:|:----:|:----:|:----:|
|ACS2-NO-RL|Woods1-v0|9.42|0.38|1.62|0.02|1.59|1.67|0|100.00|0.00|52.97|0.18|52.97|0.18|
|ACS2-RL|Woods1-v0|9.46|0.55|1.63|0.02|1.58|1.67|0|100.00|0.00|52.97|0.18|52.97|0.18|
|BACS-NO-RL-1|Woods1-v0|9.49|1.46|1.62|0.02|1.59|1.65|0|100.00|0.00|53.00|0.00|53.00|0.00|
|BACS-RL-1|Woods1-v0|9.55|1.19|1.62|0.02|1.57|1.66|0|100.00|0.00|53.00|0.00|53.00|0.00|
|BACS-NO-RL-2|Woods1-v0|9.35|1.36|1.62|0.02|1.58|1.67|0|100.00|0.00|53.00|0.00|53.00|0.00|
|BACS-RL-2|Woods1-v0|9.28|1.19|1.63|0.02|1.60|1.66|0|100.00|0.00|53.00|0.00|53.00|0.00|

| | Maze | Exploration Avg | Exploration Std | Exploitation Avg | Exploitation Std | Best Exploitation | Worst Exploitation | Successful tries | Knowledge Avg | Knowledge Std | Population Avg | Population Std | Reliable Avg | Reliable Std |
|----|------------|:-----:|:-----:|:----:|:----:|:----:|:----:|:----:|:----:|:----:|:----:|:----:|:----:|:----:|
|ACS2-NO-RL|Woods14-v0|70.35|1.76|9.49|0.22|8.95|9.83|0|100.00|0.00|35.00|0.00|35.00|0.00|
|ACS2-RL|Woods14-v0|70.55|1.70|9.49|0.21|9.14|10.03|0|100.00|0.00|35.00|0.00|35.00|0.00|
|BACS-NO-RL-1|Woods14-v0|69.05|4.89|9.52|0.23|9.03|9.94|0|100.00|0.00|35.00|0.00|35.00|0.00|
|BACS-RL-1|Woods14-v0|70.68|5.16|9.51|0.20|9.08|9.90|0|100.00|0.00|35.07|0.25|35.07|0.25|
|BACS-NO-RL-2|Woods14-v0|69.02|4.76|9.49|0.18|9.13|9.80|0|100.00|0.00|35.03|0.18|35.03|0.18|
|BACS-RL-2|Woods14-v0|71.27|5.50|9.46|0.24|8.95|9.84|0|100.00|0.00|35.00|0.00|35.00|0.00|