GitHub - ImaginaryLogs/CSINTSY-sokobot2024: An artificial intelligence designed to solve sokoban maps.

An Introduction to Artificial Intelligence (CSINTSY) Project - Sokobot2024; an artificial intelligence solving a Japanese puzzle game, with a detailed analysis of its computing performance.

Legend

Here are the icons we will be using for the rest of the discussion for the sake of consistency.
 wall       goal       crate      player    crate on goal
                          

1.1 Brief description of Sokoban

Sokoban is a simple single-player puzzle game. The goal of the player is to rearrange the crates so that they occupy all the designated locations on the map. The difficulty arises from the fact that some moves cannot be undone: they lead to configurations from which there is no way back. Once the player is stuck in a bad configuration, restarting becomes the only option.

1.2 NP-hard problems

Sokoban is an NP-hard problem, meaning it is at least as hard as every problem in the NP complexity class (it is in fact known to be PSPACE-complete). Problems in NP are difficult to solve, but once a solution is found it is easy to verify. Sokoban behaves similarly in practice: finding a solution may take a considerable amount of effort, but replaying a proposed solution move by move easily verifies its validity, at a cost proportional to the length of the solution.

1.3 Running the program

In this section, we will gradually introduce the concepts underlying our approach and the different actions taken by the overall algorithm. Eventually, with domain language in place, we will be able to phrase our model succinctly:

$$ \text{"} States \text{ evolve over time and can be assigned a } priority \text{ } score \text{ based on the state and its } map \text{."} $$

Likewise, the algorithm that operates over these concepts can be worded in about just as many words:

$$ \text{"}Viable \text{ } states \text{ are queued } by \text{ } priority \text{ until a } winning \text{ } state \text{ is found or the queue empties."} $$

The statements above will become clearer as we flesh out the meaning of the highlighted words within the context of the domain (Sokoban).

2.1 Defining the `SokoState` and `SokoMap` classes

A Brief Note on Nomenclature

Almost all classes created for this project are named with the prefix 'Soko'. This is only to ensure proper namespacing and to prevent collisions with common Java language constructs such as 'Map' or 'State'. However, the classes in the utils/ folder do not follow this convention: they are not specific concepts tied to Sokoban and are only general helper classes.

In a game of Sokoban, some of its elements can move about while its other components stay fixed. Specifically, both the player and the crates can end up in different locations after moves have been played, but the goals and walls will never shift about. Thus, when performing a search across the Sokoban state space, it makes sense to enlist only those variable properties of the game within our "state objects": we define a "state object" as an entity that encapsulates the configuration of the movable game objects following a series of moves.

In the case of our implementation, the SokoState class handles the duty of representing states. The elements of the game that remain unchanged can then be accessed by all these state objects through a shared reference to some other class. The SokoMap class encapsulates those non-variable properties of the game.
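The split between variable and fixed game elements can be sketched roughly as follows. This is a minimal illustration and not the project's actual code: the names `MapSketch` and `StateSketch`, their methods, and the use of integer locations stored in sets are all assumptions made for the example.

```java
import java.util.Collections;
import java.util.HashSet;
import java.util.Set;

/** Fixed parts of a puzzle: walls and goals never move, so one instance is shared. */
final class MapSketch {
    private final Set<Integer> walls;
    private final Set<Integer> goals;

    MapSketch(Set<Integer> walls, Set<Integer> goals) {
        this.walls = Collections.unmodifiableSet(new HashSet<>(walls));
        this.goals = Collections.unmodifiableSet(new HashSet<>(goals));
    }

    boolean isWall(int location) { return walls.contains(location); }
    boolean isGoal(int location) { return goals.contains(location); }
    int goalCount() { return goals.size(); }
}

/** Movable parts only: the player and the crates after some series of moves. */
final class StateSketch {
    private final int player;
    private final Set<Integer> crates;
    private final MapSketch map; // shared reference; the map itself is never copied

    StateSketch(int player, Set<Integer> crates, MapSketch map) {
        this.player = player;
        this.crates = Collections.unmodifiableSet(new HashSet<>(crates));
        this.map = map;
    }

    int player() { return player; }

    /** A state wins when every goal holds a crate. */
    boolean isWinning() {
        int cratesOnGoals = 0;
        for (int crate : crates) {
            if (map.isGoal(crate)) cratesOnGoals++;
        }
        return cratesOnGoals == map.goalCount();
    }
}
```

Every state created during the search would hold the same `MapSketch` instance, so the cost of a new state is only the player location and the crate locations.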

2.2 Defining the state priority score

Every state can be assigned a priority score that tells us how important it is. More important states are checked first when performing a search for a valid solution. There are a few things that can affect this evaluation, but we have decided to use the following parameters:

Heuristic for the priority score

move_count, turn_move_count, and crate_move_count

good_crate_count

crate_goal_centroid_distance

move_count, turn_move_count, and crate_move_count

These three values form one of the main heuristics and deal with the nature of the moves taken by the player so far. move_count simply refers to how many moves were needed to get to the current state. By default, our approach prefers states which require fewer moves. However, when this heuristic is inverted, our algorithm actually finds solutions in less time for certain maps, despite the resulting solutions being unreasonably long. Note that the way move_count is treated directly corresponds to whether we're doing depth-first search (DFS) or breadth-first search (BFS). Prioritizing longer solutions is akin to performing DFS, while the opposite mirrors BFS.

The other two values, turn_move_count and crate_move_count, refer to the count of specific subsets of the moves performed by the player. The former counts how many moves represent a turn (a change in direction) of the player: the way the value is set up prioritizes solutions that have fewer turns in order to make sure that the player does not wander aimlessly. The latter value counts how many moves involve pushing a crate. Such moves are preferred as they "get things done" (although this may have the unintended consequence of encouraging the bumping of crates off of their goals even after they've been placed there).

All in all, these three values are summed up (with crate_move_count being negated first to ensure it contributes to reducing the cost of the state), although do note that the sum is weighted and the three are not treated equally. These weights may be considered hyperparameters of some sort.

good_crate_count

This just refers to the number of crates on goals for a given state. Our approach prefers states where more of the crates are already on top of goals, which makes sense, although it is important to note that some solutions require the momentary shifting of "good crates" to reach a final solution.

crate_goal_centroid_distance

Centroid is just a fancy word for center of mass. In this case, we're comparing the average location of the crates to that of the goals (in other words, their centroids). States with the crates closer to the goals are preferred.

Computing centroids is much more efficient than manually comparing crates and goals on a pair-wise basis. Thus, we do not attempt to do the latter (the former is $\mathcal{O}(n)$ while the latter is $\mathcal{O}(n^2)$ ).
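A minimal sketch of the centroid comparison is shown below. The class name `CentroidSketch`, the `{x, y}` array representation, and the choice of Euclidean distance between the two centroids are assumptions for illustration; the text above does not say which distance metric the project uses.

```java
final class CentroidSketch {
    /** Returns {meanX, meanY} of the given points; each point is {x, y}. One pass, O(n). */
    static double[] centroid(int[][] points) {
        double sumX = 0;
        double sumY = 0;
        for (int[] point : points) {
            sumX += point[0];
            sumY += point[1];
        }
        return new double[] { sumX / points.length, sumY / points.length };
    }

    /** Distance between the crate centroid and the goal centroid (Euclidean is an assumption). */
    static double centroidDistance(int[][] crates, int[][] goals) {
        double[] crateCentroid = centroid(crates);
        double[] goalCentroid = centroid(goals);
        return Math.hypot(crateCentroid[0] - goalCentroid[0], crateCentroid[1] - goalCentroid[1]);
    }
}
```

One trade-off of this shortcut is that a distance of zero does not imply that the crates sit on the goals: two different sets of points can share the same centroid. That is presumably why it is combined with the other heuristics rather than used alone.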

These three parameters are unified into a single value that represents the priority score of a given state. For added flexibility, coefficients were also defined which allow changing the "composition" of the priority score; that is, the three heuristics need not have equal weight, and the way the algorithm combines these heuristics can be modified. Again, these weights may be considered hyperparameters.
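The weighted combination described in this section might look something like the sketch below. Every constant here is a made-up placeholder, not one of the project's tuned weights, and the sketch assumes that a lower score means a state is explored earlier (so crates on goals subtract from the score and centroid distance adds to it).

```java
final class PrioritySketch {
    // Hypothetical weights inside the move heuristic (not the project's values).
    static final double W_MOVE = 1.0;
    static final double W_TURN = 0.5;
    static final double W_CRATE_MOVE = 2.0;

    // Hypothetical coefficients for combining the three heuristics.
    static final double K_MOVES = 1.0;
    static final double K_GOOD_CRATES = 10.0;
    static final double K_CENTROID = 3.0;

    /** Weighted sum of the move counts; crate pushes are negated so they reduce the cost. */
    static double moveCost(int moveCount, int turnMoveCount, int crateMoveCount) {
        return W_MOVE * moveCount + W_TURN * turnMoveCount - W_CRATE_MOVE * crateMoveCount;
    }

    /** Single priority value; lower is assumed to mean "explore sooner". */
    static double priority(int moveCount, int turnMoveCount, int crateMoveCount,
                           int goodCrateCount, double centroidDistance) {
        return K_MOVES * moveCost(moveCount, turnMoveCount, crateMoveCount)
                - K_GOOD_CRATES * goodCrateCount
                + K_CENTROID * centroidDistance;
    }
}
```

Flipping the sign of `W_MOVE` in a sketch like this corresponds to the inverted, DFS-like behavior mentioned earlier.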

2.3 Identifying viable states

Of course, evaluating the priority of a state only makes sense when the state we're scoring is viable. Some states are pointless to try and continue, such as when a crate gets stuck in some corner of the map. In general, the only times states become futile are when crates get stuck in some way. We define precise meanings for "stuck" to help rigorize this idea.

Types of stuck

Wall-stuck crates

Group-stuck crates

Wall-stuck crates

This is the easier case to identify. Any crate that ends up in a corner OR along a wall it cannot be pushed away from is wall-stuck. Identifying crates that are wall-stuck can be done by preprocessing the map and identifying the cells that lead to these scenarios; however, it is important to note that cells with goals on them are exceptions to this rule, since a crate that is stuck on a goal is already where it needs to be. We elaborate on our methods for preprocessing the map in the succeeding section.

Group-stuck crates

Crates that are stuck because they are surrounded by other crates that are also stuck are called group-stuck crates. The check here performs a recursive call through uninspected crates: any adjacent crates are asked recursively whether or not they are stuck. If at least one of the recursive calls identifies a liberated crate, then the entire group is not permanently stuck; once the non-stuck crate is moved, it is possible for the other crates to become movable again. Otherwise, the entire group is actually stuck and the state is a dead-end.

2.4 Preprocessing the map

Preprocessing the map is a much more involved process. Although the idea of extracting metadata from the map may seem expensive at first, the optimization this entails is worth the implementation. A large number of states can be pruned from the search space by doing this.

The preprocessing routine works by first marking all interior wall corners as unreachable spots on the map (this makes sense since crates that end up here can no longer be moved). The preprocessor then iterates through all possible pairs of corners that lie on the same x or y coordinates (but not both, since if both are equal then they are one and the same point). Cells along a line connecting such a pair of corners are also unreachable if that line is completely surrounded by an unbroken wall on at least one of its sides. The cases described in the figure below offer a clearer picture of what this should look like. The exact logic for checking this condition is a bit more involved (and hopefully the internal documentation suffices to outline the exact process), but the idea is no more complicated than that.
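The first step of the routine (marking corners) can be sketched as below. This covers only the corner marking, not the corner-to-corner line check, and it is an illustration rather than the project's preprocessor: the class name, the `'#'` wall character, and the `'.'` goal character are assumptions, and goal cells are skipped in line with the exception noted for wall-stuck crates.

```java
final class CornerSketch {
    /** Treats '#' as a wall; anything outside the grid also counts as a wall. */
    static boolean isWall(String[] grid, int row, int col) {
        if (row < 0 || row >= grid.length || col < 0 || col >= grid[row].length()) {
            return true;
        }
        return grid[row].charAt(col) == '#';
    }

    /** Marks every non-goal floor cell that has a wall on a vertical AND a horizontal side. */
    static boolean[][] deadCorners(String[] grid) {
        boolean[][] dead = new boolean[grid.length][];
        for (int row = 0; row < grid.length; row++) {
            dead[row] = new boolean[grid[row].length()];
            for (int col = 0; col < grid[row].length(); col++) {
                char tile = grid[row].charAt(col);
                if (tile == '#' || tile == '.') {
                    continue; // walls are not floor, and goals are exempt
                }
                boolean wallAboveOrBelow = isWall(grid, row - 1, col) || isWall(grid, row + 1, col);
                boolean wallLeftOrRight = isWall(grid, row, col - 1) || isWall(grid, row, col + 1);
                dead[row][col] = wallAboveOrBelow && wallLeftOrRight;
            }
        }
        return dead;
    }
}
```

A state that pushes a crate onto any cell marked `true` can then be discarded before it is ever scored.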

2.5 Avoiding repeat states

When searching the space of possible solutions, it is very much possible for the bot to encounter the same state more than once through a number of different paths. However, the feasibility of any state does not depend on the actions that were taken to get there; in other words, a state can be evaluated independent of the moves before it. With that in mind, it is then possible (and highly necessary) for us to be able to avoid states that have been visited. Visiting such states more than once can waste time at best and lead to infinite loops at worst.

The process by which repeat states are pruned is quite simple: every state is serialized into a single unique integer by taking into account only the locations of the player and the crates for that given state. After all, we do not need to check the locations of the walls and the goals since these are constant anyway (we've discussed this idea before). The locations of the movable components are combined by first representing each location with its own unique integer (this is implicitly done by the project code since this is how we've decided to handle locations) and then combining the different integers into one by means of a byte stream. Think of it as concatenating different strings by successively appending one after the other, except instead of character strings we're using byte strings (each integer is just a sequence of 4 bytes). Eventually, we end up with a long string of bytes. If we interpret this byte string as a Java BigInteger, then we've essentially just created a way to index our states without having to maintain a lot of overhead. Storing these indexes in a set optimizes lookup, and is the exact way we keep track of states in the code.
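The serialization described above can be sketched as follows. The class and method names are hypothetical, and sorting the crate locations before writing them is an assumption added here so that the same set of crates always produces the same key regardless of the order in which they are stored.

```java
import java.math.BigInteger;
import java.nio.ByteBuffer;
import java.util.Arrays;

final class StateKeySketch {
    /** Concatenates the player and crate locations (4 bytes each) into one BigInteger. */
    static BigInteger key(int player, int[] crates) {
        int[] sorted = crates.clone();
        Arrays.sort(sorted); // assumption: a canonical order makes equal states produce equal keys

        ByteBuffer buffer = ByteBuffer.allocate(4 * (sorted.length + 1));
        buffer.putInt(player);
        for (int crate : sorted) {
            buffer.putInt(crate);
        }
        // Signum 1 reads the bytes as a non-negative magnitude.
        return new BigInteger(1, buffer.array());
    }
}
```

With keys like these, a `HashSet<BigInteger>` is enough to track visited states: `add` returns `false` when the key is already present, which signals a repeat state. Because the number of crates is fixed for a given map, every key is built from the same number of bytes, so distinct configurations cannot collide.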

2.6 The `SokoSolver` class

The SokoSolver represents the driver class that manages the entire high-level structure of the algorithm. It contains the iterator that processes the queue of states until a solution is found. It also utilizes the SokoStateFactory class which helps us queue the valid states that are reachable from the current state.
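The high-level loop ("viable states are queued by priority until a winning state is found or the queue empties") can be sketched generically as below. This is not the actual SokoSolver: the class name, the functional parameters, and the convention that a lower score is polled first are assumptions, and the state type `S` is expected to implement `equals` and `hashCode` so that repeats can be detected.

```java
import java.util.Comparator;
import java.util.HashSet;
import java.util.List;
import java.util.Optional;
import java.util.PriorityQueue;
import java.util.Set;
import java.util.function.Function;
import java.util.function.Predicate;
import java.util.function.ToDoubleFunction;

final class BestFirstSketch {
    /** Polls states in priority order until one wins or the queue runs dry. */
    static <S> Optional<S> search(S start,
                                  Predicate<S> isWinning,
                                  Function<S, List<S>> viableSuccessors,
                                  ToDoubleFunction<S> score) {
        Comparator<S> byScore = Comparator.comparingDouble(score);
        PriorityQueue<S> queue = new PriorityQueue<>(byScore);
        Set<S> visited = new HashSet<>();

        queue.add(start);
        visited.add(start);

        while (!queue.isEmpty()) {
            S current = queue.poll(); // lowest score first
            if (isWinning.test(current)) {
                return Optional.of(current);
            }
            for (S next : viableSuccessors.apply(current)) {
                if (visited.add(next)) { // skip states that were already queued
                    queue.add(next);
                }
            }
        }
        return Optional.empty(); // queue emptied without finding a winning state
    }
}
```

In the Sokoban setting, `viableSuccessors` would play the role the text assigns to SokoStateFactory (producing reachable states that are not stuck), and `score` would be the priority score from section 2.2.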

<!-- ! // ! explain the actual algo here + pseudocode -->

Disclaimer This part is more of an addendum to the actual algorithm. It is not necessary to browse this portion, but for those who get their kicks out of the nerdy bits, do read on.

3.1 Storing coordinates more effectively

It is a very common implementation to store coordinates as a pair of integers. However, this offers some considerable drawbacks. When iterating over the neighbors of a given location, the pair-wise notation of coordinates necessitates an imperative approach of the following sort:

// Current coordinates
int currentXCoordinate;
int currentYCoordinate;

// Iterate over neighbors... big sad...
for(int i = 0; i < 4; i++) {
    switch(i) {

        // 0 - Neighbor on top
        case 0: doSomething(currentXCoordinate, currentYCoordinate - 1); break; 

        // 1 - Neighbor on right
        case 1: doSomething(currentXCoordinate + 1, currentYCoordinate); break;

        // 2 - Neighbor on bottom
        case 2: doSomething(currentXCoordinate, currentYCoordinate + 1); break;

        // 3 - Neighbor on left
        case 3: doSomething(currentXCoordinate - 1, currentYCoordinate); break;
    }
}

This is a bit too involved, and having to do this repeatedly clearly violates DRY principles and couples the code to the implementation of grid coordinates. However, by representing locations with a single integer, we can end up doing something like this instead:

// Current location
int currentLocation;

// Iterate over neighbors... woah, wtf?
for(int direction : Location.DIRECTIONS) {
    doSomething(currentLocation + direction);
}

Perhaps I'm a little biased, but the brevity this entails is highly attractive. How exactly do we accomplish this? First off, the x and y coordinates can be combined into a single integer by sharing bits. An integer has $32$ bits or $4$ bytes... given that the grids we will be considering will surely have no more than $65535$ tiles on each side, $16$ bits or $2$ bytes (or perhaps considerably less) are more than enough to represent either coordinate. Essentially, we can let the x coordinate occupy the $16$ leftmost bits of an integer, while the y coordinate occupies the remaining bits. We call $16$ the maskLength of the location (since when retrieving coordinates, we use a bit mask of that given length to isolate the values from the location integer).

With this in mind, we can then discuss how directions are implemented. To be able to write code with the declarative proclivities of the previous snippet, we define the following constants:

// These are stored in the location class inside a map called DIRECTIONS
public static final int NORTH = -1;
public static final int SOUTH = +1;
public static final int EAST = (+1 << maskLength);
public static final int WEST = (-1 << maskLength);

These simply represent different offset values associated with each coordinate. When adding NORTH to a location integer, the net effect is as if we modified y on its own by subtracting $1$ from it. The same can be said for the other constants: they add or subtract $1$ from either x or y to generate new locations adjacent to the current one along one of the cardinal axes. Voila!
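Putting the pieces of this section together, a self-contained sketch of the packing scheme might look like this. The class name, the helper names `encode`, `decodeX`, and `decodeY`, and the use of a plain array for `DIRECTIONS` are assumptions made for the example.

```java
final class LocationSketch {
    static final int MASK_LENGTH = 16;
    static final int MASK = (1 << MASK_LENGTH) - 1; // low 16 bits set

    // Offsets: y lives in the low bits, x in the high bits.
    static final int NORTH = -1;
    static final int SOUTH = +1;
    static final int EAST = +1 << MASK_LENGTH;
    static final int WEST = -1 << MASK_LENGTH;
    static final int[] DIRECTIONS = { NORTH, EAST, SOUTH, WEST };

    /** Packs x into the upper 16 bits and y into the lower 16 bits. */
    static int encode(int x, int y) {
        return (x << MASK_LENGTH) | (y & MASK);
    }

    static int decodeX(int location) {
        return location >>> MASK_LENGTH;
    }

    static int decodeY(int location) {
        return location & MASK;
    }
}
```

One caveat of this sketch: stepping NORTH from y = 0 borrows from the x bits and yields a location on a different column. The sketch assumes the playable area is enclosed by walls, so a neighbor lookup never actually steps off the grid.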

Of course, if we want our code to use this effectively, we have to make sure all our functions receive and process locations as integers... that is, location integers are now some sort of "primitive".

3.2 Separating state from constants

Not all components of Sokoban are movable. In fact, some stay quite constant for the entire duration of the game. Goals and walls are static, and repeating information about the map across instances of a State class would incur a significant overhead (in terms of space complexity for the most part). A more performant implementation would abstract a Map class that holds all unchanging aspects of the game and provides an interface for querying facts about these components.

In the case of our implementation, the SokoState class stores crate and player information and has responsibility for determining the statuses of the crates. To do this, it makes calls to one of the methods of the SokoMap class and injects the required dependencies; this way, the statuses of the crates can be evaluated with respect to the walls and goals of the map. Such an implementation also allows a much more effective separation of concerns.

3.3 Rationale of overarching design patterns

There are some other minor design considerations that found their way into the implementation of the project.

A factory was created for managing state creation. It felt a bit awkward having to include a number of different constructors for handling state creation logic within the State class itself, so a separate dedicated class was created entirely for this purpose. Initial states would be created by the SokoStateFactory, and so would adjacent states when traversing the state space.

A separate crate class was also necessitated by the algorithm. Including all crate-related logic within the State class would have considerably bloated the file. Crates have a lot of logic to check on their own, and maintaining information about the vacancy / occupation of their neighbors felt beyond the immediate responsibility of the State class. The SokoCrate class deals with stuckness checks, crate state management, and a number of other things to lift some burden off of the SokoState class.

4.1 Testing approach

To automate the testing process, a mock of the original Java files was created. This was then used to check and play the solutions produced by the bot.

It was easier to copy over the provided implementation of the game than to code one from scratch. It was also more visually appealing to watch.

4.2 The Test class

The Test class was made to create and isolate unit tests. It follows RAII: it instantiates the objects involved and releases them each time it is called, so no state is preserved between tests.

4.3 The problem with having a Java test driver

Apparently, calling a thread's .interrupt() method does nothing on its own if the method running in that thread never checks for interruption: the call only sets a flag and does not stop the running code. The only way to kill those threads would be by exiting the main program thread. It is thus necessary to start each test as a separate process.

4.4 Test Driver

The file tester.py represents the test driver.

4.5 Map generation and map corpuses

Maps are stored in a .txt file where each character represents a type of tile. The map generation code was lifted from here

5.1. Analysis Approach

This portion deals with the analysis of the data that tester.py produces. The bot was tested on 2750 different maps. Some maps were given by Ms. Shirley Chu from De La Salle University - Manila, while others were web-scraped from the internet.

Tests were run on a computer with the following specs:

| Type | Information |
| --- | --- |
| OS and Manufacturer | Windows 10, Microsoft Corporation |
| System Type | x64-based PC |
| Processor | AMD Ryzen 5 3500 6-Core Processor, 3600 MHz, 6 Core(s), 6 Logical Processor(s) |
| Installed Physical Memory (RAM) | 16.0 GB |
| Graphics Processing Unit | NVIDIA GeForce GTX 1660 SUPER |

5.2 Correlation Heat map of values

Correlation maps help identify which values are likely to correlate with each other. Using Spearman's rank correlation coefficient, we can infer which correlations may exist in the gathered data.

Spearman is used rather than Pearson because it also captures non-linear relationships, as long as they are monotonic (consistently increasing or consistently decreasing).
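To make the difference concrete, here is a small sketch of Spearman's coefficient computed as the Pearson correlation of the ranks. The analysis itself presumably relied on a statistics library; this hand-rolled version (with hypothetical names) only illustrates the idea. Tied values share the average of their ranks.

```java
import java.util.Arrays;

final class SpearmanSketch {
    /** Ranks start at 1; tied values receive the average of the ranks they span. */
    static double[] ranks(double[] values) {
        int n = values.length;
        Integer[] order = new Integer[n];
        for (int i = 0; i < n; i++) order[i] = i;
        Arrays.sort(order, (a, b) -> Double.compare(values[a], values[b]));

        double[] result = new double[n];
        int start = 0;
        while (start < n) {
            int end = start;
            while (end + 1 < n && values[order[end + 1]] == values[order[start]]) end++;
            double averageRank = (start + end) / 2.0 + 1.0;
            for (int k = start; k <= end; k++) result[order[k]] = averageRank;
            start = end + 1;
        }
        return result;
    }

    /** Plain Pearson correlation coefficient. */
    static double pearson(double[] x, double[] y) {
        double meanX = Arrays.stream(x).average().orElse(Double.NaN);
        double meanY = Arrays.stream(y).average().orElse(Double.NaN);
        double covariance = 0, varianceX = 0, varianceY = 0;
        for (int i = 0; i < x.length; i++) {
            double dx = x[i] - meanX;
            double dy = y[i] - meanY;
            covariance += dx * dy;
            varianceX += dx * dx;
            varianceY += dy * dy;
        }
        return covariance / Math.sqrt(varianceX * varianceY);
    }

    /** Spearman's rho: Pearson applied to the ranks instead of the raw values. */
    static double spearman(double[] x, double[] y) {
        return pearson(ranks(x), ranks(y));
    }
}
```

For a relationship such as y = x³, the ranks of x and y line up perfectly, so Spearman reports a perfect correlation while Pearson on the raw values reports something lower.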

To classify the strength of correlation, we will use the following terms:

Strong correlation. This is for values $v$ in the range $|v|\ge0.7$. Values with a strong correlation must be investigated, as they offer insight into which factors drive an intelligent system's success.
Moderate correlation. This is for values $v$ in the range $0.3 \le |v| < 0.7$. The relationship may be worth checking.
Weak correlation. This is for values $v$ in the range $0\le|v|<0.3$.

Based on the correlation map, we can choose which factors are worth analyzing and which information to inspect for each of them.

Here are some pairs of factors of the Sokoban bot to inspect:

Success Rate (Number of bot wins)
- Number of moves
- Time taken to solve
- New nodes created and nodes processed
- Crate Count
- Branching factor
- Map size
- Number of Blocks
Time Taken to solve
- Node Created
- Node Processed
- Crate Count
- Branching factor
- Map size
- Number of Blocks
Branching Factor
- Number of Crates
- Map Size

5.3 Success rate Factors

Success rate measures the likelihood for the sokobot to find a solution within 15.0 seconds in a given map.

Based on the correlation map, we have inferred the possible factors that have a strong correlation:

| Type | Reasoning |
| --- | --- |
| Number of moves | It is a given, since maps with an actual solution have more than zero moves. Thanks to data cleaning, we can guarantee that any timed-out or impossible map has a solution length of zero. Thus, a direct relationship is intuitive. |
| Time taken to solve | The more time a bot needs to solve a problem, the less likely it is that the map is solvable within the time limit. A more difficult map naturally takes longer to process. |
| New nodes created and nodes processed | The more states (nodes) the bot needs to create and process, the less likely it is to reach the goal within the time limit. |
| Crate Count | Increasing the crate count makes a map harder to solve, since there are more goals to keep track of in order to win. |

There are also factors with a moderate correlation that may be worth checking:

| Type | Reasoning |
| --- | --- |
| Branching factor | Increasing the branching factor means that there are more states to explore. The more computationally expensive the exploration, the less likely the bot is to find a solution within 15.0 seconds. |
| Map Size | Increasing the map size increases the number of tiles the program needs to account for. |
| Number of blocks | This may be an indicator that does not factor into the bot's success, as it generally increases with map size. |

5.3.1 Number of Moves

"It is a given, since maps with an actual solution have more than zero moves. Thanks to data cleaning, we can guarantee that any timed-out or impossible map has a solution length of zero. Thus, a direct relationship is intuitive."

The average number of moves in a solution is around 54, with a standard deviation of 84. However, this figure includes failed maps, which have a solution length of 0.

Number of moves
Mean               : 54.09
Standard Deviation : 84.26
Min, Max           : (0.00, 1326.00)

Accounting for successful solvable maps that fit within the 15.0 second limit, the average number of moves in a solvable map is around 122, with a standard deviation of 88.

Number of moves
Mean               : 122.04
Standard Deviation : 87.89
Min, Max           : (1.00, 1326.00)

The order of magnitude (base-10 logarithm) of the number of moves in solvable maps seems to fall mostly between 2.0 and 2.5, so the majority of solutions are between roughly 100 and 316 moves long.
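The move counts quoted here follow from converting the orders of magnitude back from a base-10 logarithm:

$$ 10^{2.0} = 100, \qquad 10^{2.5} = 10^{2} \cdot \sqrt{10} \approx 316 $$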

5.3.2 Time Taken to Solve

"The more time a bot needs to solve a problem, the less likely it is that the map is solvable within the time limit. A more difficult map naturally takes longer to process."

Any given map, whether it is solvable or not, takes an average of around 9.06 seconds, with a standard deviation of 6.9. This includes maps that the sokoban bot was not able to solve.

Time taken to solve
Mean               : 9.06
Standard Deviation : 6.94
Min, Max           : (0.06, 15.25)

However, the average time for the sokobot to find a solution in a solvable map is around 1.67 seconds, with a standard deviation of 3.0.

Time taken to solve
Mean               : 1.67
Standard Deviation : 3.01
Min, Max           : (0.06, 15.00)

O notation Likelihood in the relationship for time_taken and has_bot_win_numeric
Exponential   (r2, rmse): 1.0, 0.0
Linear        (r2, rmse): 1.0, 0.0
Logarithmic   (r2, rmse): 1.0, 0.0

The figure below shows that most maps where the sokobot failed are stuck at the 15.0 second time limit. The other few red dots represent maps for which there is no solution to be found.

5.3.3 Nodes Created and Processed

"The more states (nodes) the bot needs to create and process, the less likely it is to reach the goal within the time limit."

States are referred to as nodes.

Failed maps are interesting with respect to state creation and processing, for they fall within a certain range of values for nodes created and processed.

The logarithm of the number of child nodes created (more intuitively, its order of magnitude) seems to be around 6 to 6.5. Looking back at the original dataset, the failed maps produced between 1.0 and 3.1 million child nodes.

This is an indicator that some hard maps require more work to process before a solution is found. Since hard maps run up against the time limit, only a limited range of nodes can ever be produced. This may justify placing a bound on the number of nodes to create and process within the 15.0 second time limit.

New nodes created (Both solvable and unsolvable)
Mean               : 1060005.47
Standard Deviation : 833897.99
Min, Max           : (1.00, 2822984.00)

New nodes created (Solvable Only)
Mean               : 231375.22
Standard Deviation : 434399.90
Min, Max           : (2.00, 2363304.00)

The nodes processed by the sokobot for a given map range from 0.7 to 2.0 million. The order of magnitude is around 6 to 6.5.

Nodes processed (Both solvable and unsolvable)
Mean               : 704914.47
Standard Deviation : 568641.17
Min, Max           : (1.00, 2124784.00)

Nodes processed (Solvable Only)
Mean               : 173691.09
Standard Deviation : 322619.90
Min, Max           : (2.00, 1674763.00)

5.3.4 Number of Crates

"Increasing the crate count makes a map harder to solve, since there are more goals to keep track of in order to win."

The figure below displays the percentage of maps completed as the number of crates increases. The number of maps completed decreases sharply as the number of crates increases.

Number of crates (Both solvable and unsolvable)
Mean               : 7.88
Standard Deviation : 4.00
Min, Max           : (1.00, 16.00)

Number of crates (Solvable)
Mean               : 4.51
Standard Deviation : 1.81
Min, Max           : (1.00, 12.00)

Number of crates (Unsolvable)
Mean               : 10.56
Standard Deviation : 3.14
Min, Max           : (1.00, 16.00)

5.3.5 Branching factor

"Increasing the branching factor means that there are more states to explore. The more computationally expensive the exploration, the less likely the bot is to find a solution within 15.0 seconds."

It is worth noting that four child nodes are generated per iteration, counting both valid and invalid ones, while one node is processed per iteration.

The branching factor for any given map has a mean of 1.44 with a standard deviation of 0.22.

Based on the data below, unsolvable maps have a mean branching factor of 1.56 states produced per state processed, with a standard deviation of 0.19. Solvable maps have a mean branching factor of 1.29 states produced per state processed, with a standard deviation of 0.14.

Branching factor (Both solvable and unsolvable)
Mean               : 1.44
Standard Deviation : 0.22
Min, Max           : (1.00, 2.27)

Branching factor (Solvable)
Mean               : 1.29
Standard Deviation : 0.14
Min, Max           : (1.00, 2.12)

Branching factor (Unsolvable)
Mean               : 1.56
Standard Deviation : 0.19
Min, Max           : (1.00, 2.27)

5.3.6 Map size

"Increasing the map size increases the number of tiles the program needs to account for. However, this is only a moderate correlation."

Taking the logarithm of the map size has produced these interesting Gaussian distributions.

Going by the means in the tables below and reading the logarithm as base 10, most solvable maps have a size of about 10^4.37 or 23,442 tiles squared, while most unsolvable ones are around 10^5.00 or 100,000 tiles squared.

Logarithm of Map Size (Both solvable and unsolvable)
Mean               : 4.72
Standard Deviation : 0.51
Min, Max           : (2.71, 7.59)

Logarithm of Map Size (Solvable)
Mean               : 4.37
Standard Deviation : 0.37
Min, Max           : (2.71, 6.72)

Logarithm of Map Size (Unsolvable)
Mean               : 5.00
Standard Deviation : 0.44
Min, Max           : (3.74, 7.59)

If the map is solvable within 15.0 seconds, the mean branching factor is 1.29, compared with 1.56 for maps that were not solved. This means difficult maps tend to produce valid states for the program to explore faster than it can explore each one.

5.4 Time Taken to Solve

This is the amount of time needed to solve a given problem within the 15.0 second time limit.

Here are some factors to explore:

| Type | Reasoning |
| --- | --- |
| Node Created | This is an indicator that whatever map the sokoban solver is trying to solve has a wide state space. |
| Node Processed | Another indicator alongside node creation. |
| Crate Count | An increase in crates means there are more goal states to keep track of; this increases computation time and thus the time taken to solve. |
| Branching factor | This is an indicator of the difficulty of the map. A higher branching factor means more computation time to solve. |
| Map size | An increase in map size means there may be more possible states to keep track of. |
| Number of Blocks | This is an indicator since it includes crates, goal tiles, and impassable walls. |

5.4.1 Nodes Created and Processed

"This is an indicator that whatever map the sokoban solver is trying to solve has a wide state space. Node processed is another indicator alongside node creation."

As time increases, the number of states produced and processed also increases. The regression scores below support a linear relationship for both: the linear fits reach an r2 of about 0.98 to 0.99, well above the exponential fits.

O notation Likelihood in the relationship for time_taken and child_nodes_made
Exponential   (r2, rmse): 0.415277131593743, 1.8997376722239567
Linear        (r2, rmse): 0.9864438703026616, 41902.57308817826
Logarithmic   (r2, rmse): 0, 0

O notation Likelihood in the relationship for time_taken and nodes_expanded
Exponential   (r2, rmse): 0.4196290677490232, 1.8711048629894318
Linear        (r2, rmse): 0.9818120553561996, 37265.63670688951
Logarithmic   (r2, rmse): 0, 0

5.4.2 Crate Count

"An increase in crates means there are more goal states to keep track of; this increases computation time and thus the time taken to solve."

As crate count increases, the average time to complete the map increases. Near the end, it plateaus near 15.0 seconds.

O notation Likelihood in the relationship for Number of crates and Time taken to solve
Exponential   (r2, rmse): 0.5106553678813445, 0.9649800759471752
Linear        (r2, rmse): 0.17334848750464726, 2.1484334363306714
Logarithmic   (r2, rmse): -1.9730302174962753, 47448.10470512729

5.4.3 Map Size

"An increase in map size means there may be more possible states to keep track of."

Interestingly, the map size does correlate with the time taken. When plotted, the relationship appears to be linear.

After fitting several regression models, including one where the values are transformed with a natural log, the relationship may in fact be exponential with very small coefficients.

A plain linear regression looks very similar, though.

O notation Likelihood in the relationship for Time taken to solve and Map Size
Exponential   (r2, rmse): 0.27519752949436804, 0.43612937364812704
Linear        (r2, rmse): 0.04941436214593087, 106.18797722098664
Logarithmic   (r2, rmse): 0, 0

5.5 Branching Factor

The branching factor is the number of valid child nodes produced for each node the Sokoban solver processes.

The higher the branching factor is, the more computationally expensive it is.

$$B=\frac{\text{Node children created}}{\text{Nodes processed}}$$

The formula of the branching factor above shows that it is a ratio of nodes created over the nodes processed.
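As a small illustration of the formula, the sketch below computes the ratio for a single run and averages it across runs. The names are hypothetical, and averaging the per-map ratios is only one possible way to aggregate; the text does not spell out how the overall figures reported in this section were weighted.

```java
final class BranchingSketch {
    /** B = child nodes created / nodes processed, for a single map. */
    static double branchingFactor(long nodesCreated, long nodesProcessed) {
        if (nodesProcessed <= 0) {
            throw new IllegalArgumentException("nodesProcessed must be positive");
        }
        return (double) nodesCreated / nodesProcessed;
    }

    /** Unweighted mean of the per-map branching factors (one possible aggregate). */
    static double meanBranchingFactor(long[] created, long[] processed) {
        double sum = 0;
        for (int i = 0; i < created.length; i++) {
            sum += branchingFactor(created[i], processed[i]);
        }
        return sum / created.length;
    }
}
```

Note that the mean of per-map ratios generally differs from the ratio of the totals, since the latter gives more weight to maps that generate many nodes.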

Some factors to explore:

| Type | Reasoning |
| --- | --- |
| Number of Crates | Increasing the number of crates increases the requirements to find the goal. |
| Map Size | Increasing the map size may possibly increase the amount of valid states to explore. |

What is the estimated weighted average for the branching factor for sokobot? Overall Branching factor: 1.44

$$B_o = 1.440693$$

For tests that are solvable in 15.0 seconds, what is the branching factor? Wins Branching factor: 1.29

$$B_w = 1.291435$$

For tests that failed, what is the branching factor? Fail Branching factor: 1.56

$$B_l = 1.559535$$

5.5.1 Number of Crates

"Increasing the number of crates increases the requirements to find the goal."

Based on the regression scores, the relationship may be exponential or linear in nature, although every fit is weak (r2 of about 0.12 to 0.13).

O notation Likelihood in the relationship for Number of crates and Branching factor
Exponential   (r2, rmse): 0.12840344352475708, 0.09377821760111434
Linear        (r2, rmse): 0.12762641693957566, 0.13155513524590956
Logarithmic   (r2, rmse): 0.12158142992477827, 0.5692050120280475

5.5.2 Map Size

"Increasing the map size may possibly increase the amount of valid states to explore."

Interestingly, the linear regression scores indicate that there is no inherent relationship.

O notation Likelihood in the relationship for Branching factor and Map Size
Exponential   (r2, rmse): -0.03942294325699769, 0.29567537640684316
Linear        (r2, rmse): -0.14869646607435572, 25.00113457527222
Logarithmic   (r2, rmse): 0, 0

Other Observations

The time taken, number of crates, and states produced and processed for solutions correlate strongly with each other. When plotted, they produce an interesting scatter plot that depicts the inverse relationship of the number of crates a map has to the states produced, states processed, and time taken to solve.

6.1 Recommendations

Implementing macro-moves would likely improve the bot's performance significantly, though it would require some preprocessing for the Sokoban solver to identify rooms and tunnels.

One algorithm for identifying rooms and tunnels could run a breadth-first search from a valid open starting space and check whether adjacent tiles are impassable walls, with tunnels recognized by their characteristic width or height of 1. Once tunnels are recognized, macro-moves can be identified by traversing the ends of the tunnels.

The Sokoban solver may then refer to a graph generated from the rooms and tunnels and use the macro-moves whenever the player changes rooms through tunnels. This may be explored further in a future project.
