This is one of the Advent of Code 2020 solutions that I'm most proud of. I loved working through this problem, discovering new optimizations, and explaining it as clearly as I can.

Day 11 Write-Up

The prompt for this challenge can be found at Advent of Code Day 11.

Challenge Description

The input for this Advent of Code challenge consists of a grid of states, where each space is either Floor (.), an Empty seat (L), or an Occupied seat (#).

An example grid:

L.LL.LL.LL
LLLLLLL.LL
L.L.L..L..
LLLL.LL.LL
L.LL.LL.LL
L.LLLLL.LL
..L.L.....
LLLLLLLLLL
L.LLLLLL.L
L.LLLLL.LL

Fig. 1 - Example starting grid

The goal is to find a stable equilibrium of the grid state (i.e. the grid stops changing) and report the number of Occupied seats at that equilibrium. (Think of Conway's Game of Life.) The three heuristics for seat state changes between iterations are:

  • An Empty seat becomes Occupied if there are no adjacent Occupied seats.
  • An Occupied seat becomes Empty if there are four or more Occupied adjacent seats.
  • Floor never changes.

A seat is considered adjacent to another seat if it is located at one of the eight positions immediately up, down, left, right, or diagonal from the other.

So we keep finding the next state of the grid until it stops changing, and then count how many Occupied seats there are.

Approach

The challenge description mentioned that:

All decisions are based on the number of occupied seats adjacent to a given seat.

so the brute-force method for deriving the grid's next state would be to iterate through every seat and check whether it will change based on its adjacent seats, all at run time.

However, there's an opportunity for some pre-processing, since a seat's state change depends only on its adjacent seats and not on the rest of the grid. Given any 3x3 chunk of states, we can derive the next state of the center seat.

For example, given the following chunk,

#.#
##L
L.#

Fig. 2 - 3x3 chunk

we can derive that the center seat, currently Occupied (#), will become Empty (L) in the next state since there are four Occupied seats adjacent to it.

This means that we can iterate through every seat in a grid, take its adjacent seats, and determine what its next state will be.

What about edge seats?

For seats on the edges, we can add missing adjacent seats as Floor since Floor doesn't affect what the next state of a seat will be.

So a seat on the bottom-right corner of the grid can be represented like so (with the Empty seat at the bottom-right corner of the grid in the center of the chunk):

##?    ##.
.L? -> .L.
???    ...

Fig. 3 - Filling in corners

And a seat along the right side of the grid can be represented as:

#.?    #..
.L? -> .L.
##?    ##.

Fig. 4 - Filling in edges
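
To make the rule concrete, here's a small Python sketch (the names are mine, not from the repo) that derives the next state of a single seat from its 3x3 neighborhood, treating anything off the edge of the grid as Floor:

# Sketch: next state of the seat at (row, col); positions outside the grid
# count as Floor (".") so they never contribute to the occupied count.
def next_center_state(grid, row, col):
    seat = grid[row][col]
    if seat == ".":                       # Floor never changes
        return "."
    occupied = 0
    for dr in (-1, 0, 1):
        for dc in (-1, 0, 1):
            if dr == 0 and dc == 0:
                continue
            r, c = row + dr, col + dc
            if 0 <= r < len(grid) and 0 <= c < len(grid[0]) and grid[r][c] == "#":
                occupied += 1
    if seat == "L":                       # Empty: fills only with zero occupied neighbors
        return "#" if occupied == 0 else "L"
    return "L" if occupied >= 4 else "#"  # Occupied: empties out at 4+ occupied neighbors

Run on the center of Fig. 2, this returns L, matching the derivation above.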

Pre-processing time

Since seats can only be in three states and are completely dependent on their adjacent seats, we can note every 3x3 chunk to see what its center seat will be derived to. In total, if we wanted to derive every possible state chunk, that would be 3^9, or 19,683, state chunks to derive. (There are 9 spaces with 3 potential states each.) We can serialize (turn into a string) all these state chunks and put them into a mapping from state chunk to its next center seat. Here's what serializing a state chunk could look like:

#.#
##L -> #.###LL.#
L.#

Fig. 5 - Serializing a 3x3 chunk

We're just listing out every seat from left-to-right then top-to-bottom.
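
Serializing in that order is trivial if a chunk is stored as a list of row strings; a quick sketch:

# Sketch: flatten a chunk (a list of equal-length row strings) into one string,
# reading left-to-right within each row, rows from top to bottom.
def serialize(chunk):
    return "".join(chunk)

serialize(["#.#", "##L", "L.#"])  # -> "#.###LL.#", as in Fig. 5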

If we made a lookup table of these mappings, the entry for our chunk would look like the middle one below, since #.###LL.# means the center seat will turn into L:

...
#.###LL.L -> #

#.###LL.# -> L // Nice, we know really quick that the next center seat will be an "L"

#.###LLL. -> #
...

Fig. 6 - Mapping a serialized 3x3 chunk to its next state

So whenever we see a chunk that looks like this...

?????
?#.#?
?##L?
?L.#?
?????

Fig. 7 - Fig. 5 in context

...we know that the center seat will derive to an Empty.

The algorithm for finding the next state of a grid would then require iterating through every seat, finding its adjacent seats, serializing the chunk of seats, and then looking up the chunk in the mapping. Hm... That doesn't seem very much faster than the original brute-force approach, since we still have to grab all of a seat's adjacent seats. What could we improve?

Let me do you one better. 3^16 = 43,046,721

Rather than deriving all the possible 3x3 state chunks, why not do all the possible 4x4 state chunks? The principle is still the same - if we can split the grid into state chunks of 2x2 states and then get the chunk's surrounding adjacent seats, we can derive the next state of every 2x2 state chunk just by looking it up in a mapping.

For example, given this state chunk,

###.
.L.# (focus on this:) L.
.LL#                  LL
LL.#

Fig. 8 - 4x4 chunk

we can derive that the next 2x2 center will be:

L.
#L

Fig. 9 - Derived state chunk from the center of Fig. 8

We can do this derivation for every possible 4x4 state chunk, and serialize them into a mapping. The entry might look like:

###..L.#.LL#LL.# -> L.#L

Fig. 10 - Mapping a serialized 4x4 chunk to its derived 2x2 chunk

In total, the number of entries we would have would be 3^16, or 43,046,721; there are 3 possible states for each of 16 seats. That's a lot!

Let me do you one better?? 3^25 = 847,288,609,443...!

Woah, woah, hold up. Computers can have a lot of memory, but not that much memory (as of 2020 at least). Loading all the 4x4 state chunks (43 million) into memory takes approximately 1 GB. However, 3^25, or 847 billion, the number of possible 5x5 state chunks, is nearly 20,000x larger than 43 million, and each entry would be at least 25 + 9 bytes (a 5x5 chunk and its 3x3 derived chunk) rather than 16 + 4 bytes, which is about a 1.7x increase in entry size.
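
As a rough back-of-envelope check (counting only raw CSV characters - each line is the chunk, a comma, the derived chunk, and a newline - so real in-memory overhead would be higher):

3^16 = 43,046,721 entries      x ~(16 + 1 + 4 + 1) bytes per line = ~0.9 GB
3^25 = 847,288,609,443 entries x ~(25 + 1 + 9 + 1) bytes per line = ~30 TB

That lines up with the roughly 1 GB we actually get for the 4x4 table, and makes it clear why the 5x5 table is a non-starter without serious compression.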

Instead, the next places for optimization would probably be compressing the mappings (to reduce the 1.7x increase) and/or getting a couple terabytes of RAM. We could also store only one rotation of each state chunk and check whether a given chunk appears in the mapping in any of its rotations. For all possible 5x5 state chunks, that would cut the number of entries to just over a quarter of 847 billion (still terabytes of data).

In any case, we'll just stick with the 4x4 state chunks without compression for now.

Calculate all the things

We can use this line of Python to generate our table in the form of a CSV (comma-separated values):

$ python -c 'open("./state_chunk_map.csv", "w").write("\n".join(["{state},{derived_state}".format(state=state, derived_state="".join([(lambda state, pos: "." if state[pos] == "." else ("L" if 0 < len([1 for adj_pos in [pos-5, pos-4, pos-3, pos-1, pos+1, pos+3, pos+4, pos+5] if state[adj_pos] == "#"]) else "#") if state[pos] == "L" else ("L" if 4 <= len([1 for adj_pos in [pos-5, pos-4, pos-3, pos-1, pos+1, pos+3, pos+4, pos+5] if state[adj_pos] == "#"]) else "#"))(state, pos) for pos in [5, 6, 9, 10]])) for state in (lambda f: (lambda x: f(f, x)))(lambda gen_states, iter: [""] if iter == 0 else [state for list_of_states in [[char + next_char for next_char in gen_states(gen_states, iter - 1)] for char in ".L#"] for state in list_of_states])(16)]))'

I'd link the resulting file here, but GitHub isn't letting me upload it even with git-lfs, so you can generate it yourself if you want. (Also, a cool Where's Waldo type of thing: there's some anonymous recursion hiding in that one-liner, which is pretty cool.)

This will calculate all 43 million 4x4 state chunks, and their derived 2x2 state chunks, keeping in mind all the heuristics of the state changes. Locally, it takes me about 5.5 minutes to run, since it's generating about 1 GB worth of mappings and writing them to disk.
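
If the one-liner is a bit much, here's roughly the same generation unrolled into a readable script (a sketch; the neighbor offsets and center positions fall out of serializing a 4x4 chunk row-major):

from itertools import product

# In a row-major 4x4 serialization, position p's eight neighbors sit at
# p-5, p-4, p-3, p-1, p+1, p+3, p+4, p+5, and the center 2x2 is 5, 6, 9, 10.
NEIGHBOR_OFFSETS = (-5, -4, -3, -1, 1, 3, 4, 5)
CENTER_POSITIONS = (5, 6, 9, 10)

def derive_seat(chunk, pos):
    if chunk[pos] == ".":
        return "."
    occupied = sum(1 for off in NEIGHBOR_OFFSETS if chunk[pos + off] == "#")
    if chunk[pos] == "L":
        return "#" if occupied == 0 else "L"
    return "L" if occupied >= 4 else "#"

with open("./state_chunk_map.csv", "w") as out:
    for chars in product(".L#", repeat=16):          # all 3^16 possible 4x4 chunks
        chunk = "".join(chars)
        derived = "".join(derive_seat(chunk, pos) for pos in CENTER_POSITIONS)
        out.write(f"{chunk},{derived}\n")

Either way, the table only needs to be generated once; the solver just loads the CSV at startup.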

The top of our CSV will end up looking like this:

................,....
...............L,....
...............#,....
..............L.,....
..............LL,....
..............L#,....
..............#.,....
..............#L,....
..............##,....
.............L..,....

...

Fig. 11 - `cat state_chunk_map.csv | head -10`

Each line maps a serialized state chunk to its serialized derived state chunk (separated by the comma). We start by generating all the state chunks that look like this:

.... .... .... .... .... .... .... .... .... ....
.... .... .... .... .... .... .... .... .... ....
.... .... .... .... .... .... .... .... .... ....
.... ...L ...# ..L. ..LL ..L# ..#. ..#L ..## .L.. ...

Fig. 12 - State chunks represented in Fig. 11

and these first few entries all map to .... since the centers of the state chunks are all Floor and Floor does not change.

Later on, we get entries that look more like this, which will probably be used a lot more:

...

#.L#L#..L###..LL,#.##
#.L#L#..L###..L#,#.#L
#.L#L#..L###..#.,#.#L
#.L#L#..L###..#L,#.#L
                                               #.L#
#.L#L#..L###..##,#.#L <- this line represents: L#.. -> #.
                                               L###    #L
#.L#L#..L###.L..,#.##                          ..##
#.L#L#..L###.L.L,#.##
#.L#L#..L###.L.#,#.#L
#.L#L#..L###.LL.,#.##

...

Fig. 13 - Serialized 4x4 chunk to its representation and derived state chunk

The file ends up being about 1 GB, but now our program will be a matter of doing some mapping lookups rather than having to calculate every seat's adjacent seat states. Much faster! (One in-memory lookup for four seats rather than 9 lookups per seat, or 36 lookups for the equivalent four seats.)

Steps

Here is the algorithm we'll be following:

  1. (As a pre-processing step) Parse and load the mapping into a lookup table.

  2. Iterate through every 2x2 state chunk and serialize it along with its adjacent neighbors (and any additional Floor padding) to get a 4x4 state chunk.

  3. Look up the serialized state chunk in the lookup table and write the result to a new grid.

  4. Check to see if the new grid is the same as the old grid.

    1. If not, repeat from step 2.
  5. Count how many Occupied seats there are.
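
As a sketch of steps 2 through 5 in Python (illustrative names; assumes the grid is a list of equal-length strings already padded with Floor so both dimensions are even):

def load_lookup(path="./state_chunk_map.csv"):
    # Step 1: read "4x4 chunk,2x2 chunk" lines into a dict.
    with open(path) as f:
        return dict(line.rstrip("\n").split(",") for line in f)

def step(grid, lookup):
    rows, cols = len(grid), len(grid[0])
    new_rows = [list(row) for row in grid]
    for r in range(0, rows, 2):                      # top-left corner of each 2x2 chunk
        for c in range(0, cols, 2):
            # Step 2: serialize the surrounding 4x4 chunk, padding with Floor.
            chunk = "".join(
                grid[rr][cc] if 0 <= rr < rows and 0 <= cc < cols else "."
                for rr in range(r - 1, r + 3)
                for cc in range(c - 1, c + 3)
            )
            # Step 3: one lookup gives the next 2x2 chunk (positions 5, 6, 9, 10).
            derived = lookup[chunk]
            new_rows[r][c], new_rows[r][c + 1] = derived[0], derived[1]
            new_rows[r + 1][c], new_rows[r + 1][c + 1] = derived[2], derived[3]
    return ["".join(row) for row in new_rows]

def solve(grid, lookup):
    while True:
        nxt = step(grid, lookup)
        if nxt == grid:                                # Step 4: stable, nothing changed
            return sum(row.count("#") for row in nxt)  # Step 5: count Occupied seats
        grid = nxt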

And that's that! Oh, the joys of pre-processing. I love the idea of pre-processing because we're doing all the work beforehand. It's like driving a car with a 10 gallon tank as opposed to the same car with a 2 gallon tank. Sure, the fill-up time takes longer, but once we're on our way, there's no stopping us!

Next Steps

We can introduce some parallelism into this now. Since each 4x4 state chunk has enough information to calculate its derived 2x2 state chunk, we can start to delegate batches of calculations to threads that can run in parallel to each other.

For example, we can split out the top half of the grid to one thread and the bottom half to another. Or maybe have each row of state chunks split into its own thread. Then once each of the threads is complete, they circle back to the main thread and update the new grid with the derived 2x2 state chunks. This might not see a performance benefit with a grid of 90 x 92 like we have in the input, but we'll see a massive performance gain in grids with thousands of rows and columns.
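
In Python specifically this would want processes rather than threads (the GIL keeps CPU-bound threads from running in parallel), and the big lookup table is best loaded once per worker instead of shipped with every task. A rough sketch with concurrent.futures, using made-up names:

from concurrent.futures import ProcessPoolExecutor

_lookup = None  # populated once per worker process

def _init_worker(path):
    global _lookup
    with open(path) as f:
        _lookup = dict(line.rstrip("\n").split(",") for line in f)

def derive_band(args):
    # Derive one horizontal band of 2x2 chunks; r is the band's top row.
    grid, r = args
    rows, cols = len(grid), len(grid[0])
    derived = []
    for c in range(0, cols, 2):
        chunk = "".join(
            grid[rr][cc] if 0 <= rr < rows and 0 <= cc < cols else "."
            for rr in range(r - 1, r + 3)
            for cc in range(c - 1, c + 3)
        )
        derived.append(_lookup[chunk])
    return r, derived

def parallel_step(grid, pool):
    new_rows = [list(row) for row in grid]
    for r, band in pool.map(derive_band, [(grid, r) for r in range(0, len(grid), 2)]):
        for i, d in enumerate(band):
            c = 2 * i
            new_rows[r][c], new_rows[r][c + 1] = d[0], d[1]
            new_rows[r + 1][c], new_rows[r + 1][c + 1] = d[2], d[3]
    return ["".join(row) for row in new_rows]

# usage:
# with ProcessPoolExecutor(initializer=_init_worker,
#                          initargs=("./day11/state_chunk_map.csv",)) as pool:
#     grid = parallel_step(grid, pool)

For a 90 x 92 grid, the overhead of shipping the grid to workers likely swamps any gain, which matches the caveat above; it only pays off on much larger grids.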

Downsides to this approach

This approach worked really well because the heuristics of state changes were very localized to the seat that was changing. That is, we could grab a chunk of states and that was all we needed to find the next state. However, part two of the problem (which I won't be doing but instead crying over) includes a heuristic that looks beyond the immediately adjacent seats, out along each of the eight directions from the seat.

There are plenty of ways to preprocess for this as well, though - for example, adding an additional state, Unknown (denoted as ?), to mark a center seat for further derivation when its next state can't be determined from the chunk alone.

Result

Implementation Setup Notes

I ended up parsing the initial layout and then putting it into a grid padded with a row of Floor on top, a column of Floor on the left, and as many rows and columns of Floor on the bottom and right as necessary to make both the grid's row length and column length multiples of four.
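
A sketch of that padding step (assuming Floor padding as described earlier; the function name is made up):

# Sketch: pad the parsed layout with Floor (".") - one row on top, one column
# on the left, and enough on the bottom/right to make both dimensions
# multiples of four.
def pad_layout(layout):
    rows, cols = len(layout), len(layout[0])
    padded_rows = (rows + 1) + (-(rows + 1)) % 4
    padded_cols = (cols + 1) + (-(cols + 1)) % 4
    grid = ["." * padded_cols]                                 # top padding row
    grid += ["." + row.ljust(padded_cols - 1, ".") for row in layout]
    grid += ["." * padded_cols] * (padded_rows - rows - 1)     # bottom padding rows
    return grid

For the 90 x 92 puzzle input this yields the 92 x 96 grid reported in the output below.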

Then I followed the algorithm above. Much of the writing above is framed as parsing non-overlapping 2x2 state chunks, but I approached the implementation thinking about 4x4 overlapping chunks. I think it definitely helped with writing the implementation and slicing the 2D array cake.

Output

loading state chunk look up map into memory from: ./day11/state_chunk_map.csv
  expecting 43046721 entries...
  loaded 10 million entries
  loaded 20 million entries
  loaded 30 million entries
  loaded 40 million entries
  loaded 43046721 entries in 13.423943286s.
loading layout with 90 x 92 into 92 x 96 grid
  done.
iterating through grid states
  found 2126 occupied seats after 116 state changes in 76.049754ms.
