# --- Day 1: Historian Hysteria ---

The Chief Historian is always present for the big Christmas sleigh launch, but nobody has seen him in months! Last anyone heard, he was visiting locations that are historically significant to the North Pole; a group of Senior Historians has asked you to accompany them as they check the places they think he was most likely to visit.

As each location is checked, they will mark it on their list with a star. They figure the Chief Historian must be in one of the first fifty places they'll look, so in order to save Christmas, you need to help them get fifty stars on their list before Santa takes off on December 25th.

Collect stars by solving puzzles. Two puzzles will be made available on each day in the Advent calendar; the second puzzle is unlocked when you complete the first. Each puzzle grants one star. Good luck!

You haven't even left yet and the group of Elvish Senior Historians has already hit a problem: their list of locations to check is currently empty. Eventually, someone decides that the best place to check first would be the Chief Historian's office.

Upon pouring into the office, everyone confirms that the Chief Historian is indeed nowhere to be found. Instead, the Elves discover an assortment of notes and lists of historically significant locations! This seems to be the planning the Chief Historian was doing before he left. Perhaps these notes can be used to determine which locations to search?

Throughout the Chief's office, the historically significant locations are listed not by name but by a unique number called the location ID. To make sure they don't miss anything, The Historians split into two groups, each searching the office and trying to create their own complete list of location IDs.

There's just one problem: by holding the two lists up side by side (your puzzle input), it quickly becomes clear that the lists aren't very similar. Maybe you can help The Historians reconcile their lists?

For example:

3  -- 4

4  -- 3

2  -- 5

1  -- 3

3  -- 9

3  -- 3

Maybe the lists are only off by a small amount! To find out, pair up the numbers and measure how far apart they are. Pair up the smallest number in the left list with the smallest number in the right list, then the second-smallest left number with the second-smallest right number, and so on.

Within each pair, figure out how far apart the two numbers are; you'll need to add up all of those distances. For example, if you pair up a 3 from the left list with a 7 from the right list, the distance apart is 4; if you pair up a 9 with a 3, the distance apart is 6.

In the example list above, the pairs and distances would be as follows:

- The smallest number in the left list is 1, and the smallest number in the right list is 3. The distance between them is 2.
- The second-smallest number in the left list is 2, and the second-smallest number in the right list is another 3. The distance between them is 1.
- The third-smallest number in both lists is 3, so the distance between them is 0.
- The next numbers to pair up are 3 and 4, a distance of 1.
- The fifth-smallest numbers in each list are 3 and 5, a distance of 2.
- Finally, the largest number in the left list is 4, while the largest number in the right list is 9; these are a distance 5 apart.

To find the total distance between the left list and the right list, add up the distances between all of the pairs you found. In the example above, this is 2 + 1 + 0 + 1 + 2 + 5, a total distance of 11!
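
The pairing described above can be reproduced in a few lines of Python. This is only a small sketch on the example lists; the variable names (`left`, `right`, `pairs`, `distances`, `total`) are my own and are not part of the puzzle.

```python
# Example lists from the puzzle statement above.
left = [3, 4, 2, 1, 3, 3]
right = [4, 3, 5, 3, 9, 3]

# Sort each list, then pair the values position by position.
pairs = list(zip(sorted(left), sorted(right)))

# The distance within each pair is the absolute difference.
distances = [abs(a - b) for a, b in pairs]
total = sum(distances)

print(pairs)      # [(1, 3), (2, 3), (3, 3), (3, 4), (3, 5), (4, 9)]
print(distances)  # [2, 1, 0, 1, 2, 5]
print(total)      # 11
```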

Your actual left and right lists contain many location IDs. What is the total distance between your lists?

# Puzzle Attempt

To solve this puzzle, we will create two separate lists in Python. Lists are easy to sort and iterate over. The trickier part will be gathering the lists in the first place.

We are given the lists in one big body of text. So, my strategy for handling this will be to use a text file. Instead of copying and pasting the entire body in this code, I will paste it into a plain text file. I will then read the file using code and split up the lists into the 'left' list and the 'right' list.

I have saved the text file as puzzle_input_day_1.txt in the local directory.

In [1]:
# This is just to help display data.

import pprint

# This is how we can open a file and read what is in it.

with open('puzzle_input_day_1.txt') as file:
    puzzle_input = file.read().splitlines()

In [2]:
# Using read().splitlines() gives us a list in which each element
# is one line of the text document.
# This displays the first 10 entries in the list.

pprint.pprint(puzzle_input[:10])

['15244   50562',
 '81245   49036',
 '92897   21393',
 '63271   60643',
 '49672   33212',
 '92232   76048',
 '53492   58600',
 '92941   61161',
 '58509   86979',
 '28174   73806']


In [None]:
# There are probably multiple ways to gather the data from this list.
# I am going to initialize two empty lists to represent our
# 'left' list and 'right' list. Then, I will use a loop to iterate through the input.
# As I work through it, I will slice each line and add the pieces to the new lists.
# Note: the slices below assume every location ID is exactly five characters long.

left_list = []
right_list = []

for row in puzzle_input:
    left_list.append(row[:5])
    right_list.append(row[-5:])

In [4]:
# Let's check the 'left' list.

pprint.pprint(left_list[:10])

['15244',
 '81245',
 '92897',
 '63271',
 '49672',
 '92232',
 '53492',
 '92941',
 '58509',
 '28174']


In [5]:
# Let's check the 'right' list.

pprint.pprint(right_list[:10])

['50562',
 '49036',
 '21393',
 '60643',
 '33212',
 '76048',
 '58600',
 '61161',
 '86979',
 '73806']
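
The slicing above relies on every ID being exactly five characters wide. A variation that does not depend on the width is to split each line on whitespace with `str.split()`. The sketch below is just one way to do that; `parse_columns` and `sample_lines` are names I made up for illustration, and the sample rows are the first three lines shown in the output earlier.

```python
def parse_columns(lines):
    """Split each 'left right' line on whitespace and return two lists of ints."""
    left, right = [], []
    for line in lines:
        if not line.strip():
            continue  # skip blank lines, e.g. a trailing empty line in the file
        left_text, right_text = line.split()
        left.append(int(left_text))
        right.append(int(right_text))
    return left, right


sample_lines = ['15244   50562', '81245   49036', '92897   21393']
left_ids, right_ids = parse_columns(sample_lines)

print(left_ids)   # [15244, 81245, 92897]
print(right_ids)  # [50562, 49036, 21393]
```

Converting to `int` at this stage also means no conversion is needed later when calculating distances.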


## Moving On!

We now have what we need to continue. We can start to implement our plan of sorting the lists and performing calculations on them. The first step is simple: Python lists have a built-in `sort()` method that orders the values from smallest to largest. The next step is to work through both lists with a loop. At each step of the loop, we calculate the distance between the two numbers at the same position and save it to a new list. Finally, once the new list is complete, we can use Python's built-in `sum()` function to add the distances together.

In [9]:
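# Let's go ahead and sort both lists.
# The values are still strings here. String order matches numeric order
# only as long as every ID has the same number of digits.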

left_list.sort()
right_list.sort()

# We can print out the first few just to verify that the order has changed and that they are sorted.

pprint.pprint(left_list[:10])
print()
pprint.pprint(right_list[:10])

['10038',
 '10073',
 '10109',
 '10175',
 '10346',
 '10452',
 '10612',
 '10667',
 '10713',
 '10728']

['10033',
 '10076',
 '10095',
 '10100',
 '10108',
 '10157',
 '10161',
 '10257',
 '10573',
 '11213']


In [11]:
# The next part is simple. We convert each string to an integer with int(), subtract,
# and use Python's built-in abs() function so that a distance is never negative.

list_of_distances = []

for index in range(len(left_list)):
    distance_value = abs(int(left_list[index]) - int(right_list[index]))
    list_of_distances.append(distance_value)

In [12]:
pprint.pprint(list_of_distances[:10])

[5, 3, 14, 75, 238, 295, 451, 410, 140, 485]


In [13]:
# Now we just use the sum() function to get the sum of all the values in the list of distances

sum_of_distances = sum(list_of_distances)
print(f"The sum of all the distances is: {sum_of_distances}")

The sum of all the distances is: 1660292


### All together

This is what the code looks like all together, without the comments or the intermediate print statements.

In [14]:
with open('puzzle_input_day_1.txt') as file:
    puzzle_input = file.read().splitlines()

left_list = []
right_list = []

for row in puzzle_input:
    left_list.append(row[:5])
    right_list.append(row[-5:])

left_list.sort()
right_list.sort()

list_of_distances = []

for index in range(len(left_list)):
    distance_value = abs(int(left_list[index]) - int(right_list[index]))
    list_of_distances.append(distance_value)

sum_of_distances = sum(list_of_distances)
print(f"The sum of all the distances is: {sum_of_distances}")

The sum of all the distances is: 1660292
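
For comparison, here is a more compact sketch of the same idea. It is not the code I ran above: `total_distance` is a hypothetical helper, and it assumes the IDs have already been converted to integers. Sorting integers instead of strings matters once the IDs have different lengths, as the two `sorted()` calls show.

```python
def total_distance(left, right):
    """Sum the absolute differences between the sorted lists, pair by pair."""
    return sum(abs(a - b) for a, b in zip(sorted(left), sorted(right)))


# Strings sort character by character, so mixed lengths come out in the wrong order.
print(sorted(['9', '10', '100']))  # ['10', '100', '9']
print(sorted([9, 10, 100]))        # [9, 10, 100]

# The example lists from the puzzle statement.
print(total_distance([3, 4, 2, 1, 3, 3], [4, 3, 5, 3, 9, 3]))  # 11
```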


# --- Part Two ---
Your analysis only confirmed what everyone feared: the two lists of location IDs are indeed very different.

Or are they?

The Historians can't agree on which group made the mistakes or how to read most of the Chief's handwriting, but in the commotion you notice an interesting detail: a lot of location IDs appear in both lists! Maybe the other numbers aren't location IDs at all but rather misinterpreted handwriting.

This time, you'll need to figure out exactly how often each number from the left list appears in the right list. Calculate a total similarity score by adding up each number in the left list after multiplying it by the number of times that number appears in the right list.

Here are the same example lists again:

3  -- 4

4  -- 3

2  -- 5

1  -- 3

3  -- 9

3  -- 3

For these example lists, here is the process of finding the similarity score:

- The first number in the left list is 3. It appears in the right list three times, so the similarity score increases by 3 * 3 = 9.
- The second number in the left list is 4. It appears in the right list once, so the similarity score increases by 4 * 1 = 4.
- The third number in the left list is 2. It does not appear in the right list, so the similarity score does not increase (2 * 0 = 0).
- The fourth number, 1, also does not appear in the right list.
- The fifth number, 3, appears in the right list three times; the similarity score increases by 9.
- The last number, 3, appears in the right list three times; the similarity score again increases by 9.

So, for these example lists, the similarity score at the end of this process is 31 (9 + 4 + 0 + 0 + 9 + 9).
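
The walkthrough above can be checked with a short sketch on the example lists. It uses the built-in `list.count()` method; the names `contributions` and `similarity` are my own.

```python
# Example lists from the puzzle statement above.
left = [3, 4, 2, 1, 3, 3]
right = [4, 3, 5, 3, 9, 3]

# Each left value contributes itself times the number of times it appears on the right.
contributions = [value * right.count(value) for value in left]
similarity = sum(contributions)

print(contributions)  # [9, 4, 0, 0, 9, 9]
print(similarity)     # 31
```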

Once again consider your left and right lists. What is their similarity score?

## Part Two Attempt

My initial plan here is to 'brute force' it. I am thinking of going one by one through the 'left' list, setting up a counter for each number, and checking it against every single number in the 'right' list. This compares every left value with every right value, so it is probably inefficient, but I am unsure of a better way in the moment. After I solve this on my own I may ask ChatGPT for a solution and see if we get something more efficient.

For now, the first few steps will be the same. We will take in the text file as input and split it up into the two separate lists. This time, there is no need to sort the lists.

In [15]:
with open('puzzle_input_day_1.txt') as file:
    puzzle_input = file.read().splitlines()

left_list = []
right_list = []

for row in puzzle_input:
    left_list.append(row[:5])
    right_list.append(row[-5:])

In [None]:
# Let's initialize an empty list for our similarity scores

list_of_similarity_scores = []

# Next, we will set up a loop to check each number in the 'left'
# list against each one in the 'right' list.
# We use a variable that we call 'counter' to keep track
# of how many times the number shows up.

for left_value in left_list:
    counter = 0
    for right_value in right_list:
        if left_value == right_value:
            counter += 1
    similarity_score = counter * int(left_value)
    if similarity_score != 0:
        list_of_similarity_scores.append(similarity_score)

In [29]:
sum_of_similarity_scores = sum(list_of_similarity_scores)
print(f"The similarity score for the puzzle is: {sum_of_similarity_scores}")

The similarity score for the puzzle is: 22776016
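
The nested loop above walks the whole 'right' list once for every value in the 'left' list. One common alternative, which I did not use above, is to count the 'right' values once with `collections.Counter` and then look each left value up. The sketch below assumes the IDs are already integers; `similarity_score` is a hypothetical helper name.

```python
from collections import Counter


def similarity_score(left, right):
    """Multiply each left value by how often it appears in right, then add it all up."""
    right_counts = Counter(right)  # one pass over the right list
    # A Counter returns 0 for values it has never seen, so no special case is needed.
    return sum(value * right_counts[value] for value in left)


# The example lists from the puzzle statement.
print(similarity_score([3, 4, 2, 1, 3, 3], [4, 3, 5, 3, 9, 3]))  # 31
```

With this approach each list is traversed once, instead of once per left value.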
