# Advent of Code 2025


# Puzzle - part 1

**--- Day 2: Gift Shop ---**

You get inside and take the elevator to its only other stop: the gift shop.\
"Thank you for visiting the North Pole!" gleefully exclaims a nearby sign.
You aren't sure who is even allowed to visit the North Pole,\
but you know you can access the lobby through here,\
and from there you can access the rest of the North Pole base.

As you make your way through the surprisingly extensive selection,\
one of the clerks recognizes you and asks for your help.

As it turns out, one of the younger Elves was playing on a gift shop computer\
and managed to add a whole bunch of invalid product IDs to their gift shop database!\
Surely, it would be no trouble for you to identify the invalid product IDs for them, right?

They've even checked most of the product ID ranges already;\
they only have a few product ID ranges (your puzzle input) that you'll need to check.\
For example:
```
11-22,95-115,998-1012,1188511880-1188511890,222220-222224,
1698522-1698528,446443-446449,38593856-38593862,565653-565659,
824824821-824824827,2121212118-2121212124
```

(The ID ranges are wrapped here for legibility; in your input, they appear on a single long line.)

The ranges are separated by commas (`,`); each range gives its **first ID** and **last ID** separated by a dash (`-`).

Since the young Elf was just doing silly patterns,\
you can find the **invalid IDs** by looking for any ID which is made only of some sequence of digits repeated twice.\
So, `55` (`5` twice), `6464` (`64` twice), and `123123` (`123` twice) would all be invalid IDs.

None of the numbers have leading zeroes; `0101` isn't an ID at all. (`101` is a **valid** ID that you would ignore.)

Your job is to find all of the invalid IDs that appear in the given ranges. In the above example:

- `11-22` has two invalid IDs, `11` and `22`.
- `95-115` has one invalid ID, `99`.
- `998-1012` has one invalid ID, `1010`.
- `1188511880-1188511890` has one invalid ID, `1188511885`.
- `222220-222224` has one invalid ID, `222222`.
- `1698522-1698528` contains no invalid IDs.
- `446443-446449` has one invalid ID, `446446`.
- `38593856-38593862` has one invalid ID, `38593859`.
- The rest of the ranges contain no invalid IDs.

Adding up all the invalid IDs in this example produces `1227775554`.

**What do you get if you add up all of the invalid IDs?**

## Input

In [68]:
# Load the input file

with open('input - Day 2.txt', 'r') as file:
    file_input = file.read()


print(file_input[:30])

4077-5314,527473787-527596071,


## Input Formatting

Right now we just have a text file, but what we want is a list.\
We will split the text on the `,` character to extract each ID range.

In [69]:
from pprint import pprint

input_text = file_input.split(',')
pprint(input_text[:10])

print(f"\nThere are {len(input_text)} ID ranges")

['4077-5314',
 '527473787-527596071',
 '709-872',
 '2487-3128',
 '6522872-6618473',
 '69137-81535',
 '7276-8396',
 '93812865-93928569',
 '283900-352379',
 '72-83']

There are 38 ID ranges


In [70]:
print(f"Last line: {input_text[-1]}")
# Check the last line, if it's empty then let's remove it
if input_text[-1] == "":
    del input_text[-1]

print(len(input_text))

Last line: 53-71
38


Since we are dealing with ranges let's split them up into lower and upper bounds.\
As per the examples, the bounds are inclusive.

In [71]:
import re

id_ranges = [] # (int(lower_bound), int(upper_bound))

for id_range in input_text:
    id_range = re.split(r"-", id_range)

    lower_bound = id_range[0]
    upper_bound = id_range[1]

    id_ranges.append((int(lower_bound), int(upper_bound)))

pprint(id_ranges[:10])

[(4077, 5314),
 (527473787, 527596071),
 (709, 872),
 (2487, 3128),
 (6522872, 6618473),
 (69137, 81535),
 (7276, 8396),
 (93812865, 93928569),
 (283900, 352379),
 (72, 83)]


## Solution

First things first, we will definitely need every integer in each range.\
In other words, we need to expand the ranges into one list of IDs.

In [72]:
id_list = [] # unstructured

for lower_bound, upper_bound in id_ranges:

    ids_int = range(lower_bound, upper_bound + 1)
    id_list.extend(ids_int)

print(f"There are {len(id_list)} IDs")
pprint(id_list[:10])
pprint(id_list[10_000:10_005])

There are 2489768 IDs
[4077, 4078, 4079, 4080, 4081, 4082, 4083, 4084, 4085, 4086]
[527482549, 527482550, 527482551, 527482552, 527482553]


We need to split up ID numbers into equal sections.\
Then for each ID we need to check if the sections all match.\
For now we only need to worry about two sections.

For example, a number like `123123123123` can be split into:\
`123123` `123123`, or `123` `123` `123` `123`, or `12` `31` `23` `12` `31` `23`.\
In the first two cases all sections match, which is what would flag this ID as invalid.\
But in this puzzle they say "some sequence of digits repeated twice",\
so we only care about `123123` `123123`.

Note that an ID with an odd number of digits, like this `11011`, cannot be split evenly.\
As such all IDs with odd length are always valid by default.
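
As a small sketch of the rule described above, the check can be wrapped in a helper and tried on the IDs from the puzzle text. The name `is_doubled` is my own and is not used elsewhere in this notebook.

```python
def is_doubled(id_int: int) -> bool:
    """Return True when the ID is some digit sequence written exactly twice."""
    id_str = str(id_int)
    half, remainder = divmod(len(id_str), 2)

    # An odd number of digits cannot be split into two equal halves
    if remainder:
        return False

    return id_str[:half] == id_str[half:]


# IDs taken from the puzzle description
for candidate in (55, 6464, 123123, 101, 11011):
    print(candidate, is_doubled(candidate))
```

The first three IDs are reported as doubled, while `101` and `11011` are not because of their odd length.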


In [None]:
from tqdm.notebook import tqdm

valid_ids = []
invalid_ids = []

for id_int in tqdm(id_list, desc="Processing IDs"):

    # Let's convert it into a string so we can work with it
    id_str = str(id_int)

    # IDs with an odd number of digits are valid by default
    id_len = len(id_str)
    if id_len % 2 != 0:
        valid_ids.append(id_int)
        continue

    # Split in half
    first_half  = id_str[:id_len//2]
    second_half = id_str[id_len//2:]

    if first_half == second_half:
        invalid_ids.append(id_int)
        continue

    # If the two halves don't match up it's a valid ID
    valid_ids.append(id_int)

print(f"There are {len(invalid_ids)} invalid IDs")

Processing IDs:   0%|          | 0/2489768 [00:00<?, ?it/s]

In [56]:
print(f"The total sum of all invalid IDs is {sum(invalid_ids)}")

The total sum of all invalid IDs is 13108371860


# Puzzle - part 2

**--- Part Two ---**

The clerk quickly discovers that there are still invalid IDs in the ranges in your list.\
Maybe the young Elf was doing other silly patterns as well?

Now, an ID is invalid if it is made only of some sequence of digits repeated **at least** twice.\
So, `12341234` (`1234` two times), `123123123` (`123` three times), `1212121212` (`12` five times), and `1111111` (`1` seven times) are all invalid IDs.

From the same example as before:

- `11-22` still has two invalid IDs, `11` and `22`.
- `95-115` now has two invalid IDs, `99` and `111`.
- `998-1012` now has two invalid IDs, `999` and `1010`.
- `1188511880-1188511890` still has one invalid ID, `1188511885`.
- `222220-222224` still has one invalid ID, `222222`.
- `1698522-1698528` still contains no invalid IDs.
- `446443-446449` still has one invalid ID, `446446`.
- `38593856-38593862` still has one invalid ID, `38593859`.
- `565653-565659` now has one invalid ID, `565656`.
- `824824821-824824827` now has one invalid ID, `824824824`.
- `2121212118-2121212124` now has one invalid ID, `2121212121`.

Adding up all the invalid IDs in this example produces `4174379265`.

**What do you get if you add up all of the invalid IDs using these new rules?**


## Solution

Okay, we can immediately see that our assumption about IDs with an odd number of digits doesn't hold anymore.

My first instinct was to handle IDs with even and odd digit counts separately, but actually there is no need for that.\
All we need is a simple window whose size starts at `1` and goes up to half the digit length of the ID.

For each window size we check whether it divides the number of digits in our ID;\
there is no point in having a window of size `7` if the total number of digits is `15`.

For each window size we split the ID into sections. For a number like `123123` we get the following:
- window `1`: `1` `2` `3` `1` `2` `3`
- window `2`: `12` `31` `23`
- window `3`: `123` `123`

All we need to do is check whether all sections are the same.

Note that this also works for IDs with an odd number of digits, such as `11111` or `123123123`.
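
To make the window idea concrete, here is a minimal sketch that lists the usable window sizes for a given digit count and splits an ID into sections with plain slicing. The helper names `window_sizes` and `split_sections` are hypothetical and only serve this illustration.

```python
def window_sizes(id_len: int) -> list[int]:
    """Window sizes from 1 up to half the length that divide the length evenly."""
    return [size for size in range(1, id_len // 2 + 1) if id_len % size == 0]


def split_sections(id_str: str, size: int) -> list[str]:
    """Cut the string into consecutive sections of `size` characters."""
    return [id_str[i:i + size] for i in range(0, len(id_str), size)]


print(window_sizes(15))  # a window of size 7 is never tried for 15 digits

for size in window_sizes(len("123123")):
    print(size, split_sections("123123", size))
```

For `123123` this reproduces the three splits listed above, and only the window of size `3` yields identical sections.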

In [57]:
import textwrap

valid_ids = []
invalid_ids = []

for id_int in tqdm(id_list, desc="Processing IDs"):

    # Let's convert it into a string so we can work with it
    id_str = str(id_int)
    id_len = len(id_str)

    # We add 1 because range's upper bound is exclusive:
    # for a 12-digit ID, range(1, 12//2 + 1) is range(1, 7), which gives us 1, 2, 3, 4, 5, 6
    # The floor division is harmless for odd lengths because we check divisibility below
    for window_size in range(1, (id_len//2)+1):

        # Check if divisible
        if id_len % window_size != 0:
            continue

        # Handy function for splitting strings into even sections
        # (this works here because the ID contains only digits, no whitespace)
        sections = textwrap.wrap(id_str, window_size)

        # Note the use of set():
        # set(["123", "123", "123"]) == {"123"}
        # since sets can't contain duplicates
        if len(set(sections)) == 1:
            invalid_ids.append(id_int)
            break


print(f"There are {len(invalid_ids)} invalid IDs")

Processing IDs:   0%|          | 0/2489768 [00:00<?, ?it/s]

There are 780 invalid IDs


In [58]:
print(f"The total sum of all invalid IDs is {sum(invalid_ids)}")

The total sum of all invalid IDs is 22471660255
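
As a sanity check, the same logic can be run against the worked example from the puzzle text, whose expected sums are given in the puzzle itself (`1227775554` for part 1 and `4174379265` for part 2). This is only a sketch: the names `parse_ranges`, `is_repeated` and `sum_invalid`, and the `only_twice` flag, are my own and not part of the cells above, and it compares a repeated prefix instead of using `textwrap`.

```python
EXAMPLE = (
    "11-22,95-115,998-1012,1188511880-1188511890,222220-222224,"
    "1698522-1698528,446443-446449,38593856-38593862,565653-565659,"
    "824824821-824824827,2121212118-2121212124"
)


def parse_ranges(text: str) -> list[tuple[int, int]]:
    """Turn 'a-b,c-d' into [(a, b), (c, d)], skipping empty entries."""
    ranges = []
    for chunk in text.strip().split(","):
        if not chunk:
            continue
        lower, upper = chunk.split("-")
        ranges.append((int(lower), int(upper)))
    return ranges


def is_repeated(id_str: str, only_twice: bool) -> bool:
    """Part 1 rule when only_twice is True, part 2 rule otherwise."""
    id_len = len(id_str)
    for size in range(1, id_len // 2 + 1):
        if id_len % size != 0:
            continue
        if only_twice and size * 2 != id_len:
            continue
        # Repeating the first section must rebuild the whole ID
        if id_str[:size] * (id_len // size) == id_str:
            return True
    return False


def sum_invalid(text: str, only_twice: bool) -> int:
    """Add up every invalid ID found in the inclusive ranges."""
    return sum(
        id_int
        for lower, upper in parse_ranges(text)
        for id_int in range(lower, upper + 1)
        if is_repeated(str(id_int), only_twice)
    )


print(sum_invalid(EXAMPLE, only_twice=True))
print(sum_invalid(EXAMPLE, only_twice=False))
```

If the two printed values match the sums quoted in the puzzle text, the approach behaves as expected on the example before it is trusted on the real input.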
