# Part 1

--- Day 2: Gift Shop ---

You get inside and take the elevator to its only other stop: the gift shop. "Thank you for visiting the North Pole!" gleefully exclaims a nearby sign. You aren't sure who is even allowed to visit the North Pole, but you know you can access the lobby through here, and from there you can access the rest of the North Pole base.

As you make your way through the surprisingly extensive selection, one of the clerks recognizes you and asks for your help.

As it turns out, one of the younger Elves was playing on a gift shop computer and managed to add a whole bunch of invalid product IDs to their gift shop database! Surely, it would be no trouble for you to identify the invalid product IDs for them, right?

They've even checked most of the product ID ranges already; they only have a few product ID ranges (your puzzle input) that you'll need to check. For example:

11-22,95-115,998-1012,1188511880-1188511890,222220-222224,
1698522-1698528,446443-446449,38593856-38593862,565653-565659,
824824821-824824827,2121212118-2121212124

(The ID ranges are wrapped here for legibility; in your input, they appear on a single long line.)

The ranges are separated by commas (,); each range gives its first ID and last ID separated by a dash (-).

Since the young Elf was just doing silly patterns, you can find the invalid IDs by looking for any ID which is made only of some sequence of digits repeated twice. So, 55 (5 twice), 6464 (64 twice), and 123123 (123 twice) would all be invalid IDs.

None of the numbers have leading zeroes; 0101 isn't an ID at all. (101 is a valid ID that you would ignore.)

Your job is to find all of the invalid IDs that appear in the given ranges. In the above example:

    11-22 has two invalid IDs, 11 and 22.
    95-115 has one invalid ID, 99.
    998-1012 has one invalid ID, 1010.
    1188511880-1188511890 has one invalid ID, 1188511885.
    222220-222224 has one invalid ID, 222222.
    1698522-1698528 contains no invalid IDs.
    446443-446449 has one invalid ID, 446446.
    38593856-38593862 has one invalid ID, 38593859.
    The rest of the ranges contain no invalid IDs.

Adding up all the invalid IDs in this example produces 1227775554.
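As a quick sanity check of the worked example, the sketch below re-derives that total from the sample ranges. The names `SAMPLE`, `is_doubled`, and `sum_invalid` are illustrative assumptions, not part of the puzzle text. It brute-forces every ID, which is fine for ranges this small.

```python
# Sample ranges from the puzzle text, joined onto a single line.
SAMPLE = (
    "11-22,95-115,998-1012,1188511880-1188511890,222220-222224,"
    "1698522-1698528,446443-446449,38593856-38593862,565653-565659,"
    "824824821-824824827,2121212118-2121212124"
)

def is_doubled(number: int) -> bool:
    """Return True if the digits are some sequence repeated exactly twice."""
    digits = str(number)
    half, remainder = divmod(len(digits), 2)
    # An odd number of digits can never split into two equal halves.
    return remainder == 0 and digits[:half] == digits[half:]

def sum_invalid(line: str) -> int:
    """Sum every invalid ID found in a comma-separated line of ranges."""
    total = 0
    for id_range in line.strip().split(","):
        first, last = map(int, id_range.split("-"))
        # Both ends of each range are inclusive.
        total += sum(n for n in range(first, last + 1) if is_doubled(n))
    return total

print(sum_invalid(SAMPLE))  # 1227775554, matching the puzzle text
```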

What do you get if you add up all of the invalid IDs?


GOAL: find the invalid IDs, i.e. the numbers made up entirely of a sequence of digits repeated twice

INPUT: number ranges separated by commas, stored in a separate text file

STRATEGY: We can make a function that finds the repeated digits pretty easily.
Then we simply scan through the input, converting the ranges to Python ranges with another function, and store the invalid IDs in a list.
Finally we sum the list.

## Part 1 Solution

In [1]:
def check_id(id:str):
    # Returns True if the ID is invalid (its first half equals its second half).
    
    assert id.isdigit()
    n = len(id)

    # This check also works with an odd number of digits: the two slices
    # then have different lengths, so they can never compare equal.
    return id[:n//2] == id[n//2:]

def range_list(strrange:str):
    # Converts a range as given above (e.g. "11-22") into a list of integers. Note that the ranges include both ends.
    rlist = strrange.split('-')
    return list(range(int(rlist[0]), int(rlist[1])+1))

with open("D2 input.txt") as f:
    # The file contains only one line, so we can split it straight into a list of ranges.
    ranges = f.read().strip().split(',')

# Now we can use the functions above to construct the list of invalid IDs.
# Make a confusing one-liner for fun.
invalids = [number for id_range in ranges for number in range_list(id_range) if check_id(str(number))]

Check:

In [2]:
invalids

[246246,
 247247,
 248248,
 249249,
 250250,
 251251,
 252252,
 253253,
 254254,
 255255,
 256256,
 257257,
 258258,
 259259,
 260260,
 261261,
 262262,
 263263,
 264264,
 265265,
 266266,
 267267,
 268268,
 269269,
 270270,
 271271,
 272272,
 273273,
 274274,
 275275,
 276276,
 277277,
 278278,
 279279,
 280280,
 281281,
 282282,
 283283,
 284284,
 285285,
 798798,
 799799,
 800800,
 801801,
 802802,
 803803,
 804804,
 805805,
 806806,
 807807,
 808808,
 809809,
 810810,
 811811,
 812812,
 813813,
 814814,
 815815,
 816816,
 817817,
 818818,
 819819,
 820820,
 821821,
 822822,
 823823,
 824824,
 825825,
 826826,
 827827,
 828828,
 829829,
 830830,
 831831,
 832832,
 833833,
 834834,
 835835,
 836836,
 837837,
 838838,
 839839,
 840840,
 841841,
 842842,
 843843,
 844844,
 845845,
 846846,
 847847,
 848848,
 849849,
 850850,
 851851,
 852852,
 853853,
 854854,
 855855,
 856856,
 857857,
 858858,
 859859,
 860860,
 861861,
 862862,
 863863,
 864864,
 865865,
 866866,
 867867,
 868868,
 

In [3]:
print(sum(invalids))

54234399924


# Part 2

--- Part Two ---

The clerk quickly discovers that there are still invalid IDs in the ranges in your list. Maybe the young Elf was doing other silly patterns as well?

Now, an ID is invalid if it is made only of some sequence of digits repeated at least twice. So, 12341234 (1234 two times), 123123123 (123 three times), 1212121212 (12 five times), and 1111111 (1 seven times) are all invalid IDs.

From the same example as before:

    11-22 still has two invalid IDs, 11 and 22.
    95-115 now has two invalid IDs, 99 and 111.
    998-1012 now has two invalid IDs, 999 and 1010.
    1188511880-1188511890 still has one invalid ID, 1188511885.
    222220-222224 still has one invalid ID, 222222.
    1698522-1698528 still contains no invalid IDs.
    446443-446449 still has one invalid ID, 446446.
    38593856-38593862 still has one invalid ID, 38593859.
    565653-565659 now has one invalid ID, 565656.
    824824821-824824827 now has one invalid ID, 824824824.
    2121212118-2121212124 now has one invalid ID, 2121212121.

Adding up all the invalid IDs in this example produces 4174379265.
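The Part 2 example total can be checked the same way. The sketch below swaps in a regular expression with a backreference for the repetition test; that is a different technique from the factor-based check used in the solution further down, and the names `SAMPLE`, `REPEATED`, `is_repeated`, and `sum_invalid` are illustrative assumptions.

```python
import re

# Sample ranges from the puzzle text, joined onto a single line.
SAMPLE = (
    "11-22,95-115,998-1012,1188511880-1188511890,222220-222224,"
    "1698522-1698528,446443-446449,38593856-38593862,565653-565659,"
    "824824821-824824827,2121212118-2121212124"
)

# A group of digits followed by one or more further copies of that same group.
REPEATED = re.compile(r"(\d+)\1+")

def is_repeated(number: int) -> bool:
    """Return True if the digits are some sequence repeated at least twice."""
    # fullmatch requires the pattern to cover the entire string.
    return REPEATED.fullmatch(str(number)) is not None

def sum_invalid(line: str) -> int:
    """Sum every invalid ID found in a comma-separated line of ranges."""
    total = 0
    for id_range in line.strip().split(","):
        first, last = map(int, id_range.split("-"))
        # Both ends of each range are inclusive.
        total += sum(n for n in range(first, last + 1) if is_repeated(n))
    return total

print(sum_invalid(SAMPLE))  # 4174379265, matching the puzzle text
```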

What do you get if you add up all of the invalid IDs using these new rules?


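STRATEGY: We need a new check function that tests every possible repetition.
- A repeated sequence must have a length that divides the ID's length, so we need the factors of the length. Make another function for this. We can probably brute force it, since each ID has a limited length.
- Strictly, only the prime factors are needed as repetition counts: an ID made of k copies of a block is also made of p copies of a longer block for any prime p dividing k. The code below checks every factor anyway, which is simpler and still correct.
- Once we know the factors of the length, we can easily check each ID for invalid patterns.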
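The prime-factor idea from the list above can be sketched as follows. This is a minimal sketch rather than the solution in the next cell (which checks every factor of the length), and `prime_factors` and `is_repeated` are hypothetical helper names. It relies on the fact that an ID made of k copies of a block is also made of p copies of a longer block for any prime p dividing k, so only prime repetition counts need to be tried.

```python
def prime_factors(n: int) -> list[int]:
    """Return the distinct prime factors of n in ascending order."""
    factors = []
    p = 2
    while p * p <= n:
        if n % p == 0:
            factors.append(p)
            # Strip every copy of p so it is only recorded once.
            while n % p == 0:
                n //= p
        p += 1
    if n > 1:
        # Whatever remains is itself a prime factor.
        factors.append(n)
    return factors

def is_repeated(digits: str) -> bool:
    """Return True if digits consists of one block repeated at least twice."""
    n = len(digits)
    # For each prime p dividing the length, test p copies of the first n // p digits.
    # A single digit has no prime factors of its length, so it is never invalid.
    return any(digits[: n // p] * p == digits for p in prime_factors(n))

for sample in ["12341234", "123123123", "1212121212", "1111111", "101"]:
    print(sample, is_repeated(sample))
```

For a 10-digit ID this tests only block lengths 5 and 2, whereas the full factor list also tests block length 1. Any ID caught by the extra check is already caught by one of the prime ones.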

In [46]:
def factorize(n:int):
    # Find all of the factors of a number, except for the trivial factors 1 and n.
    factors = []

    # No proper factor is greater than half of the number, since 2 is the lowest nontrivial factor.
    for i in range(2, n//2 + 1):  # start at 2 to skip 0 and 1
        if n % i == 0:
            factors.append(i)
    return factors

def check_id_factors(id:str):
    # Returns True if the ID is invalid, checking the ID against every possible repeated sequence.
    # Assumes that an invalid ID has to be made up entirely of the repeats.
    
    assert id.isdigit()
    n = len(id)
    if n == 1:  # we need at least two repetitions
        return False
    # We also need 1 among the candidate block lengths to catch invalid IDs like '11'.
    factors = factorize(n)
    factors.append(1)

    for factor in factors:
        if id[:factor]*(n//factor)==id:
            return True
    return False

# Reuse range_list from above.
with open("D2 input.txt") as f:
    # The file contains only one line, so we can split it straight into a list of ranges.
    ranges = f.read().strip().split(',')

# Now we can use the functions above to construct the list of invalid IDs.
# Make a confusing one-liner for fun.
invalids = [number for id_range in ranges for number in range_list(id_range) if check_id_factors(str(number))]

In [47]:
print(invalids)

[246246, 247247, 248248, 249249, 250250, 251251, 252252, 252525, 253253, 254254, 255255, 256256, 257257, 258258, 259259, 260260, 261261, 262262, 262626, 263263, 264264, 265265, 266266, 267267, 268268, 269269, 270270, 271271, 272272, 272727, 273273, 274274, 275275, 276276, 277277, 278278, 279279, 280280, 281281, 282282, 282828, 283283, 284284, 285285, 797979, 798798, 799799, 800800, 801801, 802802, 803803, 804804, 805805, 806806, 807807, 808080, 808808, 809809, 810810, 811811, 812812, 813813, 814814, 815815, 816816, 817817, 818181, 818818, 819819, 820820, 821821, 822822, 823823, 824824, 825825, 826826, 827827, 828282, 828828, 829829, 830830, 831831, 832832, 833833, 834834, 835835, 836836, 837837, 838383, 838838, 839839, 840840, 841841, 842842, 843843, 844844, 845845, 846846, 847847, 848484, 848848, 849849, 850850, 851851, 852852, 853853, 854854, 855855, 856856, 857857, 858585, 858858, 859859, 860860, 861861, 862862, 863863, 864864, 865865, 866866, 867867, 868686, 868868, 869869, 870870,

In [33]:
t = factorize(3)
t.append(1)

In [48]:
print(sum(invalids))

70187097315

