# Day 3: Binary Diagnostic

(From [Advent of Code 2021, day 3](https://adventofcode.com/2021/day/3))

## Part 1

In [1]:
# Libraries used:
import pandas as pd
import numpy as np

The submarine has been making some odd creaking noises, so you ask it to produce a diagnostic report just in case.

The diagnostic report (your puzzle input) consists of a list of binary numbers which, when decoded properly, can tell you many useful things about the conditions of the submarine. The first parameter to check is the power consumption.

You need to use the binary numbers in that list to generate two new binary numbers (called the **gamma rate** and the **epsilon rate**).

Each bit in the gamma rate can be determined by finding the most common bit in the corresponding position of all numbers in the diagnostic report. For example, given the following diagnostic report:

```txt
00100
11110
10110
10111
10101
01111
00111
11100
10000
11001
00010
01010
```

Considering only the first bit of each number, there are five `0` bits and seven `1` bits. Since the most common bit is `1`, the first bit of the gamma rate is `1`.

The most common second bit of the numbers in the diagnostic report is `0`, so the second bit of the gamma rate is `0`.

The most common value of the third, fourth, and fifth bits are `1`, `1`, and `0`, respectively, and so the final three bits of the gamma rate are `110`.

So, the gamma rate is the binary number `10110`, or `22` **in decimal**.

The epsilon rate is calculated in a similar way; rather than use the most common bit, the least common bit from each position is used. So, the epsilon rate is `01001`, or `9` **in decimal**. Multiplying the gamma rate (`22`) by the epsilon rate (`9`) gives `198`.
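The worked example above can be checked with a few lines of plain Python. This is a minimal sketch rather than the notebook's solution. `SAMPLE_REPORT` and `rates_from_report` are illustrative names, and the tie rule (a tie counts as `1`) is an assumption, since the puzzle text does not define ties for Part 1.

```python
from collections import Counter

# The example diagnostic report from the puzzle description.
SAMPLE_REPORT = [
    "00100", "11110", "10110", "10111", "10101", "01111",
    "00111", "11100", "10000", "11001", "00010", "01010",
]


def rates_from_report(report):
    """Return (gamma, epsilon) in decimal for a list of equal-length bit strings."""
    gamma_bits = []
    epsilon_bits = []
    # zip(*report) walks the report one bit position at a time.
    for column in zip(*report):
        counts = Counter(column)
        # Assumption: a tie is treated as "1" being the most common bit.
        most = "1" if counts["1"] >= counts["0"] else "0"
        least = "0" if most == "1" else "1"
        gamma_bits.append(most)
        epsilon_bits.append(least)
    return int("".join(gamma_bits), 2), int("".join(epsilon_bits), 2)


gamma, epsilon = rates_from_report(SAMPLE_REPORT)
print(gamma, epsilon, gamma * epsilon)  # 22 9 198
```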

Use the binary numbers in your input list to calculate the gamma rate and epsilon rate, then multiply them together. *What is the power consumption of the submarine?* (Be sure to represent your answer in decimal, not binary.)

### Solution

In [2]:
# Read in the input data
# (Set this to the correct path for you!)
## Read the values as strings (dtype=str); otherwise pandas parses them
## as integers and drops the leading zeros
data = pd.read_csv('input.txt', header=None, names=['binary_input'], dtype=str)
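To see why `dtype=str` matters, the sketch below reads two lines of the example report from an in-memory buffer, once with `dtype=str` and once with pandas' default type inference. The names `sample_text`, `as_str`, and `as_default` are illustrative only and are not used elsewhere in the notebook.

```python
import io

import pandas as pd

# Two lines from the example report, held in memory instead of a file.
sample_text = "00100\n11110\n"

# With dtype=str the digits are kept exactly as written.
as_str = pd.read_csv(io.StringIO(sample_text), header=None,
                     names=['binary_input'], dtype=str)

# Without it, pandas infers an integer column and the leading zeros are lost.
as_default = pd.read_csv(io.StringIO(sample_text), header=None,
                         names=['binary_input'])

print(as_str['binary_input'].tolist())      # ['00100', '11110']
print(as_default['binary_input'].tolist())  # [100, 11110]
```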

In [3]:
# Split each binary string into its digits and store them as an integer array
data['array'] = data['binary_input'].apply(lambda x: np.array([int(i) for i in x]))

# Take a peek
data.head()

| | binary_input | array |
|---|---|---|
| 0 | 111110110111 | [1, 1, 1, 1, 1, 0, 1, 1, 0, 1, 1, 1] |
| 1 | 100111000111 | [1, 0, 0, 1, 1, 1, 0, 0, 0, 1, 1, 1] |
| 2 | 011101111101 | [0, 1, 1, 1, 0, 1, 1, 1, 1, 1, 0, 1] |
| 3 | 011011010010 | [0, 1, 1, 0, 1, 1, 0, 1, 0, 0, 1, 0] |
| 4 | 001010001010 | [0, 0, 1, 0, 1, 0, 0, 0, 1, 0, 1, 0] |


In [4]:
# Separate each digit into a separate column
## Save parsed data into a new dataframe
parsed_data = pd.DataFrame(data['array'].to_list(), dtype=int)

# Take a peek
parsed_data.head()

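| | 0 | 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | 10 | 11 |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 1 | 1 | 1 | 1 | 1 | 0 | 1 | 1 | 0 | 1 | 1 | 1 |
| 1 | 1 | 0 | 0 | 1 | 1 | 1 | 0 | 0 | 0 | 1 | 1 | 1 |
| 2 | 0 | 1 | 1 | 1 | 0 | 1 | 1 | 1 | 1 | 1 | 0 | 1 |
| 3 | 0 | 1 | 1 | 0 | 1 | 1 | 0 | 1 | 0 | 0 | 1 | 0 |
| 4 | 0 | 0 | 1 | 0 | 1 | 0 | 0 | 0 | 1 | 0 | 1 | 0 |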


In [5]:
# Gamma rate calculation
def get_gamma_rate(parsed_data):
    gamma_rate_code = []
    for col in parsed_data.columns:
        # Get the MOST common value (a.k.a. mode) for each column
        ## (argmax returns the first maximum, so a tie resolves to 0)
        gamma_rate_code.append(str(np.bincount(parsed_data[col].values).argmax()))

    # Combine digits into one binary number
    gamma_rate_code = ''.join(gamma_rate_code)

    # Convert binary number to decimal
    ## We can use `int` and set the base to 2 for binary:
    gamma_rate = int(gamma_rate_code, 2)
    return gamma_rate

gamma_rate = get_gamma_rate(parsed_data)
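As a quick sanity check on the base-2 conversion, the sketch below converts the example gamma code `10110` with `int(..., 2)` and again by summing powers of two by hand. The variable names `bits`, `with_int`, and `by_hand` are illustrative only.

```python
bits = "10110"  # gamma code from the puzzle's worked example

# Built-in conversion: interpret the string as a base-2 number.
with_int = int(bits, 2)

# The same value by hand: each '1' contributes 2 ** (its distance from the right end).
by_hand = sum(int(b) << (len(bits) - 1 - i) for i, b in enumerate(bits))

print(with_int, by_hand)  # 22 22
```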

In [6]:
# Epsilon rate calculation
def get_epsilon_rate(parsed_data):
    epsilon_rate_code = []
    for col in parsed_data.columns:
        # Get the LEAST common value for each column
        ## (argmin returns the first minimum, so a tie resolves to 0)
        epsilon_rate_code.append(str(np.bincount(parsed_data[col].values).argmin()))

    # Combine digits into one binary number
    epsilon_rate_code = ''.join(epsilon_rate_code)

    # Convert binary number to decimal
    ## As previously mentioned, use `int` and set the base to 2 for binary:
    epsilon_rate = int(epsilon_rate_code, 2)
    return epsilon_rate

epsilon_rate = get_epsilon_rate(parsed_data)
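Because each epsilon bit is the opposite of the matching gamma bit, the epsilon rate can also be derived from the gamma rate by flipping every bit. The sketch below assumes that no column is a tie and that every column contains both `0` and `1`; otherwise the shortcut and the functions above can disagree. `epsilon_from_gamma` and `n_bits` are hypothetical names, not part of the notebook.

```python
def epsilon_from_gamma(gamma_rate, n_bits):
    """Flip the lowest n_bits of gamma_rate to obtain the epsilon rate."""
    mask = (1 << n_bits) - 1  # n_bits ones, e.g. 0b11111 for 5 bits
    return gamma_rate ^ mask  # XOR with all ones inverts each bit


# Worked example from the puzzle: gamma is 10110 (22) over 5 bits.
print(epsilon_from_gamma(22, 5))  # 9
```

For the real input, `n_bits` would be the number of columns in `parsed_data`, which is 12 in the preview shown earlier.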

In [7]:
gamma_rate * epsilon_rate

3969000

*Answer*: 3969000