## Problem

After you submit a satisfactory program that calculates the probabilities described in the question, Dr. Michael tells you about the original task he had in mind:

Dr. Michael, with his advanced genotype detector, has detected the genotypes of a few of the colourful rabbits at a locus that he presumes determines their colour. Using this knowledge, he has planned a breeding experiment with several pedigree trees depicting the mating lines.

The experiment is as follows. First, $2^N$ individuals with known genotypes are grouped into $2^{N-1}$ groups of 2. Each group has one progeny. This is repeated with the progeny. Thus, in every generation the number of individuals is halved. At the end of $N$ generations, one individual remains.

Dr. Michael wants you to write a program that predicts the probability that the last individual of this epic lineage is `CC`, `cc`, or `cC/Cc`.

### Input
Input will be $N$ and a list of $2^N$ initial individual genotypes. Groups are to be made with consecutive members.
- N = 2
- `cc` `cc` `cc` `cc`
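
The input format above can be checked before any probabilities are computed. The following is a minimal sketch, not part of the original solution: the helper name `first_generation_pairs` is hypothetical, and accepting both `Cc` and `cC` as spellings of the heterozygote is an assumption based on the `cC/Cc` notation used in the problem.

```python
# Hypothetical helper (not from the original notebook): validate the input
# and form the first-generation groups from consecutive members.
def first_generation_pairs(N, genotypes):
    allowed = {"CC", "Cc", "cC", "cc"}  # assumption: 'cC' and 'Cc' are the same genotype
    if len(genotypes) != 2 ** N:
        raise ValueError(f"expected {2 ** N} genotypes, got {len(genotypes)}")
    unknown = [g for g in genotypes if g not in allowed]
    if unknown:
        raise ValueError(f"unknown genotypes: {unknown}")
    # Members 0 and 1 form the first group, 2 and 3 the second, and so on.
    return list(zip(genotypes[0::2], genotypes[1::2]))


print(first_generation_pairs(2, ["cc", "cc", "cc", "cc"]))
# → [('cc', 'cc'), ('cc', 'cc')]
```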

### Output
Three numbers giving the probabilities of the last individual being `CC`, `cc`, and `cC/Cc`, in that order.
- `0 1 0`

## Solution

A little bit of calculation makes the question a lot easier.  
We need a function that gives the genotype probabilities of a child given the genotype probabilities of both its parents.  
Then we apply it generation after generation until we reach the end of the line.

So first we need to derive a formula for the probability of the child.

Let us assume that parent `A` has genotype probabilities - `x`, `y`, `z` for `CC`, `Cc`, and `cc` respectively. Similarly, parent `B` has genotype probabilities - `a`, `b`, and `c`.

Let `p`, `q`, and `r` be the probabilities of genotypes `CC`, `Cc`, and `cc` in the child.  
The probability of getting genotype `CC` in the child consists of 3 terms.  
- `xa`, as the cross between two `CC`s always produces `CC`
- `0.25yb`, as the cross between two `Cc`s produces `CC` with probability 0.25
- `0.5(xb + ya)`, as the cross between a `CC` and a `Cc` (in either order) produces `CC` with probability 0.5

Thus, `p = xa + 0.5(xb + ya) + 0.25yb`

Similarly,  
- `r = zc + 0.5(zb + cy) + 0.25yb`
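
As a cross-check on `p` and `r`, both expressions factor into a product of the probabilities that each parent passes on the relevant allele, given that a `Cc` parent passes on either allele with probability 0.5:

$$
p = \left(x + \tfrac{y}{2}\right)\left(a + \tfrac{b}{2}\right), \qquad
r = \left(z + \tfrac{y}{2}\right)\left(c + \tfrac{b}{2}\right)
$$

Expanding the first product gives $xa + 0.5(xb + ya) + 0.25yb$ and expanding the second gives $zc + 0.5(zb + cy) + 0.25yb$, which agree term by term with the formulas above.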

The probability of genotype `Cc` has 2 parts:
- `xc + za`, as the cross between `CC` and `cc` always produces `Cc`
- `0.5(xb + ya + yc + bz + yb)`, as the crosses `CC-Cc`, `Cc-CC`, `Cc-cc`, `cc-Cc`, and `Cc-Cc` each produce `Cc` half of the time

Thus, `q = xc + za + 0.5(xb + ya + yc + bz + yb)`
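
One way to gain confidence in the three formulas is to compare them against a brute-force enumeration of every Punnett square. The sketch below is an illustration only: the names `cross`, `child_by_enumeration`, and `child_by_formula` are hypothetical, and the parent distributions `A` and `B` are arbitrary test values, not data from the problem.

```python
from itertools import product

GENOTYPES = ["CC", "Cc", "cc"]


def cross(g1, g2):
    """Offspring distribution [CC, Cc, cc] for two parents with known genotypes."""
    counts = {"CC": 0, "Cc": 0, "cc": 0}
    # Each parent contributes one of its two alleles with equal probability.
    for allele1, allele2 in product(g1, g2):
        # 'C' sorts before 'c', so 'cC' is normalised to 'Cc'.
        child = "".join(sorted(allele1 + allele2))
        counts[child] += 1
    return [counts[g] / 4 for g in GENOTYPES]


def child_by_enumeration(A, B):
    """Weight every parental genotype pair by its probability."""
    result = [0.0, 0.0, 0.0]
    for (i, g1), (j, g2) in product(enumerate(GENOTYPES), repeat=2):
        weight = A[i] * B[j]
        for k, prob in enumerate(cross(g1, g2)):
            result[k] += weight * prob
    return result


def child_by_formula(A, B):
    """The closed-form expressions for p, q, and r derived in the text."""
    x, y, z = A
    a, b, c = B
    p = x*a + 0.5*(x*b + y*a) + 0.25*y*b
    q = x*c + z*a + 0.5*(x*b + y*a + y*c + b*z + y*b)
    r = z*c + 0.5*(z*b + c*y) + 0.25*y*b
    return [p, q, r]


# Arbitrary parent distributions (each sums to 1), chosen only for the check.
A = [0.2, 0.5, 0.3]
B = [0.6, 0.1, 0.3]
enumerated = child_by_enumeration(A, B)
formula = child_by_formula(A, B)
assert all(abs(e - f) < 1e-12 for e, f in zip(enumerated, formula))
assert abs(sum(formula) - 1.0) < 1e-12  # p + q + r must be 1
```

If either assertion failed, one of the terms in `p`, `q`, or `r` would have been miscounted.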

Now we just need to translate the equations to Python, which is straightforward.  
Finally, we need a function that pairs up consecutive individuals and calls the probability function on each pair.

In [1]:
# Function to calculate the genotype probabilities of a child from its parents.
# a and b are [P(CC), P(Cc), P(cc)] for each parent; the result uses the same order.
def child_prob(a, b):
    CC = a[0]*b[0] + 0.5*(a[0]*b[1] + a[1]*b[0] + 0.5*a[1]*b[1])
    Cc = a[0]*b[2] + a[2]*b[0] + 0.5*(a[0]*b[1] + a[1]*b[0] + a[1]*b[1] + a[2]*b[1] + a[1]*b[2])
    cc = a[2]*b[2] + 0.5*(a[1]*b[2] + a[2]*b[1] + 0.5*a[1]*b[1])
    return [CC, Cc, cc]

In [2]:
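# Function to convert initial categorical genotypes to numerical genotypes
# Each genotype becomes a one-hot list [P(CC), P(Cc), P(cc)]
def convert_genotype(list_of_genotypes):
    res = []
    for genotype in list_of_genotypes:
        if genotype == 'CC':
            res.append([1, 0, 0])
        elif genotype in ('Cc', 'cC'):  # heterozygote, either spelling
            res.append([0, 1, 0])
        elif genotype == 'cc':
            res.append([0, 0, 1])
        else:
            # Fail loudly instead of silently dropping the entry
            raise ValueError(f"Unknown genotype: {genotype!r}")
    return res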

In [3]:
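# Function to generate progeny
# Returns [P(CC), P(Cc), P(cc)] for the last individual
# (note: the Output section lists them as CC, cc, Cc)
def predict_genotype(N, list_of_genotypes):
    genotypes = convert_genotype(list_of_genotypes)
    for _ in range(N):
        new_genotypes = []
        # Pair up consecutive individuals: (0, 1), (2, 3), ...
        for j in range(0, len(genotypes), 2):
            new_genotypes.append(child_prob(genotypes[j], genotypes[j+1]))
        genotypes = new_genotypes
    return genotypes[0]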

In [4]:
predict_genotype(2, ['CC', 'cc', 'Cc', 'CC'])

[0.375, 0.5, 0.125]
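
The result above is in `CC`, `Cc`, `cc` order, whereas the Output section of the problem asks for `CC`, `cc`, `cC/Cc`. The following self-contained sketch reorders the result and runs the sample input from the problem. It is a compact restatement rather than the notebook's own code: the names `ONE_HOT`, `cross`, `final_distribution`, and `in_problem_order` are hypothetical, and `cross` relies on the assumption that each parent passes on `C` with probability P(`CC`) + 0.5 P(`Cc`), which yields the same values as the formulas for `p`, `q`, and `r`.

```python
# One-hot genotype probabilities in [CC, Cc, cc] order.
# Assumption: 'cC' is treated as another spelling of 'Cc'.
ONE_HOT = {
    "CC": [1.0, 0.0, 0.0],
    "Cc": [0.0, 1.0, 0.0],
    "cC": [0.0, 1.0, 0.0],
    "cc": [0.0, 0.0, 1.0],
}


def cross(a, b):
    """Child distribution [CC, Cc, cc] from two parent distributions."""
    ca = a[0] + 0.5 * a[1]  # probability that parent a passes on C
    cb = b[0] + 0.5 * b[1]  # probability that parent b passes on C
    return [ca * cb, ca * (1 - cb) + (1 - ca) * cb, (1 - ca) * (1 - cb)]


def final_distribution(genotypes):
    """Halve the population each generation until one individual remains."""
    current = [ONE_HOT[g] for g in genotypes]
    while len(current) > 1:
        current = [cross(current[i], current[i + 1])
                   for i in range(0, len(current), 2)]
    return current[0]


def in_problem_order(dist):
    """Reorder [CC, Cc, cc] into the problem's CC, cc, cC/Cc order."""
    p_CC, p_Cc, p_cc = dist
    return [p_CC, p_cc, p_Cc]


print(in_problem_order(final_distribution(["cc", "cc", "cc", "cc"])))
# → [0.0, 1.0, 0.0]
```

This agrees with the sample output `0 1 0` given in the problem statement.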