# 1. Population Genetics Simulation

Create a program that simulates the allelic frequency in a finite diploid population for a certain number of generations.

The program takes as input the initial allele frequencies, the fitness of each genotype, the population size, and the number of generations. Because the simulation is stochastic, each run gives a different result. To show how the allele frequencies behave overall, your program should repeat the simulation many times for each parameter set and plot all the runs in a single graph. The number of simulations should also be set by the user. You can start your program from the variable definitions in the cell below.

Your program should output two graphs. The first should show the allele frequency at each generation, and the second should be a histogram of the final allele frequency reached in each simulation. Something like this:

![simulation](Sim1.png)

![histogram](Sim2.png)

Last year a student used this homework as the starting point for her project to create a population genetics simulator for BIOL040. You can see the final project here: http://dna.pomona.edu:5006/pop_gen_sim

In [22]:
# Packages
import numpy as np
import plotly.graph_objs as go

#Allele frequencies
initA = 0.50
inita = 1 - initA  # the two allele frequencies must sum to 1

#Fitnesses
fAA = 1
fAa = 1
faa = 1

#Pop Size
pop = 1000

#Number of generations
gen = 100

#Number of simulations
sim = 100
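
Since every one of these values comes from the user, it may help to check them before running anything. The following is only a minimal sketch: `check_params` is a hypothetical helper, not part of the assignment, and the specific rules (frequencies summing to 1, non-negative fitnesses with at least one positive, positive integer-like counts) are assumptions about what counts as valid input.

```python
import math


def check_params(initA, inita, fAA, fAa, faa, pop, gen, sim):
    """Hypothetical helper: raise ValueError if the parameters look unusable."""
    # Allele frequencies are proportions and must describe the whole gene pool
    if not (0 <= initA <= 1 and 0 <= inita <= 1):
        raise ValueError("allele frequencies must lie between 0 and 1")
    if not math.isclose(initA + inita, 1.0):
        raise ValueError("initA and inita must sum to 1")

    # Assumed rule: fitnesses are relative weights, so none may be negative
    # and at least one must be positive
    fitnesses = (fAA, fAa, faa)
    if min(fitnesses) < 0 or max(fitnesses) == 0:
        raise ValueError("fitnesses must be non-negative and not all zero")

    # Population size, generations, and simulations must all be at least 1
    if pop < 1 or gen < 1 or sim < 1:
        raise ValueError("pop, gen, and sim must all be at least 1")
```

Calling `check_params(initA, inita, fAA, fAa, faa, pop, gen, sim)` with the defaults above returns silently; a pair such as `initA = 0.6`, `inita = 0.5` would raise a `ValueError`.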

In [27]:
def pop_sim(initA, inita, fAA, fAa, faa, pop, gen, sim):
    """Run `sim` simulations of `gen` generations each and plot the frequency of allele A."""

    total_plot = []
    histo = []

    # simulations
    for n in range(sim):
        sim_A = initA
        sim_a = inita
        x_data = []
        y_data = []

        # generations
        for i in range(gen):
            AA = pop * (fAA * sim_A * sim_A)
            Aa = pop * (fAa * 2 * sim_A * sim_a)
            aa = pop * (faa * sim_a * sim_a)

            tot = AA + Aa + aa

            per_AA = AA / tot
            per_Aa = Aa / tot
            per_aa = aa / tot

            # Draw the three genotype counts jointly so they always sum to pop
            randAA, randAa, randaa = np.random.multinomial(pop, [per_AA, per_Aa, per_aa])

            # Each heterozygote carries one A and one a, so it counts half toward each allele
            freq_A = randAA + 0.5 * randAa
            freq_a = randaa + 0.5 * randAa
            sim_A = freq_A / (freq_A + freq_a)
            sim_a = freq_a / (freq_A + freq_a)

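            # Record the frequency reached after generation i + 1
            y_data.append(sim_A)
            x_data.append(i + 1)

        # Keep the final frequency of this run for the histogram
        histo.append(sim_A)
        gen_line = go.Scatter(x=x_data, y=y_data, mode='lines', name=f"Simulation {n}")
        total_plot.append(gen_line)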

    fig_allele = go.Figure(total_plot)
    fig_allele.update_layout(yaxis_range=[0,1],
        title= f"Allele Frequency of {sim} Simulations with Population {pop}",
        xaxis_title="# of Generations",
        yaxis_title="A Frequency")
    fig_allele.show()
    
    # histogram
    fig_his = go.Figure(go.Histogram(x=histo))
    fig_his.update_layout(xaxis_range=[0,1],
        title= f"Distribution of Allele Frequencies over {sim} Simulations",
        xaxis_title="A Frequency",
        yaxis_title="# of Simulations")
    fig_his.show()

pop_sim(initA, inita, fAA, fAa, faa, pop, gen, sim)
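
Because `pop_sim` only draws figures, its numbers are hard to inspect or test. The sketch below separates the simulation from the plotting: it applies the same per-generation steps (weight the expected genotype proportions by fitness, normalize, sample the next generation) but returns the trajectories as an array. The name `simulate_freqs`, the `seed` parameter, and the use of one joint multinomial draw per generation are assumptions made for illustration, not part of the assignment. It also assumes the fitness-weighted proportions never sum to zero.

```python
import numpy as np


def simulate_freqs(initA, fAA, fAa, faa, pop, gen, sim, seed=None):
    """Return an array of shape (sim, gen + 1) with the frequency of allele A.

    Column 0 holds the initial frequency; column g holds the frequency
    after g generations. Hypothetical helper for illustration only.
    """
    rng = np.random.default_rng(seed)
    freqs = np.empty((sim, gen + 1))

    for s in range(sim):
        p = float(initA)
        freqs[s, 0] = p
        for g in range(1, gen + 1):
            q = 1.0 - p
            # Expected genotype proportions (AA, Aa, aa) weighted by fitness
            weights = np.array([fAA * p * p, fAa * 2 * p * q, faa * q * q])
            # One joint draw keeps the three genotype counts summing to pop
            n_AA, n_Aa, n_aa = rng.multinomial(pop, weights / weights.sum())
            # Each heterozygote contributes half of its alleles to A
            p = (n_AA + 0.5 * n_Aa) / pop
            freqs[s, g] = p

    return freqs
```

With this in place, `simulate_freqs(initA, fAA, fAa, faa, pop, gen, sim)` gives one row per simulation for the line plot, and its last column, `freqs[:, -1]`, holds the values for the histogram. Passing a fixed `seed` makes a run repeatable.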