Session 2 Overview

Outline:
1. GC and AT Composition of DNA
2. Melting point of DNA
3. Nucleotide Molecular Weight

# GC and AT Composition of DNA
    GC-content (or guanine-cytosine content) is the percentage of nitrogenous bases in a DNA or RNA molecule that are either guanine (G)
    or cytosine (C). For DNA, the AT-content is the remainder: 100% minus the GC-content.

    Usefulness:
    • In polymerase chain reaction (PCR) experiments, the GC-content of short oligonucleotides known as primers is often used to predict their annealing temperature to the template DNA.
    • A higher GC-content indicates a relatively higher melting temperature.
    • DNA with low GC-content is less stable than DNA with high GC-content.
    • DNA with high GC-content can be difficult to amplify by PCR, because it is hard to design a primer long enough to provide good specificity.

# Melting point of DNA
    • A higher GC-content means a higher melting temperature (Tm).
    • Tm_Wallace: 'Rule of thumb' based only on base counts.
    • Tm_GC: Empirical formulas based on GC-content. Salt and mismatch corrections can be included.
    • Tm_NN: Calculation based on nearest-neighbor thermodynamics. Several tables for DNA/DNA, DNA/RNA and RNA/RNA hybridizations are included. Corrections for mismatches, dangling ends, salt concentration and other additives are available.
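
As a rough illustration of the 'rule of thumb' above: the Wallace rule is commonly stated as 4 °C for every G or C base plus 2 °C for every A or T base. The sketch below assumes that form. The function name `wallace_tm` and the example primer are hypothetical and are not part of Biopython. The rule is usually applied to short oligonucleotides (roughly 14 to 20 bases), so estimates for longer sequences should be read with caution.

```python
def wallace_tm(sequence):
    """Estimate Tm in degrees Celsius as 4*(G+C) + 2*(A+T) (assumed Wallace rule)."""
    seq = str(sequence).upper()
    gc = seq.count("G") + seq.count("C")
    at = seq.count("A") + seq.count("T")
    return 4.0 * gc + 2.0 * at


primer = "CGTAGCTAGCTAGCTAGCTA"  # hypothetical 20-base primer: 10 G/C, 10 A/T
print(wallace_tm(primer))  # 4*10 + 2*10 = 60.0
```

For the 37-base sequence used in this session (19 G/C and 18 A/T bases), the same arithmetic gives 4 × 19 + 2 × 18 = 112, which is far above the other two estimates because the sequence is longer than the range the rule is meant for.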

# Nucleotide Molecular Weight
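    • Compute the molecular weight of a sequence with Bio.SeqUtils.molecular_weight
    • ProtParam.ProteinAnalysis (in Bio.SeqUtils.ProtParam, for analysing protein sequences)
    • Counter from collections (for counting the bases in a sequence)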
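
A minimal sketch of how `Counter` from `collections` can be used to inspect base composition. It uses a plain Python string rather than a `Seq` object, and the variable names (`counts`, `gc_share`) are chosen here for illustration only.

```python
from collections import Counter

# The sequence used throughout this session, as a plain string
sequence = "CGTAGCTAGCTAGCTAGCTAGCTACGTAGCTACAGCT"

# Counter maps each base to the number of times it occurs
counts = Counter(sequence)
print(dict(counts))

# The GC fraction follows directly from the counts
gc_share = (counts["G"] + counts["C"]) / len(sequence)
print(round(gc_share, 4))
```

Counting once and reusing the counts avoids scanning the sequence separately for each base.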


In [None]:
from Bio.Seq import Seq

In [8]:
Sequence_A = Seq('CGTAGCTAGCTAGCTAGCTAGCTACGTAGCTACAGCT')

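def find_GC(sequence):
    """Return the GC fraction (0 to 1) of an upper-case sequence."""
    # 'sequence' replaces the old parameter name 'input', which shadowed a built-in
    g_count = sequence.count('G')
    c_count = sequence.count('C')
    total_count = len(sequence)
    return (g_count + c_count) / total_count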



In [None]:
print("The fraction of nitrogenous bases in a DNA or RNA molecule that are either G or C")
GC_A = find_GC(Sequence_A)
print(GC_A)
AT_A = 1 - GC_A
print(AT_A)

The fraction of nitrogenous bases in a DNA or RNA molecule that are either G or C
0.5135135135135135
0.4864864864864865


In [12]:
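# GC returns a percentage; newer Biopython releases deprecate it in favor of gc_fraction, which returns a fraction
from Bio.SeqUtils import GC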

In [15]:
print(GC(Sequence_A))
print(100 - GC(Sequence_A))

51.351351351351354
48.648648648648646


In [17]:
from Bio.SeqUtils import MeltingTemp as mt

In [21]:
print(mt.Tm_GC(Sequence_A))
print(mt.Tm_Wallace(Sequence_A))
print(mt.Tm_NN(Sequence_A))

64.49273010325301
112.0
64.91989930511858
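
The `Tm_GC` value above can be approximated by hand, which shows what the empirical formula is doing. The sketch below assumes that the default settings of `Tm_GC` correspond to Tm = 81.5 + 16.6 · log10([Na+] / (1 + 0.7 · [Na+])) + 0.41 · (%GC) - 600 / N, with [Na+] = 50 mM and N the sequence length. This is an assumption about the library defaults, not something stated in this notebook, and the function name `tm_gc_sketch` is hypothetical.

```python
import math


def tm_gc_sketch(sequence, na_molar=0.05):
    """GC-based Tm estimate under the assumed formula; na_molar is [Na+] in mol/L."""
    seq = str(sequence).upper()
    n = len(seq)
    percent_gc = 100.0 * (seq.count("G") + seq.count("C")) / n
    # Salt term: lower monovalent ion concentration lowers the estimate
    salt_term = 16.6 * math.log10(na_molar / (1.0 + 0.7 * na_molar))
    # Length term: shorter sequences are penalised more strongly
    return 81.5 + salt_term + 0.41 * percent_gc - 600.0 / n


print(round(tm_gc_sketch("CGTAGCTAGCTAGCTAGCTAGCTACGTAGCTACAGCT"), 2))
```

Under these assumptions the result agrees with the `mt.Tm_GC` output to two decimal places (64.49).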


In [22]:
from Bio.SeqUtils import molecular_weight

In [23]:
print(molecular_weight(Sequence_A))

11429.2837
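
The molecular weight can also be approximated by hand from the base counts. The sketch below assumes average masses for the four deoxynucleoside monophosphates (A 331.2218, C 307.1971, G 347.2212, T 322.2085) and 18.0153 for water, and it assumes a linear single strand with a 5' phosphate. These values and the function name `dna_weight_sketch` are assumptions made for illustration.

```python
from collections import Counter

# Assumed average masses (Da) of deoxynucleoside monophosphates
DNA_WEIGHTS = {"A": 331.2218, "C": 307.1971, "G": 347.2212, "T": 322.2085}
WATER = 18.0153  # one water molecule is released per phosphodiester bond


def dna_weight_sketch(sequence):
    """Approximate weight of linear single-stranded DNA (A, C, G, T only)."""
    seq = str(sequence).upper()
    counts = Counter(seq)
    # Sum the monomer masses; raises KeyError for any other character
    total = sum(DNA_WEIGHTS[base] * n for base, n in counts.items())
    # A strand of N bases has N - 1 bonds, each formed with loss of one water
    return total - (len(seq) - 1) * WATER


print(round(dna_weight_sketch("CGTAGCTAGCTAGCTAGCTAGCTACGTAGCTACAGCT"), 4))
```

With these assumed masses the sketch gives a value matching the `molecular_weight` output above to four decimal places (11429.2837).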
