# Construction of pYPKa_A_EcfabD

This notebook describes the construction of the _E. coli_ vector [pYPKa_A_EcfabD](pYPKa_A_EcfabD.gb).

![pYPKa_A plasmid](pYPK_A.png "pYPKa_A plasmid")

A part of the [pydna](https://pypi.python.org/pypi/pydna/) package is imported in the code cell below.

In [1]:
from pydna.readers import read
from pydna.genbank import Genbank
from pydna.parsers import parse_primers
from pydna.amplify import pcr
from pydna.amplify import Anneal

The vector backbone [pYPKa](pYPKa.gb) is read from a local file.

In [2]:
pYPKa = read("pYPKa.gb")

The restriction enzyme [AjiI](http://rebase.neb.com/rebase/enz/AjiI.html) is imported from [Biopython](http://biopython.org).

In [3]:
from Bio.Restriction import AjiI

The plasmid is linearized with the enzyme.

In [4]:
pYPKa_AjiI  = pYPKa.linearize(AjiI)
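
Linearization can be pictured with plain strings. The sketch below is not how pydna or Biopython implement it; it assumes that AjiI recognizes `CACGTC` and cuts bluntly in the middle of the site (`CAC^GTC`), and it uses a short hypothetical sequence rather than pYPKa. The helper names `revcomp` and `linearize_blunt` are made up for illustration.

```python
def revcomp(seq):
    """Reverse complement of a DNA string."""
    return seq.translate(str.maketrans("ACGTacgt", "TGCAtgca"))[::-1]


def linearize_blunt(circular_seq, site, cut_offset):
    """Open a circular sequence at the single blunt cut made inside `site`.

    `cut_offset` is the number of bases of `site` left of the cut
    (3 for the assumed CAC^GTC). Both strands are searched because
    the site is not palindromic.
    """
    seq = circular_seq.upper()
    # Append the first bases again so a site spanning the origin is found.
    doubled = seq + seq[: len(site) - 1]
    cuts = set()
    for pattern, offset in ((site, cut_offset),
                            (revcomp(site), len(site) - cut_offset)):
        for i in range(len(seq)):
            if doubled.startswith(pattern, i):
                cuts.add((i + offset) % len(seq))
    if len(cuts) != 1:
        raise ValueError(f"expected exactly one cut, found {len(cuts)}")
    cut = cuts.pop()
    return seq[cut:] + seq[:cut]


toy_plasmid = "GGATCCACGTCTTAAGC"  # hypothetical circular sequence, not pYPKa
linear = linearize_blunt(toy_plasmid, "CACGTC", 3)
print(linear)
```

The linear molecule starts with `GTC` and ends with `CAC`, the two halves of the cut site, which is why religation without an insert restores the site.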

Access to [Genbank](http://www.ncbi.nlm.nih.gov/nuccore) is needed in order to download the template.
If the email address below is not yours, change it before executing this script as you must always give NCBI a way to contact you when using their service.

In [5]:
gb = Genbank("bjornjobb@gmail.com")

The template is downloaded from Genbank below.

In [6]:
template  = gb.nucleotide(" NC_000913 REGION: 1149728..1150657")
template

The two primers below are used to amplify the insert.

In [7]:
fp,rp =  parse_primers(""">706_EcfabD_fw
                          aaATGACGCAATTTGCATT
                          >707_EcfabD_rv
                          TTAAAGCTCGAGCGCC""")

The gene is amplified using the primers specified above.

In [8]:
ins = pcr(fp, rp, template)

The primers anneal on the template like this.

In [9]:
ins.figure()

  5ATGACGCAATTTGCATT...GGCGCTCGAGCTTTAA3
                       ||||||||||||||||
                      3CCGCGAGCTCGAAATT5
5aaATGACGCAATTTGCATT3
   |||||||||||||||||
  3TACTGCGTTAAACGTAA...CCGCGAGCTCGAAATT5
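
The figure above can be reproduced in a simplified form to see where the product size comes from. The sketch below is an assumption-laden illustration, not pydna's annealing algorithm: it only looks for exact matches of the 3' part of each primer, and the template is a hypothetical stand-in that keeps the real ends shown in the figure but replaces the middle with filler. The names `footprint` and `min_len` are made up for this example.

```python
def revcomp(seq):
    """Reverse complement of a DNA string."""
    return seq.translate(str.maketrans("ACGTacgt", "TGCAtgca"))[::-1]


def footprint(primer, strand, min_len=12):
    """Return (start, length) of the longest 3' part of `primer` found in `strand`."""
    primer, strand = primer.upper(), strand.upper()
    for length in range(len(primer), min_len - 1, -1):
        start = strand.find(primer[-length:])
        if start != -1:
            return start, length
    raise ValueError("primer does not anneal")


fp = "aaATGACGCAATTTGCATT"   # 706_EcfabD_fw
rp = "TTAAAGCTCGAGCGCC"      # 707_EcfabD_rv

# Hypothetical template: real ends from the figure, filler in the middle.
template = "ATGACGCAATTTGCATT" + "ACGT" * 5 + "GGCGCTCGAGCTTTAA"

fw = footprint(fp, template)            # forward primer matches the top strand
rv = footprint(rp, revcomp(template))   # reverse primer matches the bottom strand

fw_tail = len(fp) - fw[1]               # non-annealing 5' bases of the forward primer
rv_tail = len(rp) - rv[1]
product_length = fw_tail + (len(template) - fw[0] - rv[0]) + rv_tail
print(fw, rv, product_length)
```

The forward primer carries a two-base 5' tail (`aa`) that does not anneal, so the product is two bases longer than the amplified region. With the 930 bp region requested from GenBank, the same arithmetic gives 932 bp.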

A suggested PCR program.

In [10]:
ins.program()

|95°C|95°C               |    |tmf:52.6
|____|_____          72°C|72°C|tmr:59.2
|3min|30s  \ 58.1°C _____|____|45s/kb
|    |      \______/ 0:41|5min|GC 55%
|    |       30s         |    |932bp

The final vector is:

In [11]:
pYPKa_A_EcfabD = (pYPKa_AjiI  + ins).looped().synced("gttctgatcctcgagcatcttaagaattc")

The vector with the insert in the reverse orientation is created below. In theory, this vector makes up
fifty percent of the clones. The PCR strategy below is used to identify the correct clones.

In [12]:
pYPKa_A_EcfabDb = (pYPKa_AjiI  + ins.rc()).looped().synced("gttctgatcctcgagcatcttaagaattc")
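
The two orientations differ only in whether the insert or its reverse complement is joined to the backbone. A minimal string-based sketch, using short hypothetical sequences in place of the real backbone and insert:

```python
def revcomp(seq):
    """Reverse complement of a DNA string."""
    return seq.translate(str.maketrans("ACGTacgt", "TGCAtgca"))[::-1]


backbone = "GTCTTAAGCGGATCCAC"   # hypothetical linearized vector
insert = "AAATGACGCAATTTGC"      # hypothetical insert starting like the forward primer

forward_clone = backbone + insert            # insert in the intended orientation
reverse_clone = backbone + revcomp(insert)   # same insert, flipped

probe = "ATGACGC"  # start of the insert-specific forward primer footprint
print(probe in forward_clone)            # primer sequence is on the top strand
print(probe in reverse_clone)            # not on the top strand of the flipped clone
print(probe in revcomp(reverse_clone))   # but present on its bottom strand
```

Because the insert-specific primer points in opposite directions in the two clones, it pairs with a different vector primer in each case, which is what makes the product sizes diagnostic.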

A combination of standard primers and the newly designed primers is
used to identify correct clones.
Standard primers are listed [here](standard_primers.txt).
The standard primers are read into a dictionary in the code cell below.

In [13]:
p = { x.id: x for x in parse_primers("standard_primers.txt") }
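
For readers without pydna at hand, the idea of a primer dictionary keyed by name can be sketched with a small FASTA reader. This is an illustration only: `parse_fasta_primers` is a hypothetical helper, it returns plain strings rather than pydna primer objects, and the text parsed here is the pair of primers defined earlier, not the contents of `standard_primers.txt`.

```python
def parse_fasta_primers(text):
    """Map each FASTA header name (first word after '>') to its sequence."""
    primers = {}
    name = None
    for line in text.splitlines():
        line = line.strip()
        if not line:
            continue
        if line.startswith(">"):
            name = line[1:].split()[0]
            primers[name] = ""
        elif name is not None:
            primers[name] += line  # sequences may span several lines
    return primers


primers = parse_fasta_primers(""">706_EcfabD_fw
aaATGACGCAATTTGCATT
>707_EcfabD_rv
TTAAAGCTCGAGCGCC""")
print(primers["706_EcfabD_fw"])
```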

## Diagnostic PCR confirmation of pYPKa_A_EcfabD
The correct structure of pYPKa_A_EcfabD is confirmed by a multiplex PCR with three primers
in one reaction: the vector-specific standard primers 577 and 342 and the insert-specific
forward primer 706_EcfabD_fw.

Two PCR products are expected if the insert was successfully cloned, sizes depending
on the orientation of the insert.
If the vector is empty, only one short product is formed.

## Expected PCR product sizes

pYPKa_A_EcfabD with insert in correct orientation.

In [14]:
Anneal( (p['577'], p['342'], fp), pYPKa_A_EcfabD).products

[Amplicon(1866), Amplicon(1648)]

pYPKa_A_EcfabD with insert in reverse orientation.

In [15]:
Anneal( (p['577'], p['342'], fp), pYPKa_A_EcfabDb).products

[Amplicon(1866), Amplicon(1150)]

Empty clone (pYPKa without insert).

In [16]:
Anneal( (p['577'], p['342'], fp), pYPKa).products

[Amplicon(934)]
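
The three expected patterns above can be turned into a small lookup for scoring colony PCR results. This is a sketch under stated assumptions: the `classify` helper and the 50 bp tolerance for reading band sizes off a gel are hypothetical; only the expected sizes come from the cells above.

```python
# Expected product sizes (bp) taken from the three Anneal results above.
EXPECTED = {
    "correct orientation": (1866, 1648),
    "reverse orientation": (1866, 1150),
    "empty vector": (934,),
}


def classify(bands, tolerance=50):
    """Match observed band sizes (bp) against the expected patterns.

    `tolerance` is an assumed allowance for imprecise size estimates.
    """
    observed = sorted(bands, reverse=True)
    for label, expected in EXPECTED.items():
        if len(observed) == len(expected) and all(
            abs(o - e) <= tolerance for o, e in zip(observed, expected)
        ):
            return label
    return "unexpected pattern"


print(classify([1650, 1870]))
print(classify([1150, 1866]))
print(classify([940]))
```

The 1866 bp product appears in both orientations, so the second band (1648 bp versus 1150 bp) is the one that distinguishes them.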

The cseguid checksum for the resulting plasmid is calculated for future reference.
The [cseguid checksum](http://pydna.readthedocs.org/en/latest/pydna.html#pydna.utils.cseguid)
uniquely identifies a circular double stranded sequence.

In [17]:
pYPKa_A_EcfabD.cseguid()

IZGI2dn5ZeLsGYQmjUqYcA5VUoc
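
A checksum for a circular double-stranded sequence has to give the same value regardless of where the sequence starts and which strand is written down. The sketch below illustrates one way to achieve that: pick the lexicographically smallest rotation among both strands, then hash it. It is an illustration of the idea only and is not claimed to reproduce the values returned by pydna; `circular_checksum` and `smallest_rotation` are hypothetical names.

```python
import base64
import hashlib


def revcomp(seq):
    """Reverse complement of a DNA string."""
    return seq.translate(str.maketrans("ACGTacgt", "TGCAtgca"))[::-1]


def smallest_rotation(seq):
    """Lexicographically smallest rotation (quadratic; fine for a sketch)."""
    return min(seq[i:] + seq[:i] for i in range(len(seq)))


def circular_checksum(seq):
    """Strand- and origin-independent checksum of a circular sequence."""
    seq = seq.upper()
    canonical = min(smallest_rotation(seq), smallest_rotation(revcomp(seq)))
    digest = hashlib.sha1(canonical.encode("ascii")).digest()
    # URL-safe base64 without padding: 27 characters for a SHA-1 digest.
    return base64.urlsafe_b64encode(digest).decode("ascii").rstrip("=")


toy = "GGATCCACGTCTTAAGC"  # hypothetical circular sequence
same = (
    circular_checksum(toy)
    == circular_checksum(toy[5:] + toy[:5])   # different origin
    == circular_checksum(revcomp(toy))        # other strand
)
print(same)
```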

The sequence is given a locus name based on the cloned insert, truncated to at most 16 characters.

In [18]:
pYPKa_A_EcfabD.locus = "pYPKa_A_EcfabD"[:16]

The sequence is stamped with the cseguid checksum.
This can be used to verify the integrity of the sequence file.

In [19]:
pYPKa_A_EcfabD.stamp()

cSEGUID_IZGI2dn5ZeLsGYQmjUqYcA5VUoc

The sequence is written to a local file.

In [20]:
pYPKa_A_EcfabD.write("pYPKa_A_EcfabD.gb")