Skip to content

noconnor29/CodonAnalysis

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

29 Commits
 
 
 
 
 
 
 
 

Repository files navigation

CodonAnalysis

Tool to analyze DNA sequences to identify relevant amino acids.

Initially in python with potential for a version in a compiled language.

Inputs

  • path to directory of files containing DNA sequences
  • prompt for sequence direction (forward/coding vs reverse/complement)
    • May need to reverse the sequence direction then generate complementary base sequence
  • nucleotide sequence of start codon (or arbitrary string)
    • default: ATG
  • amino acid sequence of tag/ stop codon
    • default: GAACAAAAGCTTATTTCTGAAGAGGACTTG [3]

Procedure

Given a DNA sequence (~1k bases), a start codon, and a stop codon...

  1. Read DNA sequence base-wise in the specified direction
  2. Identify location of start codon
  3. Identify location of tag
  4. Create open reading frame (ORF) between start and tag
  5. Translate into amino acids backwards from tag
  6. Continue translation until start codon reached
  7. Report following data points:
    • Sample metadata
    • Original and transformed sequence
    • ORF sequence and nucleotide count
    • In frame? yes/no
    • AA sequence from start to stop

Resources

  1. DNA to Protein in Python 3
  2. DNA and RNA codon tables
  3. myc-tag

About

Tool to analyze DNA sequences to identify relevant amino acids

Resources

License

Stars

Watchers

Forks

Packages

No packages published

Languages