## GC content in DNA

In [18]:
def gc_content(dna_sequence):
  """
  This function calculates the GC content of a DNA sequence.

  Args:
    dna_sequence: A string of DNA nucleotides.

  Returns:
    The GC content of the DNA sequence, as a float.
  """

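  # An empty sequence has no defined GC content; return 0.0 instead of dividing by zero.
  if not dna_sequence:
    return 0.0

  gc_count = 0
  for nucleotide in dna_sequence:
    if nucleotide == "G" or nucleotide == "C":
      gc_count += 1

  return gc_count / len(dna_sequence) * 100


dna_sequence = "AATTCGATAGCTAGAAGTAGACCCAAAATATACCATAGGGGCATAGCATAGCAGTAGAC"


# Store the result under a different name so the function is not overwritten.
gc_percent = gc_content(dna_sequence)
print(gc_percent)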

40.67796610169492


This code first defines a function called `gc_content()`, which takes a DNA sequence as input and returns the GC content of the sequence. The function works by first counting the number of "G" and "C" nucleotides in the sequence. It then divides this number by the length of the sequence and multiplies by 100 to get the GC content as a percentage.

The code then assigns a 59-nucleotide DNA sequence to the variable `dna_sequence` and calls the `gc_content()` function with it as the input. The result is printed to the console: 24 of the 59 nucleotides are "G" or "C", which gives a GC content of about 40.68%.
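The same calculation can be sketched more compactly with `str.count`. The function name `gc_content_compact` is hypothetical, and upper-casing the input and returning 0.0 for an empty sequence are assumptions made for this example rather than behaviour taken from the cell above.

```python
def gc_content_compact(dna_sequence):
  """Return the GC content of a DNA sequence as a percentage."""
  # Assumption: lower-case input should count too, so normalise the case first.
  sequence = dna_sequence.upper()
  if not sequence:
    # Assumption: report 0.0 for an empty sequence instead of dividing by zero.
    return 0.0
  gc_count = sequence.count("G") + sequence.count("C")
  return gc_count / len(sequence) * 100


print(gc_content_compact("AATTCGATAGCTAGAAGTAGACCCAAAATATACCATAGGGGCATAGCATAGCAGTAGAC"))
```

Counting with `str.count` avoids the explicit loop, at the cost of scanning the string twice.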



## DNA to protein

In [20]:
def dna_to_protein(dna):
  """
  Converts a DNA sequence to a protein sequence.

  Args:
    dna: The DNA sequence, as a string.

  Returns:
    The protein sequence, as a string.
  """

  codons = {
      'AAA': 'K',
      'AAC': 'N',
      'AAG': 'K',
      'AAT': 'N',
      'ACA': 'T',
      'ACC': 'T',
      'ACG': 'T',
      'ACT': 'T',
      'AGA': 'R',
      'AGC': 'S',
      'AGG': 'R',
      'AGT': 'S',
      'ATA': 'I',
      'ATC': 'I',
      'ATG': 'M',
      'ATT': 'I',
      'CAA': 'Q',
      'CAC': 'H',
      'CAG': 'Q',
      'CAT': 'H',
      'CCA': 'P',
      'CCC': 'P',
      'CCG': 'P',
      'CCT': 'P',
      'CGA': 'R',
      'CGC': 'R',
      'CGG': 'R',
      'CGT': 'R',
      'CTA': 'L',
      'CTC': 'L',
      'CTG': 'L',
      'CTT': 'L',
      'GAA': 'E',
      'GAC': 'D',
      'GAG': 'E',
      'GAT': 'D',
      'GCA': 'A',
      'GCC': 'A',
      'GCG': 'A',
      'GCT': 'A',
      'GGA': 'G',
      'GGC': 'G',
      'GGG': 'G',
      'GGT': 'G',
      'GTA': 'V',
      'GTC': 'V',
      'GTG': 'V',
      'GTT': 'V',
      'TAA': '*',
      'TAC': 'Y',
      'TAG': '*',
      'TAT': 'Y',
      'TCA': 'S',
      'TCC': 'S',
      'TCG': 'S',
      'TCT': 'S',
      'TGA': '*',
      'TGC': 'C',
      'TGG': 'W',
      'TGT': 'C',
      'TTA': 'L',
      'TTC': 'F',
      'TTG': 'L',
      'TTT': 'F',
  }

  protein = ''
  for i in range(0, len(dna), 3):
    codon = dna[i:i + 3]
    # Unknown or incomplete codons are marked with '?' instead of raising a KeyError.
    protein += codons.get(codon, '?')

  return protein


def main():
  dna = "ATTCTGCCCTCGAGCCCACCGGGAACGAAAGAGAAGCTCTATCTCCCCTCCAGGAGCCCAGCTATGAAC"

  protein = dna_to_protein(dna)
  print(protein)


if __name__ == '__main__':
  main()


ILPSSPPGTKEKLYLPSRSPAMN


This code first creates a dictionary that maps each DNA codon to the one-letter code of its amino acid, with `*` marking the three stop codons (TAA, TAG, and TGA). It then iterates over the DNA sequence three nucleotides at a time, looks up each codon in the dictionary, and appends the result to the protein sequence. Finally, it prints the protein sequence.
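A variation on the codon lookup described above is to stop translating at the first stop codon. The sketch below is only an illustration: `translate_until_stop`, `BASES`, `AMINO_ACIDS`, and `CODON_TABLE` are hypothetical names, the table is the standard genetic code built in TCAG order, and ignoring trailing bases and marking unknown codons with `?` are assumptions.

```python
from itertools import product

# Standard genetic code, listed in TCAG order for each codon position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)}


def translate_until_stop(dna):
  """Translate DNA codon by codon, stopping at the first stop codon."""
  protein = []
  # Only visit complete codons; one or two trailing bases are ignored.
  for i in range(0, len(dna) - len(dna) % 3, 3):
    # Assumption: codons missing from the table (e.g. containing N) become '?'.
    amino_acid = CODON_TABLE.get(dna[i:i + 3].upper(), "?")
    if amino_acid == "*":
      break
    protein.append(amino_acid)
  return "".join(protein)


print(translate_until_stop("ATGAAATAGGGG"))  # MK
```

Building the table from two short strings keeps the example compact; the explicit dictionary in the cell above is easier to read and to check by eye.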

## DNA to RNA

In [21]:
def dna_to_rna(dna_sequence):
  """
  Converts a DNA sequence to RNA sequence.

  Args:
    dna_sequence: A string of DNA nucleotides.

  Returns:
    A string of RNA nucleotides.
  """

  rna_sequence = ""
  for nucleotide in dna_sequence:
    if nucleotide == "A":
      rna_sequence += "U"
    elif nucleotide == "C":
      rna_sequence += "G"
    elif nucleotide == "G":
      rna_sequence += "C"
    elif nucleotide == "T":
      rna_sequence += "A"
  return rna_sequence


def main():
  dna_sequence = "ATTCTGCCCTCGAGCCCACCGGGAACGAAAGAGAAGCTCTATCTCCCCTCCAGGAGCCCAGCTATGAAC"
  rna_sequence = dna_to_rna(dna_sequence)
  print(rna_sequence)

if __name__ == "__main__":
  main()



UAAGACGGGAGCUCGGGUGGCCCUUGCUUUCUCUUCGAGAUAGAGGGGAGGUCCUCGGGUCGAUACUUG


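This code first defines a function called `dna_to_rna()`, which takes a DNA sequence as input and returns the corresponding RNA sequence. The function iterates over the DNA sequence and appends the complementary RNA base for each nucleotide: "A" becomes "U", "T" becomes "A", "C" becomes "G", and "G" becomes "C". In other words, it treats the input as the template strand rather than only replacing each "T" with a "U". Characters other than "A", "C", "G", and "T" are silently skipped.

The `main()` function then defines a DNA sequence and calls `dna_to_rna()` to convert it to RNA. The result is then printed to the console.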
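The per-nucleotide mapping can also be expressed with a translation table. In this sketch, `TEMPLATE_TO_RNA`, `transcribe_template`, and `transcribe_coding` are hypothetical names; the second function shows the other common convention, which assumes the input is the coding strand. Unlike the loop above, `str.translate` leaves unrecognised characters in place instead of dropping them.

```python
# Complement table matching the mapping used above: A->U, T->A, C->G, G->C.
TEMPLATE_TO_RNA = str.maketrans("ATCG", "UAGC")


def transcribe_template(dna_sequence):
  """Build the complementary RNA, reading the input as the template strand."""
  return dna_sequence.translate(TEMPLATE_TO_RNA)


def transcribe_coding(dna_sequence):
  """Assumption: the input is the coding strand, so only T is swapped for U."""
  return dna_sequence.replace("T", "U")


print(transcribe_template("ATTCTG"))  # UAAGAC
print(transcribe_coding("ATTCTG"))    # AUUCUG
```

Which convention applies depends on which strand the input sequence represents.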

## RNA to DNA

In [27]:
def rna_to_dna(rna_sequence):
  """
  Converts an RNA sequence to a DNA sequence.

  Args:
    rna_sequence: A string of RNA nucleotides.

  Returns:
    A string of DNA nucleotides.
  """

  # Build the complementary DNA strand, reversing the mapping used in dna_to_rna().
  dna_sequence = ""
  for nucleotide in rna_sequence:
    if nucleotide == "U":
      dna_sequence += "A"
    elif nucleotide == "C":
      dna_sequence += "G"
    elif nucleotide == "G":
      dna_sequence += "C"
    elif nucleotide == "A":
      dna_sequence += "T"
  return dna_sequence


def main():
  rna_sequence = "UAAGACGGGAGCUCGGGUGGCCCUUGCUUUCUCUUCGAGAUAGAGGGGAGGUCCUCGGGUCGAUACUUG"
  dna_sequence = rna_to_dna(rna_sequence)
  print(dna_sequence)

if __name__ == "__main__":
  main()

ATTCTGCCCTCGAGCCCACCGGGAACGAAAGAGAAGCTCTATCTCCCCTCCAGGAGCCCAGCTATGAAC
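
Because the RNA-to-DNA mapping is the inverse of the DNA-to-RNA mapping, converting a sequence to RNA and back should return the original DNA. A minimal sketch of that check follows; the table names `DNA_TO_RNA` and `RNA_TO_DNA` and the short test sequence are assumptions chosen for this example.

```python
# Hypothetical translation tables mirroring the two mappings in this notebook.
DNA_TO_RNA = str.maketrans("ATCG", "UAGC")
RNA_TO_DNA = str.maketrans("UAGC", "ATCG")

dna = "ATTCTGCCCTCGAGCCC"
rna = dna.translate(DNA_TO_RNA)
restored = rna.translate(RNA_TO_DNA)

print(rna)              # UAAGACGGGAGCUCGGG
print(restored == dna)  # True
```

A round trip like this is a quick way to catch a swapped variable name or a mistyped base in either mapping.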
