Completed gene finder #1

vivienyuwenchen · 2017-09-14T22:04:11Z

I finally understood the purpose of if name == "main": and moved the gene_finder(dna) portion under that branch.

SeunginLyu

It looks really good in general! I like your clean syntax practices and the code is really well documented with appropriate docstings. I added a few comments that are mostly stylistic guides.

SeunginLyu · 2017-09-25T15:03:36Z

gene_finder.py


 """

 import random
 from amino_acids import aa, codons, aa_table   # you may find these useful
 from load import load_seq
+dna = load_seq("./data/X73525.fa")
+import doctest
+from pickle import dump, load


Wow pickle toolbox! great job

SeunginLyu · 2017-09-25T15:06:08Z

gene_finder.py


 """

 import random
 from amino_acids import aa, codons, aa_table   # you may find these useful
 from load import load_seq
+dna = load_seq("./data/X73525.fa")


you usually want to separate any function calls from these import statements. It would make more sense to put it under if name == "main":

SeunginLyu · 2017-09-25T15:07:40Z

gene_finder.py

+    elif nucleotide == 'T':
+        return 'A'
+
+#doctest.run_docstring_examples(get_complement, globals(), verbose=True)


please remove comments in your final upload

SeunginLyu · 2017-09-25T15:09:09Z

gene_finder.py

+        I added two more doctests because each complementary nucleotide is in
+        its own if/else if branch. If one branch doesn't work and there is no
+        doctest for that branch, the mistake will not be caught. In fact, I
+        found out that I spelled nucleotide wrong in the G branch.


great practice that you are adding your own tests

SeunginLyu · 2017-09-25T15:12:50Z

gene_finder.py

+
+    return comp_str
+
+#doctest.run_docstring_examples(get_reverse_complement, globals(), verbose=True)


another comment to remove

SeunginLyu · 2017-09-25T15:17:00Z

gene_finder.py

+    for nuc in range(0, len(reversed_dna)):
+        comp_nuc = get_complement(reversed_dna[nuc])
+        comp_list.append(comp_nuc)
+


if you wanna get fancy with just one line of code :
return ''.join([get_complement(reversed_dna[nuc]) for nuc in dna[::-1]])

SeunginLyu · 2017-09-25T15:21:27Z

gene_finder.py

+        I added two more doctests: one in which the frame stop codon is TAA and
+        another in which there is no frame stop codon. This way, all the
+        branches are tested for error.
+


SeunginLyu · 2017-09-25T15:21:55Z

gene_finder.py

+
+    return dna
+
+#doctest.run_docstring_examples(rest_of_ORF, globals(), verbose=True)


another comment to remove

SeunginLyu · 2017-09-25T15:24:50Z

gene_finder.py

+    frame1 = find_all_ORFs_oneframe(dna)
+    frame2 = find_all_ORFs_oneframe(dna[1:])
+    frame3 = find_all_ORFs_oneframe(dna[2:])
+


you could also use a for loop here
[find_all_ORFs_oneframe(dna[i:]) for i in range 3]

SeunginLyu · 2017-09-25T15:26:35Z

gene_finder.py

+            humongest = longest_ORF(dna_shuffles[i+1])
+
+    return len(humongest)
+


it would be more memory efficient to just save the max_length of longest_ORF than to save the actual longest_ORF

vivienyuwenchen added 5 commits September 13, 2017 02:40

Completed gene finder

de73a03

Updated with cooler features

d7e632a

Completed gene finder with pickling

5b768c4

List of genes

78b7d7e

Finally understood the purpose of if __name__ == __main__:

0bf340b

SeunginLyu reviewed Sep 25, 2017

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Completed gene finder #1

Completed gene finder #1

vivienyuwenchen commented Sep 14, 2017 •

edited

Loading

SeunginLyu left a comment

SeunginLyu Sep 25, 2017

SeunginLyu Sep 25, 2017

SeunginLyu Sep 25, 2017

SeunginLyu Sep 25, 2017

SeunginLyu Sep 25, 2017

SeunginLyu Sep 25, 2017

SeunginLyu Sep 25, 2017

SeunginLyu Sep 25, 2017

SeunginLyu Sep 25, 2017

SeunginLyu Sep 25, 2017


		return comp_str

		#doctest.run_docstring_examples(get_reverse_complement, globals(), verbose=True)


		return dna

		#doctest.run_docstring_examples(rest_of_ORF, globals(), verbose=True)

		humongest = longest_ORF(dna_shuffles[i+1])

		return len(humongest)

Completed gene finder #1

Are you sure you want to change the base?

Completed gene finder #1

Conversation

vivienyuwenchen commented Sep 14, 2017 • edited Loading

SeunginLyu left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

vivienyuwenchen commented Sep 14, 2017 •

edited

Loading