improving process_results #40

antgonza · 2015-12-02T18:48:27Z

No description provided.

wasade · 2015-12-03T18:12:51Z

platypus/parse.py

                db_seqs_counts_a[subject_id_a] += 1
                db_seqs_counts_b[subject_id_b] += 1
            elif vals['a']['bit_score'] > vals['b']['bit_score']:
                if not subject_id_b:
                    results[i]['perfect_interest'] += 1
-                    results[i]['summary'].append('%s\t%s\t' % (seq_name,
-                                                               subject_id_a))
+                    results[i]['summary_fh'].write('%s\t%s\t\n' % (


the header on line 239 has 3 columns but this is only writing two. Would it be possible to write an explicit null value so the resulting file is not jagged?

Results are not gonna be jagged are they?

You are right, it expects 3 values: seq_id best_A best_B. In this case, there is no best_B. Thus, just adding a new line is fine cause there is no value there. However, I can add something more specific but not sure what.

The results shouldn't be jagged. BTW there is a test that checks that the resulting files are the same.

antgonza · 2015-12-08T18:36:09Z

@ElDeveloper @wasade ready for another pass ... thanks!

ElDeveloper · 2015-12-09T20:22:37Z

platypus/parse.py

    for (perc_id_a, aln_len_a), (perc_id_b, aln_len_b) in izip(iter_a, iter_b):
        filename = "p1_%d-a1_%d_p2_%d-a2_%d" % (perc_id_a, aln_len_a,
                                                perc_id_b, aln_len_b)
+        summary_filename = join(output_dir, "summary_" + filename + ".txt")
+        summary_fh = open(summary_filename, 'w')


Hmmm, I just noticed that this file handle is not being closed, or am I missing something.

That's right, it's not and I think is fine cause their will be closed once the program finishes. If we want to close them, we will need to put a for loop at the end of this one to close them. Should I do that?

I think this would be a good idea specially, if the number of files grows, we might run out of file handles 😱

ElDeveloper · 2015-12-09T20:24:10Z

Just ☝️ comment.

improving process_results

first commit

ee14974

wasade reviewed Dec 3, 2015
View reviewed changes

adressing comments

f0faaa0

ElDeveloper reviewed Dec 9, 2015
View reviewed changes

addressing @ElDeveloper comment

d491868

ElDeveloper added a commit that referenced this pull request Dec 19, 2015

Merge pull request #40 from antgonza/improving-process-results

3b9e790

improving process_results

ElDeveloper merged commit 3b9e790 into biocore:master Dec 19, 2015

ElDeveloper mentioned this pull request Dec 19, 2015

Refactor to move large lists to simple files #41

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

improving process_results #40

improving process_results #40

antgonza commented Dec 2, 2015

wasade Dec 3, 2015

ElDeveloper Dec 4, 2015

antgonza Dec 8, 2015

antgonza commented Dec 8, 2015

ElDeveloper Dec 9, 2015

antgonza Dec 9, 2015

ElDeveloper Dec 9, 2015

ElDeveloper commented Dec 9, 2015

improving process_results #40

improving process_results #40

Conversation

antgonza commented Dec 2, 2015

wasade Dec 3, 2015

Choose a reason for hiding this comment

ElDeveloper Dec 4, 2015

Choose a reason for hiding this comment

antgonza Dec 8, 2015

Choose a reason for hiding this comment

antgonza commented Dec 8, 2015

ElDeveloper Dec 9, 2015

Choose a reason for hiding this comment

antgonza Dec 9, 2015

Choose a reason for hiding this comment

ElDeveloper Dec 9, 2015

Choose a reason for hiding this comment

ElDeveloper commented Dec 9, 2015