change translation frame type description #58

pranathivemuri · 2020-04-23T00:07:46Z

No description provided.

pranathivemuri · 2020-04-24T02:23:29Z

Addresses issue #59.

snafees · 2020-04-24T02:36:17Z

I understand that this is the desired JSON output:

'classification_value_counts': {'All translations shorter than peptide k-mer size + 1': 1, 'All translation frames have stop codons': 3, 'Coding': 5, 'Non-coding': 11, 'Low complexity nucleotide': 0, 'Read length was shorter than 3 * peptide k-mer size': 2, 'Low complexity peptide in dayhoff6 alphabet': 1}

But are we ultimately trying to tell the user, for example, "all translations shorter than peptide k-mer size + 1 = 1", "all translation frames that have stop codons = 3", "number of coding reads = 5", and so on? If so, maybe the JSON output could be phrased slightly differently so that it is easier for a new user to make sense of.
Maybe that is the goal of another PR at another time, but I just wanted to check!

pranathivemuri · 2020-04-24T02:37:05Z

= is not a valid key/value separator in JSON

pranathivemuri · 2020-04-24T02:37:46Z

pranathivemuri · 2020-04-24T02:38:14Z

Also, dictionaries are conventionally written as key: value.
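To make the point about separators concrete, here is a minimal sketch using only Python's standard `json` module. The two-entry `counts` dictionary and the `is_valid_json` helper are illustrative assumptions, not code from this PR.

```python
import json

# Illustrative subset of the summary counts discussed in this thread.
counts = {"Coding": 5, "Non-coding": 11}

# json.dumps writes each entry as "key": value, separated by a colon.
serialized = json.dumps(counts)


def is_valid_json(text):
    """Hypothetical helper: report whether `text` parses as JSON."""
    try:
        json.loads(text)
    except json.JSONDecodeError:
        return False
    return True
```

With this helper, `is_valid_json('{"Coding": 5}')` is true, while the same text with `=` in place of `:` fails to parse.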

snafees · 2020-04-24T02:39:01Z

Right, I recall that now. I guess my issue is not so much with : vs. =; it has more to do with our phrasing. But that is minor, no big deal right now.

olgabot

Thanks for doing this. I had a few comments and suggestions.

olgabot · 2020-04-24T20:38:16Z

khtools/create_save_summary.py

+                counts[
+                    'Read length was shorter than 3 * peptide k-mer size'] += 1
+            elif len(unique_categories) == 1:
+                counts[unique_categories[0]] += 1


How does the index 0 work here?

pranathivemuri (Apr 24, 2020): If a read has only one unique category, the count for that category is incremented.
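A small sketch of why index 0 is safe in that branch, assuming `unique_categories` comes from a pandas `.unique()` call on the read's `category` values (the diff does not show how it is built, so that part is an assumption). When the length is exactly 1, element 0 is the only category present.

```python
from collections import Counter

import pandas as pd

# Hypothetical categories for a single read, one entry per translation frame.
categories = pd.Series(["Non-coding", "Non-coding", "Non-coding"])

# .unique() collapses repeats, so three identical labels become one entry.
unique_categories = categories.unique()

counts = Counter()
if len(unique_categories) == 1:
    # Exactly one distinct label exists, so index 0 selects it.
    counts[unique_categories[0]] += 1
```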

olgabot · 2020-04-24T20:38:21Z

khtools/create_save_summary.py

+        read_id_category = coding_scores.filter(["read_id", "category"])
+        read_ids = coding_scores.read_id.unique()
+
+        for read_id in read_ids:


For this I'd use pandas groupby:

for read_id, df in read_id_category.groupby('read_id'):
    categories_for_read_id = df

pranathivemuri (Apr 25, 2020): Thanks, changed it.
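For reference, the groupby suggestion can be sketched as follows. The toy `coding_scores` frame, its `jaccard` column, and the `categories_by_read` dictionary are assumptions made for illustration; only `read_id`, `category`, and `read_id_category` come from the diff.

```python
import pandas as pd

# Toy stand-in for the real coding scores table.
coding_scores = pd.DataFrame({
    "read_id": ["read1", "read1", "read2"],
    "category": ["Coding", "Non-coding", "Non-coding"],
    "jaccard": [0.9, 0.1, 0.2],  # hypothetical extra column
})

# Keep only the two columns needed for the summary, as in the diff.
read_id_category = coding_scores.filter(["read_id", "category"])

# groupby yields one (read_id, sub-frame) pair per read, so there is no
# need to loop over unique ids and filter the frame on each iteration.
categories_by_read = {}
for read_id, df in read_id_category.groupby("read_id"):
    categories_by_read[read_id] = list(df["category"].unique())
```

Each sub-frame contains only the rows for one read, which avoids a separate boolean mask per `read_id`.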

olgabot · 2020-04-24T20:40:36Z

tests/test_create_save_summary.py

@@ -95,29 +95,29 @@ def test_get_n_per_coding_classification(
        peptide_bloom_filter_path,
        alphabet, peptide_ksize, jaccard_threshold)
    data = [
-        ['read1', 'All translations shorter than peptide k-mer size + 1'],
-        ['read2', 'All translation frames have stop codons'],
+        ['read1', 'Translation is shorter than peptide k-mer size + 1'],


This data should be updated to have multiple results per read, so that we can test that, e.g., if any result for a read is coding, the read is called coding.
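One possible shape for such test data is sketched below. The helper `count_categories_per_read` is hypothetical and is not the function under test in khtools. It encodes only the two rules mentioned in this review (any coding result makes the read coding, and a single unique category is counted as itself) and leaves other mixed cases uncounted, since the thread does not specify them.

```python
from collections import Counter

import pandas as pd

# Several rows per read, so the per-read precedence rules are exercised.
data = [
    ["read1", "Translation is shorter than peptide k-mer size + 1"],
    ["read1", "Translation is shorter than peptide k-mer size + 1"],
    ["read2", "Non-coding"],
    ["read2", "Coding"],
    ["read2", "Non-coding"],
    ["read3", "Non-coding"],
    ["read3", "Non-coding"],
]
df = pd.DataFrame(data, columns=["read_id", "category"])


def count_categories_per_read(read_id_category):
    """Hypothetical helper: assign one category per read and count them."""
    counts = Counter()
    for _, group in read_id_category.groupby("read_id"):
        unique_categories = group["category"].unique()
        if "Coding" in unique_categories:
            # Any coding result means the whole read is called coding.
            counts["Coding"] += 1
        elif len(unique_categories) == 1:
            counts[unique_categories[0]] += 1
        # Other mixed cases are left uncounted in this sketch.
    return counts


counts = count_categories_per_read(df)
```

Here `read2` is counted once as coding even though two of its three results are non-coding.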

change translation frame feature

823ae1f

pranathivemuri requested a review from olgabot April 23, 2020 00:07

pranathivemuri assigned olgabot Apr 23, 2020

fix khtools bug

5fc42ab

replace path

7289fc0

pranathivemuri added 2 commits April 23, 2020 19:54

fix error

5653f91

remove unused csv files added by mistake

506325d

pranathivemuri merged commit e5083f1 into master Apr 24, 2020

pranathivemuri mentioned this pull request Apr 24, 2020

Update summary json with new csv #59

Closed

olgabot reviewed Apr 24, 2020
