added pseudocount to raw codon counts in `syn_selection_by_codon` #40

fwelsh · 2019-11-08T23:06:34Z

No description provided.

jbloom · 2019-11-09T12:46:37Z

@frances: You should make several changes to this:

1. Make the value of the pseudocount an optional argument in the function signature with a reasonable default, such as `pseudocount=0.5`. Then expand the documentation to explain what the `pseudocount` argument means.
2. Do not use the pseudocount for the Fisher's exact test. That test should be performed on the raw counts, since Fisher's test already works fine with zero counts. The pseudocount should only be used for computing the odds ratios. Implementation-wise, this probably means doing the Fisher test first and then adding the pseudocount. Clearly explain this in the docs.
3. You'll have to fix this so it passes the doctest (right now it fails, see here). Note that if you do point (2) above, you should be changing the odds ratios in the doctest output but not the P-values, since the counts used for the P-values won't change.
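
For illustration only, here is a minimal sketch of the ordering requested above: the P-value comes from the raw counts, and the pseudocount is added afterwards, only for the odds ratio. This is not the `dms_tools2` code. The function names, the 2x2 table layout `[[a, b], [c, d]]`, and the odds-ratio definition `(a * d) / (b * c)` are all assumptions, and the Fisher test is written out with the standard library only so the example is self-contained (the real function presumably calls a library routine instead).

```python
from math import comb


def fisher_exact_two_sided(a, b, c, d):
    """Two-sided Fisher's exact test P-value for the 2x2 table [[a, b], [c, d]].

    Requires non-negative integer counts; zeros are fine.
    """
    row1, row2, col1, total = a + b, c + d, a + c, a + b + c + d

    def prob(x):
        # Hypergeometric probability of x in the top-left cell, margins fixed.
        return comb(row1, x) * comb(row2, col1 - x) / comb(total, col1)

    p_observed = prob(a)
    x_min, x_max = max(0, col1 - row2), min(row1, col1)
    # Sum over every table that is no more probable than the observed one
    # (small relative tolerance guards against floating-point ties).
    p_value = sum(
        p for p in (prob(x) for x in range(x_min, x_max + 1))
        if p <= p_observed * (1 + 1e-7)
    )
    return min(1.0, p_value)


def odds_ratio_and_p(a, b, c, d, pseudocount=0.5):
    """Return (odds_ratio, p_value) for the 2x2 table [[a, b], [c, d]]."""
    # Step 1: P-value from the raw, unmodified counts.
    p_value = fisher_exact_two_sided(a, b, c, d)
    # Step 2: add the pseudocount only now, for the odds ratio.
    a, b, c, d = (x + pseudocount for x in (a, b, c, d))
    return (a * d) / (b * c), p_value


# A table with zero cells: the raw odds ratio would be 0 / 100 here, and
# swapping the rows would divide by zero; the pseudocount avoids both.
odds_ratio, p_value = odds_ratio_and_p(0, 10, 10, 0)
print(f"odds ratio = {odds_ratio:.4g}, P = {p_value:.4g}")
```

With this ordering, changing `pseudocount` changes the odds ratio but leaves the P-value untouched, which is why only the odds ratios in the doctest output should need updating.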

jbloom

@frances: Looks good to me.

@skhilton: Do you have any comments? Otherwise I'll go ahead and merge.

skhilton

@jbloom, @fwelsh: Looks good to me! You might consider setting a default value for the pseudocount, especially if you think that 0.5 is going to be the best value for the majority of use cases. You can do this by changing https://github.com/jbloomlab/dms_tools2/blob/syn_selection_updates/dms_tools2/syn_selection.py#L16 to `def syn_selection_by_codon(counts_pre, counts_post, pseudocount=0.5):`

Also, a very small note, but I don't think you need the parentheses around `df` in the function's return statement: https://github.com/jbloomlab/dms_tools2/blob/syn_selection_updates/dms_tools2/syn_selection.py#L157

jbloom · 2019-11-11T02:11:11Z

OK, I agree with @skhilton's comments, especially about the default. @fwelsh, do you want to push that, and then I'll merge?

jbloom · 2019-11-11T04:18:28Z

@fwelsh: Sorry, I realized one more change is also needed:

1. Can you update the version number here to 2.6.3?
2. Can you add a short explanation of the change to the CHANGELOG? Then I can merge this as the new version.

fwelsh · 2019-11-11T05:00:01Z

Sorry I didn't notice that, but we should be set now!

added pseudocount to raw codon counts

41e5d2f

fwelsh added 2 commits November 10, 2019 12:19

added pseudocount arg, edited calculations

d5757db

Pseudocount is now added after calculating Fisher P-values, for manual calculation of odds ratios only.

removed extra pseudocount

9f5bc8f

Overlooked second pseudocount addition that led to failed doctest.

jbloom reviewed Nov 11, 2019


skhilton reviewed Nov 11, 2019


Made default pseudocount 0.5

c59f584

fwelsh added 2 commits November 10, 2019 20:55

Update to 2.6.3

793cdd1

Update CHANGELOG.rst

74d167c

jbloom approved these changes Nov 11, 2019


jbloom merged commit d658fed into master Nov 11, 2019

jbloom deleted the syn_selection_updates branch November 11, 2019 13:32
