Keep all variants, add gene set membership check #168

Merged Sep 6, 2018 (12 commits)

@julia326 julia326 commented Aug 8, 2018

No description provided.

@julia326 julia326 requested a review from iskandr August 8, 2018 18:26
_DRIVER_VARIANT = 'cancer_driver_variant'


class GenePathwayCheck:

Something about calling this a "pathway check" seems very odd to me.

Until we stop supporting Python 2, inherit from object:

https://forum.dabeaz.com/t/inheriting-from-object/161
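A minimal sketch of the suggestion above. Only the class name comes from the diff; the empty body is a placeholder. Under Python 2 a class declared without a base is an old-style class, so listing object explicitly is what makes it a new-style class there, while on Python 3 it changes nothing.

```python
class GenePathwayCheck(object):
    """Placeholder body; only the explicit base class matters here."""


# With the explicit base, the class is a new-style class on Python 2
# and an ordinary class on Python 3, so this holds on both versions.
assert isinstance(GenePathwayCheck, type)
assert GenePathwayCheck.__mro__ == (GenePathwayCheck, object)
```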


    @staticmethod
    def is_present(df, value, column):
        return len(df.loc[df[column] == value]) > 0

What about (df[column] == value).any()?

Maybe worth noting that this expects the DataFrame to be small
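To compare the two forms discussed above, here is a small sketch. The function names, the gene_id column, and the IDs are made up for illustration; only the two expressions come from the review. Both versions scan the whole column, so the note about small DataFrames applies to either one.

```python
import pandas as pd


def is_present_len(df, value, column):
    # Form in the diff: build the filtered frame, then count its rows.
    return len(df.loc[df[column] == value]) > 0


def is_present_any(df, value, column):
    # Suggested form: reduce the boolean mask directly, with no filtered copy.
    # bool() converts the NumPy boolean that .any() returns to a plain bool.
    return bool((df[column] == value).any())


# Hypothetical gene-set table (column name and IDs are assumptions).
gene_set = pd.DataFrame({"gene_id": ["G1", "G2", "G3"]})

# The two forms agree for a value that is present and one that is absent.
for candidate in ["G2", "G9"]:
    assert is_present_len(gene_set, candidate, "gene_id") == is_present_any(
        gene_set, candidate, "gene_id"
    )
```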

@@ -67,6 +70,8 @@ def test_mutant_amino_acids_in_mm10_chrX_8125624_refC_altA_pS460I():
        min_alt_rna_reads=1,
        min_variant_sequence_coverage=1,
        variant_sequence_assembly=True)
    core_logic.process_variants()

Can we make this more "functional" by having process_variants just return its results? Or is there other persistent data that gets used later?
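The difference the question above points at can be sketched as follows. Both classes and the upper-casing "processing" step are hypothetical stand-ins, not the project's code; only the method name process_variants comes from the diff.

```python
class StatefulLogic(object):
    """Hypothetical: process_variants stores its output on the instance."""

    def __init__(self, variants):
        self.variants = variants
        self.results = None  # filled in as a side effect

    def process_variants(self):
        self.results = [v.upper() for v in self.variants]


class FunctionalLogic(object):
    """Hypothetical: process_variants returns its output to the caller."""

    def __init__(self, variants):
        self.variants = variants

    def process_variants(self):
        return [v.upper() for v in self.variants]


# Stateful style: the call must come first, then the attribute is read.
stateful = StatefulLogic(["a", "b"])
stateful.process_variants()
stateful_results = stateful.results

# Functional style: the result is available directly from the call.
functional_results = FunctionalLogic(["a", "b"]).process_variants()
```

The returning version avoids a hidden ordering requirement in tests, at the cost of recomputing if callers need the results more than once and do not keep them.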

coveralls commented Aug 8, 2018

Coverage increased (+0.4%) to 86.387% when pulling 43f90cc on keep-all-variants into 379a442 on master.

iskandr commented Aug 22, 2018

Idea for a future branch:

Make the MHC binding and RNA support columns quantitative instead of binary.

Resolved review threads:
vaxrank/gene_pathway_check.py (outdated)
vaxrank/gene_pathway_check.py (outdated)
vaxrank/data/class1-mhc-presentation-pathway.csv (outdated)
vaxrank/cli.py
@julia326 julia326 merged commit 47decce into master Sep 6, 2018
@julia326 julia326 deleted the keep-all-variants branch September 6, 2018 19:59