-
Notifications
You must be signed in to change notification settings - Fork 21
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Keep all variants, add gene set membership check #168
Conversation
…ms around between functions)
…ounts from core logic
… against known gene lists
vaxrank/gene_pathway_check.py
Outdated
_DRIVER_VARIANT = 'cancer_driver_variant' | ||
|
||
|
||
class GenePathwayCheck: |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Something about calling this a "pathway check" seems very odd to me.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Until we stop supporting Python 2, inherit from object
:
vaxrank/gene_pathway_check.py
Outdated
|
||
@staticmethod | ||
def is_present(df, value, column): | ||
return len(df.loc[df[column] == value]) > 0 |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
What about (df[column] == value).any()?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Maybe worth noting that this expects the DataFrame to be small
test/test_mutant_protein_sequence.py
Outdated
@@ -67,6 +70,8 @@ def test_mutant_amino_acids_in_mm10_chrX_8125624_refC_altA_pS460I(): | |||
min_alt_rna_reads=1, | |||
min_variant_sequence_coverage=1, | |||
variant_sequence_assembly=True) | |||
core_logic.process_variants() |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Can we make this more "functional" by having process_variants
just return its results? Or is there other persistent data that gets used later?
Idea for a future branch: Make the MHC binding and RNA support columns quantitative instead of binary. |
No description provided.