Planned Analysis: Co-Occurence / Mutual Exclusivity #13

PichaiRaman · 2019-07-12T15:04:03Z

Determine which genetic lesions (mutation, CNV, fusion) and/or pathways co-occur or are mutually exclusive across the PBTA. This could help associate lesions with pathways or define potential synthetic lethality relationships.
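For illustration, one common way to test a pair of lesions for co-occurrence or mutual exclusivity is Fisher's exact test on a 2x2 table of sample counts. The sketch below is a minimal standard-library version, not the method chosen for this analysis; the function names and argument order are hypothetical.

```python
from math import comb


def fisher_exact_two_sided(both, only_a, only_b, neither):
    """Two-sided Fisher's exact test on a 2x2 table of sample counts.

    both    : samples carrying lesion A and lesion B
    only_a  : samples carrying A but not B
    only_b  : samples carrying B but not A
    neither : samples carrying neither lesion
    """
    with_a = both + only_a
    with_b = both + only_b
    n = both + only_a + only_b + neither
    denom = comb(n, with_b)

    def prob(x):
        # Hypergeometric probability that exactly x samples carry both lesions
        return comb(with_a, x) * comb(n - with_a, with_b - x) / denom

    lowest = max(0, with_a + with_b - n)
    highest = min(with_a, with_b)
    p_observed = prob(both)
    # Sum over all tables that are no more likely than the observed one
    p_value = sum(
        prob(x)
        for x in range(lowest, highest + 1)
        if prob(x) <= p_observed * (1 + 1e-9)
    )
    return min(1.0, p_value)


def odds_ratio(both, only_a, only_b, neither):
    """Odds ratio with a 0.5 continuity correction to avoid division by zero."""
    return ((both + 0.5) * (neither + 0.5)) / ((only_a + 0.5) * (only_b + 0.5))
```

An odds ratio above 1 points toward co-occurrence and below 1 toward mutual exclusivity; the p-value indicates how surprising the observed overlap is given how often each lesion occurs on its own. With many lesion pairs, some multiple-testing correction would also be needed.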

cansavvy · 2019-07-25T19:03:07Z

Would we be looking to have something like this figure for this analysis, but with the disease type labels? This is from Rotika et al, 2019 doi: http://dx.doi.org/10.1101/566455.

cansavvy · 2019-07-25T19:03:57Z

And after #19 is done, the molecular subtypes could also be added.

cansavvy · 2019-07-25T19:12:55Z

One of my initial questions is how we'd like to combine results that measure similar things.

For example, we have two sets of fusion results. Do we want to take only the fusions common to both files?

arriba.fusions.tsv
star-fusion.fusions.tsv

And if I understand correctly, these all have results that overlap:

strelka2.maf
manta-sv.maf
mutect2.maf

Note that I'm still digging into all these files and determining what's here, so after I do some initial analyses I may come back with a suggestion on how to handle this, but if someone knows this data better and has an idea of what we want to do first, then I'm of course open to those suggestions.

jharenza · 2019-07-25T19:17:12Z

> One of my initial questions is how we'd like to combine results that measure similar things.
>
> For example, we have two sets of fusion results. Do we want to take only the fusions common to both files?
> arriba.fusions.tsv
> star-fusion.fusions.tsv
> And if I understand correctly, these all have results that overlap:
> strelka2.maf
> manta-sv.maf
> mutect2.maf
> Note that I'm still digging into all these files and determining what's here, so after I do some initial analyses I may come back with a suggestion on how to handle this, but if someone knows this data better and has an idea of what we want to do first, then I'm of course open to those suggestions.

We actually have created a high-confidence set of calls by integrating the two fusion callers - will add that progress to #10 soon! The same should be done with the SNVs and SVs (Lumpy SV data yet to come).

cansavvy · 2019-07-25T19:18:18Z

Okay. So I should hold off on doing this until those integrated results come back?

jharenza · 2019-07-25T19:31:10Z

> Okay. So I should hold off on doing this until those integrated results come back?

I think you can integrate the SNV results and do the mutual exclusivity/co-occurrence analysis on SNVs more immediately!

cansavvy · 2019-07-25T19:35:25Z

Okay. How should I integrate those? Sounds like you guys have already done some work on this?

jharenza · 2019-07-25T23:15:57Z

> Would we be looking to have something like this figure for this analysis, but with the disease type labels? This is from Rotika et al, 2019 doi: http://dx.doi.org/10.1101/566455.

This figure is related to #6. For this issue, we would look for something similar to example-plot.pdf (created with maftools for another solid tumor dataset I had) for specific histologies, or another way of visualizing or tabulating statistical tests of these mutation relationships. First discover those relationships, then summarize them. This may depend on #19, but we can give it a go with broader histologies or start with one, for example medulloblastoma, high-grade glioma, or low-grade glioma.

jharenza · 2019-07-25T23:19:59Z

> Okay. How should I integrate those? Sounds like you guys have already done some work on this?

We do not yet have this automated, so you can come up with the total merge of mutations per sample based on the two mutation algorithms, Mutect2 and Strelka2, and then start with those. It may also be a good idea to investigate mutations called by only one algorithm for potential artifacts (some will be real), but this may constitute another issue and make this issue dependent on it. Thoughts, @cgreene?
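As a rough illustration of the "total merge" idea, the sketch below takes the union of calls from several callers and records which callers support each variant, so single-caller calls can be pulled out for review. It assumes each call is a dict keyed by standard MAF column names; the helper names are hypothetical, and exact matching on position and alleles may miss equivalent indels that callers represent differently.

```python
from collections import defaultdict


def variant_key(record):
    """Identify a call by sample, position and alleles (assumed MAF columns)."""
    return (
        record["Tumor_Sample_Barcode"],
        record["Chromosome"],
        record["Start_Position"],
        record["Reference_Allele"],
        record["Tumor_Seq_Allele2"],
    )


def merge_calls(calls_by_caller):
    """Union of all calls, mapping each variant to the callers that reported it."""
    support = defaultdict(set)
    for caller, records in calls_by_caller.items():
        for record in records:
            support[variant_key(record)].add(caller)
    return dict(support)


def split_by_support(support, n_callers):
    """Separate variants reported by every caller from those reported by fewer."""
    shared = {key for key, callers in support.items() if len(callers) == n_callers}
    partial = {
        key: callers for key, callers in support.items() if len(callers) < n_callers
    }
    return shared, partial
```

Calling `merge_calls({"mutect2": mutect2_records, "strelka2": strelka2_records})` and then `split_by_support(support, 2)` would give the shared set plus a per-variant record of which caller made each unshared call.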

cansavvy · 2019-07-29T12:04:46Z

Sounds like I should do some initial analyses to see how much overlap there is between Mutect2 and Strelka2 and then I'll report back and we can try to make some further decisions.
I will hold off for now on combining the fusion results since it sounds like you are working on this.

jharenza · 2019-07-29T12:28:42Z

> Sounds like I should do some initial analyses to see how much overlap there is between Mutect2 and Strelka2 and then I'll report back and we can try to make some further decisions.
> I will hold off for now on combining the fusion results since it sounds like you are working on this.

Yes, sounds great. Strelka2 may find more low-frequency variants, which can still be real, so we just have to assess whether these are real and potentially oncogenic and, if so, keep them!

cgreene · 2019-07-29T12:32:50Z

Maybe it'd be good to add a new issue for:

Evaluate concordance between Mutect2 and Strelka2 and decide on next steps.

This may be helpful to keep discussion within the issue focused on a single topic.

jharenza · 2019-07-29T12:37:04Z

created #30

jashapiro · 2019-10-08T18:02:02Z

I'm going to start working on this, aiming to build a figure similar to the one @jharenza presented:

For now I will use the strelka2 data, but it should be flexible enough to feed in whatever final set of data we would be interested in.

I was planning to allow production of alternative plots for different VAF cutoffs, as well as grouping by gene, filtering by effect, mutation type, etc.
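A minimal sketch of what such filtering and grouping could look like, assuming MAF-style records as dicts with the read-count columns `t_alt_count` and `t_ref_count` plus `Variant_Classification` and `Hugo_Symbol`; the function names and the example cutoff are hypothetical, not the parameters of the planned script.

```python
def vaf(record):
    """Variant allele fraction computed from alt and ref read counts."""
    alt = int(record["t_alt_count"])
    ref = int(record["t_ref_count"])
    depth = alt + ref
    return alt / depth if depth > 0 else 0.0


def filter_calls(records, min_vaf=0.0, keep_classes=None):
    """Keep calls at or above a VAF cutoff and, optionally, with selected effects."""
    kept = []
    for record in records:
        if vaf(record) < min_vaf:
            continue
        if keep_classes is not None and record["Variant_Classification"] not in keep_classes:
            continue
        kept.append(record)
    return kept


def mutated_genes_by_sample(records):
    """Group calls by gene: map each sample to the set of genes with a kept call."""
    genes = {}
    for record in records:
        genes.setdefault(record["Tumor_Sample_Barcode"], set()).add(record["Hugo_Symbol"])
    return genes
```

Running the same downstream plot on `filter_calls(records, min_vaf=0.05)` and `filter_calls(records, min_vaf=0.2)` would be one way to compare cutoffs.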

jashapiro · 2019-11-06T15:52:43Z

I'm currently working on adding processing of CNV outputs for these analyses. The current plan is to make plots with SNV only, CNV only, and SNV + CNV. For initial analysis, I am taking each gene + loss or gain status (broadly interpreted) as the mutation unit for each sample. However, there are a few issues worth discussing before finalizing the analysis.

In the current analysis (CNVkit), almost every gene has a gain or loss in at least one sample. I expect this to be improved by Proposed Analysis: Copy number consensus calls #128, but another part of the issue may be the use of broadly defined gene locations in Proposed Analysis: map from SEG file to genes (and broader segments) #186 / PR 1 of 3: focal CN file preparation #195. Restricting those calls to exons may also improve things.
Large losses or gains will necessarily result in many highly correlated CNV calls. How to display these remains an open question, and I would welcome any thoughts or suggestions. This will likely wait on the conclusion of Proposed Analysis: map from SEG file to genes (and broader segments) #186.
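To make the "gene + loss or gain status" unit concrete, here is a small sketch, under assumed input shapes, of how SNV and CNV calls could be turned into per-sample sets of mutation units and then counted for a pair of units. The tuple layouts, status strings, and function names are all hypothetical and are not taken from the analysis code.

```python
def cnv_events(cnv_calls):
    """Map sample -> {(gene, status)} for calls whose status is 'gain' or 'loss'.

    cnv_calls: iterable of (sample, gene, status) tuples (assumed layout).
    """
    events = {}
    for sample, gene, status in cnv_calls:
        if status in ("gain", "loss"):
            events.setdefault(sample, set()).add((gene, status))
    return events


def snv_events(snv_calls):
    """Map sample -> {(gene, 'snv')} from (sample, gene) tuples (assumed layout)."""
    events = {}
    for sample, gene in snv_calls:
        events.setdefault(sample, set()).add((gene, "snv"))
    return events


def combine(*event_maps):
    """Merge several sample -> units maps, e.g. SNV + CNV."""
    combined = {}
    for event_map in event_maps:
        for sample, units in event_map.items():
            combined.setdefault(sample, set()).update(units)
    return combined


def contingency_counts(events, unit_a, unit_b, samples):
    """Count samples with both units, only A, only B, or neither."""
    both = only_a = only_b = neither = 0
    for sample in samples:
        units = events.get(sample, set())
        has_a, has_b = unit_a in units, unit_b in units
        if has_a and has_b:
            both += 1
        elif has_a:
            only_a += 1
        elif has_b:
            only_b += 1
        else:
            neither += 1
    return both, only_a, only_b, neither
```

Passing only `snv_events(...)`, only `cnv_events(...)`, or `combine(...)` of the two would correspond to the SNV only, CNV only, and SNV + CNV variants. Note that genes sharing one large gain or loss will produce near-identical counts here, which is the correlation problem raised above; this sketch does nothing to address it.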

jaclyn-taroni · 2020-03-09T18:32:05Z

Closing all planned analysis tickets in favor of opening new proposed analysis/updated analysis tickets as needed.

cgreene added the good first issue label Jul 14, 2019

cansavvy self-assigned this Jul 25, 2019

jharenza removed the good first issue label Jul 28, 2019

jharenza closed this as completed Jul 29, 2019

jharenza reopened this Jul 29, 2019

jashapiro assigned jashapiro and unassigned cansavvy Oct 8, 2019

jashapiro added the in progress label Oct 9, 2019

jashapiro mentioned this issue Oct 11, 2019

Analyses including multiple samples from the same individual #155

Closed

jaclyn-taroni added the cnv, fusion, and snv labels Oct 26, 2019

jashapiro mentioned this issue Oct 29, 2019

Initial Co-occurence plots #181

Merged

1 task

jaclyn-taroni mentioned this issue Oct 30, 2019

Proposed Analysis: map from SEG file to genes (and broader segments) #186

Closed

cbethell mentioned this issue Oct 31, 2019

PR 1 of 3: focal CN file preparation #195

Merged

8 tasks

jashapiro mentioned this issue Nov 4, 2019

Co-occurence plot by disease #227

Merged

jaclyn-taroni closed this as completed Mar 9, 2020
