-
Notifications
You must be signed in to change notification settings - Fork 67
Planned Analysis: Co-Occurence / Mutual Exclusivity #13
Comments
Would we be looking to have something like this figure for this analysis, but with the disease type labels? This is from Rotika et al, 2019 doi: http://dx.doi.org/10.1101/566455. |
And after #19 is done, the molecular subtypes could also be added. |
One of my initial questions is how we'd like to combine results that measure similar things. For example, we have two fusions results. Do we want only take the common fusions from both files?
And if I understand correctly, these all have results that overlap:
Note that I'm still digging into all these files and determining what's here, so after I do some initial analyses I may come back with a suggestion on how to handle this, but if someone knows this data better and has an idea of what we want to do first, then I'm of course open to those suggestions. |
We actually have created a high-confidence set of calls by integrating the two fusion callers - will add that progress to #10 soon! The same should be done with the SNVs and SVs (Lumpy SV data yet to come). |
Okay. So I should hold off on doing this until those integrated results come back? |
I think you can integrate the SNV results and do the mutual exclusivity/co-occurrance analysis on SNVs more immediately! |
Okay. How should I integrate those? Sounds like you guys have already done some work on this? |
This figure is related to #6. For this issue, we would look for something similar to |
We do not yet have this automated, so you can come up with the total merge of mutations per sample based on the two mutation algorithms, Mutect2 and Strelka2 and then start with those. It may also be a good idea to investigate mutations present in only one algorithm for potential artifacts (some will be real), but this may be constitute another issue and may cause these issues dependency on that - thoughts, @cgreene ? |
Sounds like I should do some initial analyses to see how much overlap there is between Mutext2 and Strelka2 and then I'll report back and we can try to make some further decisions. |
Yes, sounds great. Strelka may find more lower frequency variants, which can still be real, so we just have to assess whether these are real and potentially oncogenic and if so, keep! |
Maybe it'd be good to add a new issue for:
This may be helpful to keep discussion within the issue focused on a single topic. |
created #30 |
I'm going to start working on this, aiming to build a figure similar to the one @jharenza presented: For now I will used the strelka2 data, but it should be flexible enough to feed in whatever final set of data we would be interested in. I was planning to allow production alternative plots for different VAF cutoffs, as well as grouping by gene, filtering by effect, mutation type, etc. |
I'm currently working on adding processing of CNV outputs for these analyses. The current plan is to make plots with SNV only, CNV only, and SNV + CNV. For initial analysis, I am taking each gene + loss or gain status (broadly interpreted) as the mutation unit for each sample. However, there are a few issues worth discussing before finalizing the analysis.
|
Closing all planned analysis tickets in favor of opening new proposed analysis/updated analysis tickets as needed. |
Determining genetic lesions (Mutation, CNV, Fusion) and/or pathways which co-occur or are mutually exclusive across the PBTA. This could help associate lesions with pathways or define potential synthetic lethality relationship.
The text was updated successfully, but these errors were encountered: