-
Notifications
You must be signed in to change notification settings - Fork 219
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Showing mutational load #453
Comments
Yes, you can do that. For example if you have two maf objects tcgaCompare(maf = c(x, y), cohortName = c("x", "y")) |
Thank you, is it possible to say statistically if mutational burden differs between two data? |
Not at the moment. But I think it can be done. I will let you know if I can implement it.. |
No. But, one can easily do it with multiple |
Sorry for too questioning But can we get total number of SNV and INDEL separately for each sample? |
Check out > ms = mafSummary(maf = laml)
> ms$variant.type.summary
Tumor_Sample_Barcode DEL INS SNP total
1: TCGA-AB-3009 0 6 36 42
2: TCGA-AB-2807 2 0 27 29
3: TCGA-AB-2927 0 2 25 27
4: TCGA-AB-2959 0 0 27 27
5: TCGA-AB-3002 0 0 27 27
---
189: TCGA-AB-2903 0 0 1 1
190: TCGA-AB-2909 0 1 0 1
191: TCGA-AB-2933 0 0 1 1
192: TCGA-AB-2942 0 1 0 1
193: TCGA-AB-2954 0 0 1 1 However, it returns summaries for everything (synonymous and nonsynonymous). |
Sorry tcgaCompare gives median of mutations per Mb for each maf, is there any way to access to number of mutations per Mb for each sample in maf object? |
I am working on it. I will make the function to return pairwise t-test results and mutation load per sample. Should be done soon.. |
Great job and the best of lucks |
Okay, could you try again? |
Thanks; this is simply amazing |
Sorry for my data
What is MNP? |
They are |
Sorry The newly implemented pairwise t-test in tcgaCompare function compares |
Hi, > x_log10 = tcgaCompare(maf = laml, cohortName = "AML")
Performing pairwise t-test for differences in mutation burden..
> x_log10_mb = tcgaCompare(maf = laml, cohortName = "AML", capture_size = 50)
Capture size [TCGA]: 50
Capture size [Input]: 50
Performing pairwise t-test for differences in mutation burden (per MB).. Log10 is used only for visualisation and nothing else. |
Thank you. My actual question roots from here whether t test is suitable for total count comparison among groups or not, I then asked about log because I thought by that you try to make the distribution of number of mutations per Mb close to a normal distribution for using t test. Somewhere I have seen people for comparing such a raw count use wilcox test that is why I was seeking for a clue for type of test. However thank you again |
Hello @PoisonAlien For getting the number of non synonymous mutations per sample
Does total give non synonymous mutations or I should add up |
Output from |
Hello @PoisonAlien I am struggling with mutation per sample Please have a look by
And by
You are seeing the median of mutation per sample differs if dashboard argument gets FALSE or TRUE Am I right? |
Hi, |
Yes @PoisonAlien the same objects as I only have one maf object in environment I also put This is my R |
Seems like you have some faulty values in ref and alt alleles. Please do check and fix them. |
Sorry @PoisonAlien , I got worry about my data, what should I check and correct in my data? |
For SNVs, your data seems to have non A,T,G,C values. Hence the warning.. |
Sorry @PoisonAlien you are right
You think what is the possible source of problem where I have done wrong? I really put a hug amount of time to gather this data and this made me really panicked if I have to do everything from scratch and throw away any conclusion Can I simply remove any |
I am not sure how you generated your MAFs. I would investigate the source data used for generating these maf files. |
I have used Annovar for annotation them annovar2maf function :( |
Yes, check annovar output files for these loci. You can share an example if you think |
Thank you @PoisonAlien This is one of my Annovar files directly fed into annovar2maf However, before getting Andover I had to do some manipulation one
Then
|
I just checked one Annovar sample having |
Your example file looks fine after processing with > x = maftools::annovarToMaf(annovar = "~/Downloads/annovar.txt")
> x[Variant_Type %in% "SNP"][,.N,.(Reference_Allele, Tumor_Seq_Allele2)]
Reference_Allele Tumor_Seq_Allele2 N
1: C T 3912
2: G A 3870
3: T A 1619
4: C A 2328
5: A G 3050
6: T G 3800
7: T C 3054
8: G T 2344
9: G C 757
10: A T 1620
11: C G 752
12: A C 3569 |
Sorry @PoisonAlien is it possible that happened at this step?
I remember I was doing something like
:( |
Hi
I use
tcgaCompare
function to look at mutational load in my data vs TCGA but is there any way to do the same for mine different types of data? Please let's say I have separate maf objects for patients before and after chemotherapy, so that would be nice to look at mutational load in two data side by sideThank you
The text was updated successfully, but these errors were encountered: