-
Notifications
You must be signed in to change notification settings - Fork 67
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Filtering and interpreting results #62
Comments
Feel free to bombard away since it could help to improve the package. You may not want to filter away all the lcn.em=NA segments. There can be clear focal amplifications i.e. large tcn.em but due to the sparsity of het SNPs we can't determine lcn unambiguously. cnlr.median is the segment specific estimate. For the clust suffix segments that are similar are combined and given a cluster specific estimate. So they ought to be very similar. cnlr.median by itself is not informative since it depends on library size, ploidy etc. As you said the right quantity to use is cnlr.median - dipLogR. Venkat |
...Following this up... What about segments where tcn.em=2 and lcn.em=0? This could be neutral segments if both tumour and normal are homozygote (hence lcn.em=0), so they could be filtered out. However, some of these could be loss-of-heterozygosity segments where the normal does have lcn.em=1 (instead cases where tcn.em != 2 and lcn.em=0 are definitely interesting). I think to distinguish which case we have one needs to look at Dario |
A segment that has tcn=2 and lcn=0 is always copy neutral LOH because in order to assign lcn the segment should have had enough het snps in the normal. If it didn't lcn assignment would have been NA. You can question whether min.nhet is large enough. Venkat |
Indeed, that makes sense. Let's try to recapitulate. Would you agree with this classification?
|
Yes except that tcn=1 automatically implies lcn=0. So (1,NA) shouldn't occur (if it does it's a bug). |
Hi - Thanks! I have a few cases where tcn=0 and lcn=NA, 9 out of 8486 (cumulated on several samples). These are the records, if useful:
|
Thanks. Will fix the bug. |
I have a quick question regarding the ploidy/purity output. Thank you! |
Sorry, I'm bombarding you with questions... I'd like to make sure my understanding is correct.
Typically one wants to filter out segments that are copy-number neutral. For this purpose, is it sufficient to filter out records where tcn.em = 2 or NA and lcn.em = 1 or NA? Is any other filtering recommended? E.g.:
Related question, In order to assess the "strength" or enrichment of a segment I guess a useful metric is
cnlr.median
orcnlr.median.clust
(these two appear to be almost perfectly correlated, so I guess it doesn't matter which one is chosen, any comment on this?) However, the rawcnlr.median
should be adjusted for the logR of the diploid state (found infit$dipLogR
). Otherwise, with high or low ploidy the rawcnlr.median
may be misleading.For example, in the results plotted below segments with logR ~ 0 are still duplicated, so a useful metric to assess enrichment becomes:
fit$cncf$cnlr.median - fit$dipLogR
. Do you agree with this?Thanks a lot again!
Dario
The text was updated successfully, but these errors were encountered: