Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Mapping subclones loss/gains #16

Open
angkoomin opened this issue Apr 6, 2022 · 5 comments
Open

Mapping subclones loss/gains #16

angkoomin opened this issue Apr 6, 2022 · 5 comments

Comments

@angkoomin
Copy link

Hi there,

I was wondering how do I manually curate the loss/gains from the HMM*.pred_cnv_regions.dat file with the *cell_groupings file? How do I match each branches of the phylogenetic tree?

image

Thanks

@stasvolik
Copy link

I am very interested in this problem as well.

@angkoomin
Copy link
Author

Unfortunately, I'm still asking around about how to map the subclones. Please feel free to share your thoughts if there are any as I'm still stuck for the past month with this issue.

@Silvia-Bio
Copy link

Silvia-Bio commented Jul 4, 2023

Check out the "inferCNV-postprocess.r.txt" script here. You might have to adjust a few things but it can be used to achieve what you asked.

@starfallin
Copy link

Check out the "inferCNV-postprocess.r.txt" script here. You might have to adjust a few things but it can be used to achieve what you asked.

It really works! Thank you so much. Could you please tell me why you choose state 4 as cut off when determine the gain or loss event during creation of final_cnv file? And I notice usually the authors use LOH instead of loss to describe the cnv event. Why? Sincerely

@Silvia-Bio
Copy link

Hi @starfallin,

I'm glad you found the script useful! I am not sure I'll be able to answer your questions correctly, but I will try.

Infercnv doesn't provide exact copy numbers, instead it classifies genomic windows into categories (neutral, gains or loss), which are then given a numerical 6-State model. A diploid state is represented by State 3, and gains are represented by States higher than 4 depending on the number of copies gained. So, in the "inferCNV-postprocess.r.txt" script, this line "final_cnv$event = ifelse(final_cnv$state >=4, "gain", "loss")" is designed to categorise genomic regions with a State of 4 or higher as gains, and anything less as a loss.

About your second question, the term "LOH" specifically refers to a situation where one of two alleles at a locus is lost. When discussing CNV events in the context of inferCNV, which works with RNA data and provides categorical states like I mentioned above, the term "loss" might be more appropriate because it broadly denotes a decrease in copy number. Using "LOH" might not be accurate in this context, especially given the inherent limitations and inaccuracies of CNV calling from RNA data compared to DNA-based methods.

Hope something here helps!

KR,

S.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

4 participants