Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

-ignore_user_sites,how to treat these region? #31

Closed
YangChaoCN opened this issue Apr 25, 2016 · 2 comments
Closed

-ignore_user_sites,how to treat these region? #31

YangChaoCN opened this issue Apr 25, 2016 · 2 comments

Comments

@YangChaoCN
Copy link

YangChaoCN commented Apr 25, 2016

Dear Xavier/Daniel,

I want to know the exact meaning of -ignore_user_sites, is the region ignored involved in subsequent analysis or not ?

If the region is discarded, is there any difference between an alignment which the region is striped and an alignment with the -ignore_user_sites option?

You recommend that we should extract the core from the file generated with mauve, is this means that I had better use core genome sequence alignment, not the complete genome sequence alignment with -ignore_user_sites option to ignore non-core sites ?

Best wishes,
YangChao

@xavierdidelot
Copy link
Owner

Dear YangChao,

Ignoring a region via the "-ignore_user_sites" option is not the same thing as removing the region from your input file. The difference is that if you removed the region from your input file, the two flanking regions would become attached in the input, so that linkage would considered high between them and recombination events would be likely to stretch across them. When you ignore a region using "ignore_user_sites", the linkage between the flanks is kept consistent with the length of the ignored region. So for example if the ignore region is very large, linkage will be low between the flanks and a recombination event will not be allowed to stretch across the ignored region. So the results will be more biologically meaningful. The output files are also easier to interpret, since the location of sites in the output is the same as the location of sites in the input, no matter which sites were ignored.

If you work with Mauve, I would recommend first to extract the core regions using the stripSubsetLCBs utility, and then use the resulting xmfa file as an xmfa input into ClonalFrameML, which requires to use the "-xmfa_file true" option but not the "ignore_user_sites" option.

Best wishes,
Xavier

@YangChaoCN
Copy link
Author

Thank you so much for your help!

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants