-
Notifications
You must be signed in to change notification settings - Fork 3
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
How to select the neighbors genomes? #13
Comments
We are developing the package neighbors for that. The Tutorial in its documentation might help you complete a given set of targets and to find the corresponding neighbors. |
Thank you for your reply. Is this package currently available? I basically choose other genomes of the same genus as the neighbors, maybe I need to change the - q parameter. |
Yes, that package is available. A common problem with fur is mixture between target and neighbor genomes. To make sure this doesn't happen, one should reconstruct a phylogeny from the genomes. If you'd like to see how this is done, take a look at the tutorial in the neighbors package. |
Is there a way to automatically correct the target and neighbor genomes? |
There's a semi-automatic way, which scales well to hundreds of bacterial genomes. It is described in the tutorial of the neighbors package. It consists of the following steps:
If this sounds promising to you, take a look at the neighbors tutorial for the details. |
Thank you for your reply. |
It is a good question, here is a sketch of an answer: Let's say the leaves in the tree are labelled with prefix 'n' for taxonomic neighbors and prefix 't' for taxonomic targets, as explained in the neighbors tutorial. We are looking for the node, v, that maximizes the number of targets in the subtree rooted on v and the number of neighbors elsewhwere. For example, the tree file
We sort the scores to find that the desired node is 300, which is also the one I'd pick from visual inspection.
Hope this helps. |
Just in case this is still of interest, the Neighbors package now also contains the program |
Hi,
Thanks for developing this software.
I would like to ask how to select the neighbors genomes. Sometimes there are no sequences in the results of fur.
Thanks.
The text was updated successfully, but these errors were encountered: