Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

How to measure reliability when results are unstable? #107

Closed
xinkwu opened this issue Mar 15, 2020 · 1 comment
Closed

How to measure reliability when results are unstable? #107

xinkwu opened this issue Mar 15, 2020 · 1 comment

Comments

@xinkwu
Copy link

xinkwu commented Mar 15, 2020

Hi Richard,

When I use treetime to infer the evolution relationship of virus sequences, I find that the results of multiple runs are quite different.
Here are the real running results for 8 times [the R squares are shown here: 0.29, 0.28, 0.28, 0.27, 0.28, 0.28, 0.26, 0.28], and the topology of timetree.nexus is different each time.

  1. Is this result due to too similar sequences between viruses? How do I measure the reliability of the results? Is it possible to filter the R square of the results of multiple runs and choose the highest one as the final result?

  2. Does the selective deletion of some sequences help improve the reliability of the results? Some samples may have problems such as repeated sampling leading to sequence redundancy, some may lack date information, some sequences may be of poor quality, and so on.

Here is the command I run TreeTime:
treetime --aln virus.fasta --tree virus.nwk --dates metadata.csv --gtr infer --reroot best --covariation --confidence --reconstruct-tip-states --max-iter 10 --outdir resultTreetime

Any helpful answers are welcome.

Kai

@rneher
Copy link
Member

rneher commented May 10, 2020

the variability here likely comes from random assignment of mutations to branches on either side of the root. Randomness can also arise when your tree has polytomies that are resolved by treetime and there are equal possibilities. If you send me the data, I'd be happy to have a look.

@xinkwu xinkwu closed this as completed Jun 5, 2020
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants