Using FASTA Input

If you want to run augur on viral data or bacterial SNP data, you will probably start from sequence data in FASTA format.

Sequence data

Your sequence data should

consist of homologous sequences that can be aligned unambiguously
contain sufficient diversity to allow reliable tree reconstruction
be of similar length. Mixing short sequences (300 bp) with much longer ones (10,000 bp) often yields unexpected results.
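
The length requirement can be checked before running augur. The following is a minimal sketch, not part of augur itself: the helper names `read_fasta` and `length_outliers` and the `tolerance` threshold of 0.5 are assumptions chosen for illustration. It parses FASTA text with the standard library only and reports sequences whose length is far from the median length.

```python
from statistics import median


def read_fasta(lines):
    """Parse FASTA-formatted lines into a dict of {name: sequence}.

    Hypothetical helper for illustration; the name is the first
    whitespace-delimited token after '>'.
    """
    parts_by_name = {}
    name = None
    for line in lines:
        line = line.strip()
        if not line:
            continue
        if line.startswith(">"):
            header = line[1:].split()
            name = header[0] if header else ""
            parts_by_name[name] = []
        elif name is not None:
            parts_by_name[name].append(line)
    return {n: "".join(parts) for n, parts in parts_by_name.items()}


def length_outliers(sequences, tolerance=0.5):
    """Return names of sequences whose length differs from the median
    length by more than `tolerance` (a fraction of the median).

    The default of 0.5 is an arbitrary assumption, not an augur setting.
    """
    if not sequences:
        return []
    mid = median(len(seq) for seq in sequences.values())
    return [
        name
        for name, seq in sequences.items()
        if abs(len(seq) - mid) > tolerance * mid
    ]


# Small in-memory example: two sequences of 10 bases and one of 3 bases.
example = """>seq_a
ACGTACGTAC
>seq_b
ACGTACGTAA
>seq_c
ACG
""".splitlines()

records = read_fasta(example)
print(length_outliers(records))  # ['seq_c']
```

To run the same check on a file, pass an open file handle to `read_fasta` instead of the example lines. Flagged sequences are candidates for manual review, since whether to drop them depends on the data set.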