Authors: Pauline Barbet, Arnaud Felten
Affiliation: Food Safety Laboratory - ANSES Maisons Alfort (France)
You can find the latest version of the tool at https://github.com/afelten-Anses/QuickPhylo
This workflow aims to fastly build a root tree with Mash and dendropy. To make the root tree, two methods are available: UPGMA and Neighbourg-Joining (NJ).
The script takes in input assembly, reads (compressed or not) and sketch files.
The Mash sketch function, sketch each fasta and fastq files into msh files.
Then, the Mash dist function create a distance matrix based on jaccard index at tsv format with previous sketch files and files already sketch.
Finally dendropy produce in output a newick file with the root tree.
The script has been developed with python 2.7 (tested with 2.7.12)
conda install -c afelten fastosh
- -i : tsv file containing paths to reads, asssembly or/and sketched files, more than 2 (REQUIRED)
- -o : output tsv matrix name (default:output)
- -T : maximum number of threads to use (default:1)
- -k : k-mer size for sketching (default:15)
- -s : sketch size = number of k-mer (default:1000)
- -e : output newick tree name (default:output)
- --S : suppress sketch files (default:False)
- --UPGMA : use UPGMA algorithm (default:NJ)
After installing Mash and dendropy you can test the script with the command lines:
mkdir test
cd test
python FasTosh -T nbThreads -i input.tsv -o matrix_name -e tree_name