Skip to content

afelten-Anses/QuickPhylo

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

55 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

README

Authors: Pauline Barbet, Arnaud Felten

Affiliation: Food Safety Laboratory - ANSES Maisons Alfort (France)

You can find the latest version of the tool at https://github.com/afelten-Anses/QuickPhylo

Workflow

This workflow aims to fastly build a root tree with Mash and dendropy. To make the root tree, two methods are available: UPGMA and Neighbourg-Joining (NJ).

The script takes in input assembly, reads (compressed or not) and sketch files.

The Mash sketch function, sketch each fasta and fastq files into msh files.

Then, the Mash dist function create a distance matrix based on jaccard index at tsv format with previous sketch files and files already sketch.

Finally dendropy produce in output a newick file with the root tree.

Dependencies

The script has been developed with python 2.7 (tested with 2.7.12)

External dependencies

Install with conda

conda install -c afelten fastosh

Parameters

Parameters

  • -i : tsv file containing paths to reads, asssembly or/and sketched files, more than 2 (REQUIRED)
  • -o : output tsv matrix name (default:output)
  • -T : maximum number of threads to use (default:1)
  • -k : k-mer size for sketching (default:15)
  • -s : sketch size = number of k-mer (default:1000)
  • -e : output newick tree name (default:output)

Options

  • --S : suppress sketch files (default:False)
  • --UPGMA : use UPGMA algorithm (default:NJ)

Test

After installing Mash and dendropy you can test the script with the command lines:

mkdir test
cd test
python FasTosh -T nbThreads -i input.tsv -o matrix_name -e tree_name

About

Create tree from assemblies with Mash

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages