MetaSBT v0.1.3 brings the following improvements.
New features
- New option
--uniform-strand
available with theindex
andupdate
modules for processing the input sequences all on the same strand. Mainly used for viral sequences; - New option
--use-representatives
available with theindex
module to use only three representative genomes at the species level; - New option
--resume
available with theindex
andupdate
modules able to resume the index and update processes in case of unexpected errors; - New
expand_fasta.py
utility inscripts
to expand input fasta files into multiple file. One fasta file for each read. Mainly used for viral sequences; - New
fastcluster.py
utility inscript
to compute a average-linkage hierarchical clustering of a set of genomes based on their Mash distances; - Both the
index
andupdate
modules now display a worning message in case the configuration file under--resume
has been previously generated with a different version of MetaSBT; - Both the
index
andupdate
modules now integrateCheckV
andEukCC
for assessing the quality of viruses and eukaryotes; CheckM
has been upgraded toCheckM2
;- The
cluster()
function inutils
is now running in parallel; - The
howdesbt bfdistance
command for computing the distances between bloom filters is now running in parallel.
Fixes
- It correctly checks now for new framework versions when starting a new
metasbt
instance; - Fixed genome quality filtering on completeness and contamination during the
update
; - Improving docstring adopting the numpydoc documentation format.