Ready-made Taxonomic Weights Repository
Searchable Inventory of readytowear taxonomic weights.
If you use any materials in readytowear, please cite:
Kaehler BD, Bokulich NA, McDonald D, Knight R, Caporaso JG, Huttley GA. 2019. Species-level microbial sequence classification is improved by source-environment information. Nature Communications 10: 4643. https://doi.org/10.1038/s41467-019-12669-6
Please also cite (as this classifier was used to provide taxonomic labels for the class weights):
Bokulich NA, Kaehler BD, Rideout JR, Dillon M, Bolyen E, Knight R, Huttley GA, Caporaso JG. 2018. Optimizing taxonomic classification of marker gene sequences. Microbiome 6(1): 90. doi: https://doi.org/10.1186/s40168-018-0470-z.
And finally do not forget to cite the reference database used (citations for individual reference databases are located in the appropriate
How to use the readytowear collection
NOTE: The readytowear collection currently only includes taxonomic weights generated for 16S rRNA gene sequence data. Hence, the collection currently does not include weights for other marker genes. We may accommodate these others needs in future releases, and encourage community contributions (contribution instructions coming soon). In the mean time, if you use non-16S rRNA gene data and wish to use bespoke classifiers, assemble your own custom taxonomic weights with q2-clawback as described here
q2-feature-classifier is a plugin for QIIME 2, and hence QIIME 2 must be installed to use. Before beginning this tutorial, install and activate your QIIME 2 environment.
Clone readytowear to get started:
git clone https://github.com/BenKaehler/readytowear.git
Train a non-saline soil naive Bayes taxonomy classifier using the latest readytowear fashions:
qiime feature-classifier fit-classifier-naive-bayes \ --i-reference-reads readytowear/data/gg_13_8/515f-806r/ref-seqs-v4.qza \ --i-reference-taxonomy readytowear/data/gg_13_8/515f-806r/ref-tax.qza \ --i-class-weight readytowear/data/gg_13_8/515f-806r/soil-non-saline.qza \ --o-classifier gg138_v4_soil-non-saline_classifier.qza
Now this classifier is ready to use! Classify a set of query sequences contained in a FASTA format file as follows:
qiime tools import \ --input-path sequences.fna \ --output-path sequences.qza \ --type 'FeatureData[Sequence]' qiime feature-classifier classify-sklearn \ --i-reads sequences.qza \ --i-classifier gg138_v4_soil-non-saline_classifier.qza \ --o-classification bespoke-classifier-results.qza qiime metadata tabulate \ --m-input-file bespoke-classifier-results.qza \ --m-input-file sequences.qza \ --o-visualization bespoke-classifier-results.qzv