Skip to content

UNITE v2.0-ref

Compare
Choose a tag to compare
@terrimporter terrimporter released this 12 Oct 18:15
· 5 commits to main since this release

The files used to train the RDP Classifier v2.13 . Sequences and taxonomy are largely based on the UNITE + INSD v8.3 full dataset for eukaryotes. Taxonomic adjustments were made to resolve unknown and non-unique taxa into a strictly hierarchical taxonomy. Sequences were dereplicated, only unique sequences retained, to reduce dataset size. Compressed file size 167.5 Mb, decompressed file size 1.2Gb.