The file 16S_Thaum_aligned_cgr.fasta possesses the fasta sequences corresponding to the Thaumarchaeota groups outlined in Vico Oton, 2015 (https://doi.org/10.1038/ismej.2015.101) and used in Sheridan et al 2022 (https://www.biorxiv.org/content/10.1101/2023.03.08.531495v1).
Alignments and Phylogenies directories contain the supermatrix alignments and the reconstructed phylogenies for:
Dataset 1: 19 Gagatemarchaeaceae genomes, two UBA141-like genomes and three AOA
Dataset 2: 64 genomes of Thaumarchaeota and related species (completeness > 45%, contamination < 10%)
Dataset 3: 52 higher-quality genomes of Thaumarchaeota and closely related species (completeness > 70%, contamination < 5%)