# Phylogenetic Tree

In [1]:
import os # standard library (Python 3.9.19)
import sys # standard library (Python 3.9.19)
import pandas as pd # version: 2.2.2
import qiime2 as q2 # version: 2024.5.0
from qiime2 import Visualization
import matplotlib.pyplot as plt # version: 3.8.4
import seaborn as sns # version: 0.12.2
%matplotlib inline

# Define the data directory
data_dir = './data'

## Creating a phylogenetic tree

#### First we align our sequences

In [3]:
! qiime alignment mafft \
    --i-sequences ./data/feature_tables_dada/rep-seqs.qza \
    --o-alignment ./data/phylogeny/aligned-rep-seqs.qza

Saved FeatureData[AlignedSequence] to: ./data/phylogeny/aligned-rep-seqs.qza
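
A quick sanity check on an alignment is that every row has the same length, since MAFFT pads sequences with gap characters. The sketch below parses FASTA text and checks that property. The helper names `parse_fasta` and `alignment_width` are hypothetical, and the toy records are made up for illustration; to run it on the real data, the artifact would first have to be exported to FASTA (for example with `qiime tools export`).

```python
def parse_fasta(text):
    """Parse FASTA-formatted text into a {sequence_id: sequence} dict."""
    records = {}
    current_id = None
    for line in text.splitlines():
        line = line.strip()
        if not line:
            continue
        if line.startswith(">"):
            # Header line: the ID is the first whitespace-separated token.
            current_id = line[1:].split()[0]
            records[current_id] = ""
        elif current_id is not None:
            # Sequence lines may be wrapped, so append to the current record.
            records[current_id] += line
    return records


def alignment_width(records):
    """Return the common row length, or raise if rows differ in length."""
    widths = {len(seq) for seq in records.values()}
    if len(widths) != 1:
        raise ValueError(f"rows have unequal lengths: {sorted(widths)}")
    return widths.pop()


# Toy aligned records (made up for illustration, not taken from rep-seqs).
toy_alignment = """>feature_a
ACG-TTGA
>feature_b
ACGATT-A
>feature_c
AC--TTGA
"""

records = parse_fasta(toy_alignment)
print(alignment_width(records))
```

If the rows had different lengths, `alignment_width` would raise a `ValueError`, which would indicate that the input is not a valid alignment.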

#### Alignment masking

In [4]:
! qiime alignment mask \
    --i-alignment ./data/phylogeny/aligned-rep-seqs.qza \
    --o-masked-alignment ./data/phylogeny/masked-aligned-rep-seqs.qza

Saved FeatureData[AlignedSequence] to: ./data/phylogeny/masked-aligned-rep-seqs.qza
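
Masking drops alignment columns that are mostly gaps or poorly conserved, because such columns add noise to the tree. The sketch below illustrates the idea with a simplified column filter. It is not the plugin's implementation: `mask_alignment` is a hypothetical helper, and conservation is approximated here as the fraction of rows sharing the most common non-gap character. The threshold values are assumed to match the defaults of `qiime alignment mask` (`--p-max-gap-frequency` and `--p-min-conservation`); check `qiime alignment mask --help` to confirm them.

```python
from collections import Counter

GAP = "-"


def mask_alignment(rows, max_gap_frequency=1.0, min_conservation=0.4):
    """Keep only columns that pass both the gap and conservation thresholds.

    Conservation is a simplified stand-in: the share of rows that carry
    the most common non-gap character in the column.
    """
    if not rows:
        return []
    width = len(rows[0])
    if any(len(row) != width for row in rows):
        raise ValueError("all rows must have the same length")

    keep = []
    for col in range(width):
        column = [row[col] for row in rows]
        gap_frequency = column.count(GAP) / len(column)
        residues = [char for char in column if char != GAP]
        if residues:
            top_count = Counter(residues).most_common(1)[0][1]
            conservation = top_count / len(column)
        else:
            conservation = 0.0
        if gap_frequency <= max_gap_frequency and conservation >= min_conservation:
            keep.append(col)

    # Rebuild each row from the retained column indices.
    return ["".join(row[col] for col in keep) for row in rows]


# Toy alignment (made up): column 3 is highly variable, column 5 is mostly gaps.
toy_rows = ["ACGT-A", "ACCT-A", "ATAT-C", "AGTTGA"]
masked = mask_alignment(toy_rows)
print(masked)
```

In this toy input the third and fifth columns fall below the conservation threshold and are removed, while the remaining four columns are kept.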

#### Tree construction

In [5]:
! qiime phylogeny fasttree \
    --i-alignment ./data/phylogeny/masked-aligned-rep-seqs.qza \
    --o-tree ./data/phylogeny/fast-tree.qza

! qiime phylogeny midpoint-root \
    --i-tree ./data/phylogeny/fast-tree.qza \
    --o-rooted-tree ./data/phylogeny/fast-tree-rooted.qza

Saved Phylogeny[Unrooted] to: ./data/phylogeny/fast-tree.qza
Saved Phylogeny[Rooted] to: ./data/phylogeny/fast-tree-rooted.qza
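
Midpoint rooting places the root halfway along the longest tip-to-tip path of the unrooted tree. The sketch below shows that idea on a small hand-made tree stored as an adjacency dict. It is an illustration only, not the code QIIME 2 runs: `farthest_node`, `midpoint`, and the toy tree with its branch lengths are all hypothetical.

```python
def farthest_node(tree, start):
    """Return (node, distance, path) for the node farthest from `start`."""
    best = (start, 0.0, [start])
    stack = [(start, None, 0.0, [start])]
    while stack:
        node, parent, dist, path = stack.pop()
        if dist > best[1]:
            best = (node, dist, path)
        for neighbor, length in tree[node].items():
            if neighbor != parent:
                stack.append((neighbor, node, dist + length, path + [neighbor]))
    return best


def midpoint(tree, any_node):
    """Locate the midpoint of the longest path in the tree.

    Returns (u, v, offset, longest): the midpoint lies on branch u-v,
    `offset` away from u, and `longest` is the full path length.
    """
    # Two passes: the node farthest from any node is one end of the longest
    # path; the node farthest from that end is the other end.
    tip_a, _, _ = farthest_node(tree, any_node)
    _, longest, path = farthest_node(tree, tip_a)
    half = longest / 2
    walked = 0.0
    for u, v in zip(path, path[1:]):
        length = tree[u][v]
        if walked + length >= half:
            return (u, v, half - walked, longest)
        walked += length
    raise ValueError("tree has no branches")


# Toy unrooted tree (made up): tips A-D, internal nodes n1 and n2.
toy_tree = {
    "A": {"n1": 1.0},
    "B": {"n1": 2.0},
    "C": {"n2": 3.0},
    "D": {"n2": 0.5},
    "n1": {"A": 1.0, "B": 2.0, "n2": 0.5},
    "n2": {"C": 3.0, "D": 0.5, "n1": 0.5},
}

print(midpoint(toy_tree, "A"))
```

Here the longest path runs between tips B and C (length 5.5), so the root would sit on the branch between C and n2, 2.75 away from C.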

#### Unable to create tree visualization on Jupyter

We could not install the QIIME 2 plugin `empress` in the Jupyter environment, so we generated the tree visualization on `Euler` instead.
The bash script we ran on Euler is in `./scripts/tree_visualization.sh`. The resulting `.qzv` file is loaded below.

In [2]:
Visualization.load('./data/phylogeny/fast-tree-rooted.qzv')
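
Because the `.qzv` file is produced outside this notebook, loading it fails if the file was not copied back from Euler. A minimal check, assuming the paths used above, can be sketched as follows; `missing_files` is a hypothetical helper, not part of QIIME 2.

```python
from pathlib import Path


def missing_files(paths):
    """Return the subset of `paths` that do not exist as regular files."""
    return [str(path) for path in map(Path, paths) if not path.is_file()]


# Paths taken from the cells above.
expected = [
    "./data/phylogeny/fast-tree-rooted.qza",
    "./data/phylogeny/fast-tree-rooted.qzv",
]

missing = missing_files(expected)
if missing:
    print("Missing files:", ", ".join(missing))
else:
    print("All expected files are present.")
```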