# Diversity analysis

In this script, diversity metrics for the bacterial and archaeal communities are computed and inspected.

In [2]:
# import required packages
from qiime2 import Visualization

## 1. Bacteria

#### 1.1 Phylogeny de novo

A) Sequence alignment

In [2]:
# Sequence alignment
! qiime alignment mafft \
    --i-sequences Data/1-rep-seqs_bac.qza \
    --o-alignment Data/2-aligned-rep-seqs_bac.qza

# Alignment masking
! qiime alignment mask \
    --i-alignment Data/2-aligned-rep-seqs_bac.qza \
    --o-masked-alignment Data/2-masked-aligned-rep-seqs_bac.qza

Saved FeatureData[AlignedSequence] to: Data/2-aligned-rep-seqs_bac.qza
Saved FeatureData[AlignedSequence] to: Data/2-masked-aligned-rep-seqs_bac.qza

B) Phylogenetic tree construction

We applied the steps required to create a rooted de novo phylogenetic tree 
from the masked alignment produced in the previous step.
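As a rough illustration of what midpoint rooting does, the sketch below finds the longest tip-to-tip path in a small tree and places the root halfway along it. The toy tree, its branch lengths, and the helper names `path_between` and `midpoint_root` are hypothetical. This is not QIIME 2's implementation, only a minimal sketch of the idea.

```python
from itertools import combinations


def path_between(adj, start, goal):
    """Return the unique path (list of nodes) between two nodes of a tree."""
    stack = [(start, [start])]
    while stack:
        node, path = stack.pop()
        if node == goal:
            return path
        for nxt in adj[node]:
            if nxt not in path:
                stack.append((nxt, path + [nxt]))
    raise ValueError("nodes are not connected")


def midpoint_root(adj):
    """Return the edge holding the midpoint of the longest tip-to-tip path,
    and the distance of that midpoint from the first node of the edge."""
    tips = [node for node, nbrs in adj.items() if len(nbrs) == 1]

    def length(path):
        return sum(adj[a][b] for a, b in zip(path, path[1:]))

    longest = max(
        (path_between(adj, a, b) for a, b in combinations(tips, 2)),
        key=length,
    )
    half = length(longest) / 2
    walked = 0.0
    for a, b in zip(longest, longest[1:]):
        if walked + adj[a][b] >= half:
            return (a, b), half - walked
        walked += adj[a][b]


# Hypothetical unrooted tree: three tips joined at internal node X.
tree = {
    "A": {"X": 1.0},
    "B": {"X": 3.0},
    "C": {"X": 2.0},
    "X": {"A": 1.0, "B": 3.0, "C": 2.0},
}
edge, offset = midpoint_root(tree)
print(edge, offset)
```

In this toy tree the longest path runs from B to C (length 5.0), so the root falls on the B-X branch, 2.5 units from B, which leaves both B and C at the same distance from the root.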

In [3]:
! qiime phylogeny fasttree \
    --i-alignment Data/2-masked-aligned-rep-seqs_bac.qza \
    --o-tree Data/2-fasttree-tree_bac.qza

! qiime phylogeny midpoint-root \
    --i-tree Data/2-fasttree-tree_bac.qza \
    --o-rooted-tree Data/2-fasttree-tree-rooted_bac.qza

Saved Phylogeny[Unrooted] to: Data/2-fasttree-tree_bac.qza
Saved Phylogeny[Rooted] to: Data/2-fasttree-tree-rooted_bac.qza

#### 1.2 Rarefaction

Before computing the diversity metrics, we performed rarefaction to check sample coverage and to choose an appropriate sampling depth.
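To make the idea concrete, here is a minimal sketch of a rarefaction curve for one sample: reads are subsampled without replacement at a series of depths, and the number of observed features is recorded at each depth. The toy counts, the evenly spaced depths, and the helper names `rarefy` and `rarefaction_curve` are assumptions for illustration; the actual QIIME 2 command also repeats the subsampling several times per depth and reports more metrics.

```python
import random
from collections import Counter


def rarefy(counts, depth, rng):
    """Subsample `depth` reads without replacement from a {feature: count} dict."""
    reads = [feature for feature, n in counts.items() for _ in range(n)]
    if depth > len(reads):
        raise ValueError("depth exceeds the number of reads in the sample")
    return Counter(rng.sample(reads, depth))


def rarefaction_curve(counts, max_depth, steps, rng):
    """Observed features at `steps` evenly spaced depths from 1 to max_depth."""
    depths = [round(1 + i * (max_depth - 1) / (steps - 1)) for i in range(steps)]
    return [(d, len(rarefy(counts, d, rng))) for d in depths]


# Toy sample with hypothetical counts (not taken from this study).
sample = {"ASV1": 500, "ASV2": 300, "ASV3": 150, "ASV4": 45, "ASV5": 5}
rng = random.Random(42)  # fixed seed so the sketch is reproducible
curve = rarefaction_curve(sample, max_depth=1000, steps=10, rng=rng)
for depth, observed in curve:
    print(depth, observed)
```

A curve that flattens before the maximum depth suggests that additional reads would reveal few new features in that sample.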

In [4]:
! qiime diversity alpha-rarefaction \
    --i-table Data/1-feature-table_bac.qza \
    --i-phylogeny Data/2-fasttree-tree-rooted_bac.qza \
    --p-max-depth 50000 \
    --p-steps 10 \
    --m-metadata-file Data/0-metadata_bac.tsv \
    --o-visualization Results/2-alpha-rarefaction_bac.qzv

Saved Visualization to: Results/2-alpha-rarefaction_bac.qzv

In [4]:
Visualization.load('Results/2-alpha-rarefaction_bac.qzv')

In [6]:
Visualization.load('Results/1-feature-table_bac.qzv')

#### 1.3 Diversity metrics

We selected a sampling depth of 40000; this value is high enough to capture the diversity present in the samples. The diversity metrics were then computed at this depth.
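The trade-off behind a sampling depth can be sketched as follows: samples with fewer reads than the chosen depth are dropped, and every retained sample is subsampled down to that depth. The per-sample totals below are hypothetical, as is the helper name `samples_retained`. In practice the totals would be read from the feature table summary.

```python
def samples_retained(sample_totals, depth):
    """Split samples into kept/dropped at a given sampling depth and report
    the fraction of all reads that survives rarefaction."""
    kept = sorted(s for s, n in sample_totals.items() if n >= depth)
    dropped = sorted(s for s, n in sample_totals.items() if n < depth)
    fraction = depth * len(kept) / sum(sample_totals.values())
    return kept, dropped, fraction


# Hypothetical read totals per sample (not taken from this study).
totals = {"S1": 52000, "S2": 47500, "S3": 41000, "S4": 38000}
kept, dropped, fraction = samples_retained(totals, depth=40000)
print("kept:", kept)
print("dropped:", dropped)
print("fraction of reads retained:", round(fraction, 3))
```

A higher depth keeps more reads per sample but may exclude shallow samples, so the value is usually chosen where the rarefaction curves have levelled off and few samples are lost.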

In [18]:
! qiime diversity core-metrics-phylogenetic \
  --i-table Data/1-feature-table_bac.qza \
  --i-phylogeny Data/2-fasttree-tree-rooted_bac.qza \
  --m-metadata-file Data/0-metadata_bac.tsv \
  --p-sampling-depth  40000 \
  --output-dir Results/2-core-metrics

Saved FeatureTable[Frequency] to: Results/2-core-metrics/rarefied_table.qza
Saved SampleData[AlphaDiversity] to: Results/2-core-metrics/faith_pd_vector.qza
Saved SampleData[AlphaDiversity] to: Results/2-core-metrics/observed_features_vector.qza
Saved SampleData[AlphaDiversity] to: Results/2-core-metrics/shannon_vector.qza
Saved SampleData[AlphaDiversity] to: Results/2-core-metrics/evenness_vector.qza
Saved DistanceMatrix to: Results/2-core-metrics/unweighted_unifrac_distance_matrix.qza
Saved DistanceMatrix to: Results/2-core-metrics/weighted_unifrac_distance_matrix.qza
Saved DistanceMatrix to: Results/2-core-metrics/jaccard_distance_matrix.qza
Saved DistanceMatrix to: Results/2-core-metrics/bray_curtis_distance_matrix.qza
Saved PCoAResults to: Results/2-core-metrics/unweighted_unifrac_pcoa_results.qza
Saved PCoAResults to: Results/2-core-metrics/weighted_unifrac_pcoa_results.qza
Save

#### 1.4 Beta diversity

There are obvious differences in microbial communities between phases across all four beta diversity metrics. The statistical significance of these differences will be tested in the R script.
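For reference, the two non-phylogenetic metrics can be sketched in a few lines. The example below computes Bray-Curtis dissimilarity (abundance-based) and Jaccard distance (presence/absence) for two toy count vectors. The counts are hypothetical, and the UniFrac metrics are not covered here because they also require the phylogenetic tree.

```python
def bray_curtis(u, v):
    """Bray-Curtis dissimilarity between two count vectors of equal length."""
    numerator = sum(abs(a - b) for a, b in zip(u, v))
    denominator = sum(a + b for a, b in zip(u, v))
    return numerator / denominator


def jaccard(u, v):
    """Jaccard distance based on presence/absence of each feature."""
    present_u = {i for i, a in enumerate(u) if a > 0}
    present_v = {i for i, b in enumerate(v) if b > 0}
    shared = len(present_u & present_v)
    total = len(present_u | present_v)
    return 1 - shared / total


# Two hypothetical rarefied samples with the same total count (20 reads each).
sample_a = [10, 0, 5, 5]
sample_b = [10, 5, 5, 0]
bc = bray_curtis(sample_a, sample_b)
jd = jaccard(sample_a, sample_b)
print("Bray-Curtis:", bc)
print("Jaccard:", jd)
```

Both values range from 0 (identical communities) to 1 (no overlap); the PCoA plots loaded in the following cells are ordinations of the full matrices of such pairwise distances.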

In [11]:
Visualization.load('Results/2-core-metrics_bac/bray_curtis_emperor.qzv')

In [12]:
Visualization.load('Results/2-core-metrics_bac/jaccard_emperor.qzv')

In [14]:
Visualization.load('Results/2-core-metrics_bac/weighted_unifrac_emperor.qzv')

In [15]:
Visualization.load('Results/2-core-metrics_bac/unweighted_unifrac_emperor.qzv')

## 2. Archaea

#### 2.1 Phylogeny de novo

A) Sequence alignment

In [19]:
# Sequence alignment
! qiime alignment mafft \
    --i-sequences Data/1-rep-seqs_arc.qza \
    --o-alignment Data/2-aligned-rep-seqs_arc.qza

# Alignment masking
! qiime alignment mask \
    --i-alignment Data/2-aligned-rep-seqs_arc.qza \
    --o-masked-alignment Data/2-masked-aligned-rep-seqs_arc.qza

Saved FeatureData[AlignedSequence] to: Data/2-aligned-rep-seqs_arc.qza
Saved FeatureData[AlignedSequence] to: Data/2-masked-aligned-rep-seqs_arc.qza

B) Tree construction

In [20]:
! qiime phylogeny fasttree \
    --i-alignment Data/2-masked-aligned-rep-seqs_arc.qza \
    --o-tree Data/2-fasttree-tree_arc.qza

! qiime phylogeny midpoint-root \
    --i-tree Data/2-fasttree-tree_arc.qza \
    --o-rooted-tree Data/2-fasttree-tree-rooted_arc.qza

Saved Phylogeny[Unrooted] to: Data/2-fasttree-tree_arc.qza
Saved Phylogeny[Rooted] to: Data/2-fasttree-tree-rooted_arc.qza

#### 2.2 Rarefaction

In [22]:
! qiime diversity alpha-rarefaction \
    --i-table Data/1-feature-table_arc.qza \
    --i-phylogeny Data/2-fasttree-tree-rooted_arc.qza \
    --p-max-depth 10000 \
    --p-steps 10 \
    --m-metadata-file Data/0-metadata_arc.tsv \
    --o-visualization Results/2-alpha-rarefaction_arc.qzv

Saved Visualization to: Results/2-alpha-rarefaction_arc.qzv

In [5]:
Visualization.load('Results/2-alpha-rarefaction_arc.qzv')

In [24]:
Visualization.load('Results/1-feature-table_arc.qzv')

#### 2.3 Diversity metrics

In [19]:
! qiime diversity core-metrics-phylogenetic \
  --i-table Data/1-feature-table_arc.qza \
  --i-phylogeny Data/2-fasttree-tree-rooted_arc.qza \
  --m-metadata-file Data/0-metadata_arc.tsv \
  --p-sampling-depth  5000 \
  --output-dir Results/2-core-metrics_arc

Saved FeatureTable[Frequency] to: Results/2-core-metrics_arc/rarefied_table.qza
Saved SampleData[AlphaDiversity] to: Results/2-core-metrics_arc/faith_pd_vector.qza
Saved SampleData[AlphaDiversity] to: Results/2-core-metrics_arc/observed_features_vector.qza
Saved SampleData[AlphaDiversity] to: Results/2-core-metrics_arc/shannon_vector.qza
Saved SampleData[AlphaDiversity] to: Results/2-core-metrics_arc/evenness_vector.qza
Saved DistanceMatrix to: Results/2-core-metrics_arc/unweighted_unifrac_distance_matrix.qza
Saved DistanceMatrix to: Results/2-core-metrics_arc/weighted_unifrac_distance_matrix.qza
Saved DistanceMatrix to: Results/2-core-metrics_arc/jaccard_distance_matrix.qza
Saved DistanceMatrix to: Results/2-core-metrics_arc/bray_curtis_distance_matrix.qza
Saved PCoAResults to: Results/2-core-metrics_arc/unweighted_unifrac_pcoa_results.qza
Saved PCoAResults to: Results/2-core-metrics_arc/wei

#### 2.4 Beta diversity

In [25]:
Visualization.load('Results/2-core-metrics_arc/bray_curtis_emperor.qzv')

In [26]:
Visualization.load('Results/2-core-metrics_arc/jaccard_emperor.qzv')

In [27]:
Visualization.load('Results/2-core-metrics_arc/weighted_unifrac_emperor.qzv')

In [28]:
Visualization.load('Results/2-core-metrics_arc/unweighted_unifrac_emperor.qzv')