In [1]:
import qiime2

In [2]:
from qiime2 import Visualization

In [None]:
# Description: What is the influence of genotype (intrinsic) and environment (extrinsic) on anemone-associated bacterial communities?

#Data: Illumina MiSeq v3 paired-end (2 × 300 bp) reads (FASTQ)

#Tools: QIIME 2

#Pipeline: 
#Section 1: Importing, cleaning and quality control of the data
#Section 2: Taxonomic Analysis
#Section 3: Building a phylogenetic tree
#Section 4: Basic visualisations and statistics
#Section 5: Exporting data for further analysis in R

In [2]:
#https://www.melbournebioinformatics.org.au/tutorials/tutorials/qiime2/qiime2/

In [None]:
#Section 1: Importing, cleaning and quality control of the data

In [None]:
#The raw data located in the directory raw_data is imported into a single QIIME 2 artefact file, combined.qza, located in the directory analysis/seqs.

In [None]:
mkdir -p analysis/seqs
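
In [None]:
#The import command itself is not shown in this notebook. Below is a minimal sketch of what it might look like, assuming the FASTQ files in raw_data follow the Casava 1.8 naming scheme (the format name CasavaOneEightSingleLanePerSampleDirFmt appears in the cutadapt log further down). The helper build_import_command is hypothetical and only assembles the argument list for `qiime tools import`; it does not run anything.

```python
def build_import_command(input_dir, output_qza):
    """Return the argument list for a paired-end Casava 1.8 import (assumed format)."""
    return [
        "qiime", "tools", "import",
        "--type", "SampleData[PairedEndSequencesWithQuality]",
        "--input-path", input_dir,
        "--input-format", "CasavaOneEightSingleLanePerSampleDirFmt",
        "--output-path", output_qza,
    ]


import_cmd = build_import_command("raw_data", "analysis/seqs/combined.qza")
# Print the command so it can be checked before running it.
print(" ".join(import_cmd))
```

#If the printed command looks right, it could be run with subprocess.run(import_cmd, check=True) or pasted after a leading "!" in a cell.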

In [19]:
#These sequences still have the primers attached - they need to be removed (using cutadapt) before denoising.

!qiime cutadapt trim-paired \
--i-demultiplexed-sequences analysis/seqs/combined.qza \
--p-front-f AGGATTAGATACCCTGGTA \
--p-front-r CRRCACGAGCTGACGAC \
--p-error-rate 0.20 \
--output-dir analysis/seqs_trimmed \
--verbose

Running external command line application. This may print messages to stdout and/or stderr.
The commands to be run are below. These commands cannot be manually re-run as they will depend on temporary files that no longer exist.

Command: cutadapt --cores 1 --error-rate 0.2 --times 1 --overlap 3 --minimum-length 1 -q 0,0 --quality-base 33 -o /var/folders/cb/bxr92qgn10d978brzl0pdtfc0000gn/T/q2-CasavaOneEightSingleLanePerSampleDirFmt-dlak_tqs/AN10_Fwd01-Rev11_L001_R1_001.fastq.gz -p /var/folders/cb/bxr92qgn10d978brzl0pdtfc0000gn/T/q2-CasavaOneEightSingleLanePerSampleDirFmt-dlak_tqs/AN10_Fwd01-Rev11_L001_R2_001.fastq.gz --front AGGATTAGATACCCTGGTA -G CRRCACGAGCTGACGAC /var/folders/cb/bxr92qgn10d978brzl0pdtfc0000gn/T/qiime2/laurenstoeltje/data/f28e9851-b0cc-49eb-95b3-0e8ee9c90fe4/data/AN10_Fwd01-Rev11_L001_R1_001.fastq.gz /var/folders/cb/bxr92qgn10d978brzl0pdtfc0000gn/T/qiime2/laurenstoeltje/data/f28e9851-b0cc-49eb-95b3-0e8ee9c90fe4/data/AN10_Fwd01-Rev11_L001_R2_001.fastq.gz

This is cutada

Done           00:00:00        13,612 reads @  47.2 µs/read;   1.27 M reads/minute
Finished in 0.650 s (47.770 µs/read; 1.26 M reads/minute).

=== Summary ===

Total read pairs processed:             13,612
  Read 1 with adapter:                  13,609 (100.0%)
  Read 2 with adapter:                  13,597 (99.9%)

== Read fate breakdown ==
Pairs that were too short:                   0 (0.0%)
Pairs written (passing filters):        13,612 (100.0%)

Total basepairs processed:     7,704,804 bp
  Read 1:     4,017,738 bp
  Read 2:     3,687,066 bp
Quality-trimmed:                       0 bp (0.0%)
  Read 1:             0 bp
  Read 2:             0 bp
Total written (filtered):      6,374,606 bp (82.7%)
  Read 1:     3,325,259 bp
  Read 2:     3,049,347 bp

=== First read: Adapter 1 ===

Sequence: AGGATTAGATACCCTGGTA; Type: regular 5'; Length: 19; Trimmed: 13609 times

Minimum overlap: 3
No. of allowed errors:
1-4 bp: 0; 5-9 bp: 1; 10-14 bp: 2; 15-19 bp: 3

Overview of removed sequences



Command: cutadapt --cores 1 --error-rate 0.2 --times 1 --overlap 3 --minimum-length 1 -q 0,0 --quality-base 33 -o /var/folders/cb/bxr92qgn10d978brzl0pdtfc0000gn/T/q2-CasavaOneEightSingleLanePerSampleDirFmt-dlak_tqs/AN48_Fwd03-Rev13_L001_R1_001.fastq.gz -p /var/folders/cb/bxr92qgn10d978brzl0pdtfc0000gn/T/q2-CasavaOneEightSingleLanePerSampleDirFmt-dlak_tqs/AN48_Fwd03-Rev13_L001_R2_001.fastq.gz --front AGGATTAGATACCCTGGTA -G CRRCACGAGCTGACGAC /var/folders/cb/bxr92qgn10d978brzl0pdtfc0000gn/T/qiime2/laurenstoeltje/data/f28e9851-b0cc-49eb-95b3-0e8ee9c90fe4/data/AN48_Fwd03-Rev13_L001_R1_001.fastq.gz /var/folders/cb/bxr92qgn10d978brzl0pdtfc0000gn/T/qiime2/laurenstoeltje/data/f28e9851-b0cc-49eb-95b3-0e8ee9c90fe4/data/AN48_Fwd03-Rev13_L001_R2_001.fastq.gz

This is cutadapt 4.5 with Python 3.8.18
Command line parameters: --cores 1 --error-rate 0.2 --times 1 --overlap 3 --minimum-length 1 -q 0,0 --quality-base 33 -o /var/folders/cb/bxr92qgn10d978brzl0pdtfc0000gn/T/q2-CasavaOneEightSingleLanePerSa

This is cutadapt 4.5 with Python 3.8.18
Command line parameters: --cores 1 --error-rate 0.2 --times 1 --overlap 3 --minimum-length 1 -q 0,0 --quality-base 33 -o /var/folders/cb/bxr92qgn10d978brzl0pdtfc0000gn/T/q2-CasavaOneEightSingleLanePerSampleDirFmt-dlak_tqs/AN67_Fwd05-Rev07_L001_R1_001.fastq.gz -p /var/folders/cb/bxr92qgn10d978brzl0pdtfc0000gn/T/q2-CasavaOneEightSingleLanePerSampleDirFmt-dlak_tqs/AN67_Fwd05-Rev07_L001_R2_001.fastq.gz --front AGGATTAGATACCCTGGTA -G CRRCACGAGCTGACGAC /var/folders/cb/bxr92qgn10d978brzl0pdtfc0000gn/T/qiime2/laurenstoeltje/data/f28e9851-b0cc-49eb-95b3-0e8ee9c90fe4/data/AN67_Fwd05-Rev07_L001_R1_001.fastq.gz /var/folders/cb/bxr92qgn10d978brzl0pdtfc0000gn/T/qiime2/laurenstoeltje/data/f28e9851-b0cc-49eb-95b3-0e8ee9c90fe4/data/AN67_Fwd05-Rev07_L001_R2_001.fastq.gz
Processing paired-end reads on 1 core ...
Done           00:00:00        12,855 reads @  45.3 µs/read;   1.32 M reads/minute
Finished in 0.589 s (45.843 µs/read; 1.31 M reads/minute).

=== Summary 

This is cutadapt 4.5 with Python 3.8.18
Command line parameters: --cores 1 --error-rate 0.2 --times 1 --overlap 3 --minimum-length 1 -q 0,0 --quality-base 33 -o /var/folders/cb/bxr92qgn10d978brzl0pdtfc0000gn/T/q2-CasavaOneEightSingleLanePerSampleDirFmt-dlak_tqs/AN74_Fwd06-Rev02_L001_R1_001.fastq.gz -p /var/folders/cb/bxr92qgn10d978brzl0pdtfc0000gn/T/q2-CasavaOneEightSingleLanePerSampleDirFmt-dlak_tqs/AN74_Fwd06-Rev02_L001_R2_001.fastq.gz --front AGGATTAGATACCCTGGTA -G CRRCACGAGCTGACGAC /var/folders/cb/bxr92qgn10d978brzl0pdtfc0000gn/T/qiime2/laurenstoeltje/data/f28e9851-b0cc-49eb-95b3-0e8ee9c90fe4/data/AN74_Fwd06-Rev02_L001_R1_001.fastq.gz /var/folders/cb/bxr92qgn10d978brzl0pdtfc0000gn/T/qiime2/laurenstoeltje/data/f28e9851-b0cc-49eb-95b3-0e8ee9c90fe4/data/AN74_Fwd06-Rev02_L001_R2_001.fastq.gz
Processing paired-end reads on 1 core ...
Done           00:00:00        12,482 reads @  50.5 µs/read;   1.19 M reads/minute
Finished in 0.642 s (51.470 µs/read; 1.17 M reads/minute).

=== Summary 

This is cutadapt 4.5 with Python 3.8.18
Command line parameters: --cores 1 --error-rate 0.2 --times 1 --overlap 3 --minimum-length 1 -q 0,0 --quality-base 33 -o /var/folders/cb/bxr92qgn10d978brzl0pdtfc0000gn/T/q2-CasavaOneEightSingleLanePerSampleDirFmt-dlak_tqs/AN78_Fwd06-Rev06_L001_R1_001.fastq.gz -p /var/folders/cb/bxr92qgn10d978brzl0pdtfc0000gn/T/q2-CasavaOneEightSingleLanePerSampleDirFmt-dlak_tqs/AN78_Fwd06-Rev06_L001_R2_001.fastq.gz --front AGGATTAGATACCCTGGTA -G CRRCACGAGCTGACGAC /var/folders/cb/bxr92qgn10d978brzl0pdtfc0000gn/T/qiime2/laurenstoeltje/data/f28e9851-b0cc-49eb-95b3-0e8ee9c90fe4/data/AN78_Fwd06-Rev06_L001_R1_001.fastq.gz /var/folders/cb/bxr92qgn10d978brzl0pdtfc0000gn/T/qiime2/laurenstoeltje/data/f28e9851-b0cc-49eb-95b3-0e8ee9c90fe4/data/AN78_Fwd06-Rev06_L001_R2_001.fastq.gz
Processing paired-end reads on 1 core ...
Done           00:00:00        16,591 reads @  57.7 µs/read;   1.04 M reads/minute
Finished in 0.967 s (58.311 µs/read; 1.03 M reads/minute).

=== Summary 


Command: cutadapt --cores 1 --error-rate 0.2 --times 1 --overlap 3 --minimum-length 1 -q 0,0 --quality-base 33 -o /var/folders/cb/bxr92qgn10d978brzl0pdtfc0000gn/T/q2-CasavaOneEightSingleLanePerSampleDirFmt-dlak_tqs/AN93_Fwd07-Rev09_L001_R1_001.fastq.gz -p /var/folders/cb/bxr92qgn10d978brzl0pdtfc0000gn/T/q2-CasavaOneEightSingleLanePerSampleDirFmt-dlak_tqs/AN93_Fwd07-Rev09_L001_R2_001.fastq.gz --front AGGATTAGATACCCTGGTA -G CRRCACGAGCTGACGAC /var/folders/cb/bxr92qgn10d978brzl0pdtfc0000gn/T/qiime2/laurenstoeltje/data/f28e9851-b0cc-49eb-95b3-0e8ee9c90fe4/data/AN93_Fwd07-Rev09_L001_R1_001.fastq.gz /var/folders/cb/bxr92qgn10d978brzl0pdtfc0000gn/T/qiime2/laurenstoeltje/data/f28e9851-b0cc-49eb-95b3-0e8ee9c90fe4/data/AN93_Fwd07-Rev09_L001_R2_001.fastq.gz

This is cutadapt 4.5 with Python 3.8.18
Command line parameters: --cores 1 --error-rate 0.2 --times 1 --overlap 3 --minimum-length 1 -q 0,0 --quality-base 33 -o /var/folders/cb/bxr92qgn10d978brzl0pdtfc0000gn/T/q2-CasavaOneEightSingleLanePerSa

This is cutadapt 4.5 with Python 3.8.18
Command line parameters: --cores 1 --error-rate 0.2 --times 1 --overlap 3 --minimum-length 1 -q 0,0 --quality-base 33 -o /var/folders/cb/bxr92qgn10d978brzl0pdtfc0000gn/T/q2-CasavaOneEightSingleLanePerSampleDirFmt-dlak_tqs/CTRLA_Fwd04-Rev25_L001_R1_001.fastq.gz -p /var/folders/cb/bxr92qgn10d978brzl0pdtfc0000gn/T/q2-CasavaOneEightSingleLanePerSampleDirFmt-dlak_tqs/CTRLA_Fwd04-Rev25_L001_R2_001.fastq.gz --front AGGATTAGATACCCTGGTA -G CRRCACGAGCTGACGAC /var/folders/cb/bxr92qgn10d978brzl0pdtfc0000gn/T/qiime2/laurenstoeltje/data/f28e9851-b0cc-49eb-95b3-0e8ee9c90fe4/data/CTRLA_Fwd04-Rev25_L001_R1_001.fastq.gz /var/folders/cb/bxr92qgn10d978brzl0pdtfc0000gn/T/qiime2/laurenstoeltje/data/f28e9851-b0cc-49eb-95b3-0e8ee9c90fe4/data/CTRLA_Fwd04-Rev25_L001_R2_001.fastq.gz
Processing paired-end reads on 1 core ...
Done           00:00:00        11,121 reads @  45.1 µs/read;   1.33 M reads/minute
Finished in 0.511 s (45.928 µs/read; 1.31 M reads/minute).

=== Summ

Saved SampleData[PairedEndSequencesWithQuality] to: analysis/seqs_trimmed/trimmed_sequences.qza

In [None]:
mkdir analysis/visualisations
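
In [None]:
#The cutadapt report above lists the number of allowed errors per match length (1-4 bp: 0; 5-9 bp: 1; 10-14 bp: 2; 15-19 bp: 3) for --p-error-rate 0.20. The sketch below reproduces that table by rounding down length × error rate, which is an assumption about how cutadapt derives it. It also expands the degenerate reverse primer, where the IUPAC code R stands for A or G. The helper names are hypothetical.

```python
# IUPAC codes needed for the two primers used in this notebook (R = A or G).
IUPAC = {"A": "A", "C": "C", "G": "G", "T": "T", "R": "AG"}


def allowed_errors(overlap_length, error_rate=0.2):
    """Mismatches tolerated for a match of the given length, rounded down."""
    return int(overlap_length * error_rate)


def expand_degenerate(primer):
    """List every concrete sequence that a degenerate primer can match."""
    sequences = [""]
    for base in primer:
        sequences = [s + option for s in sequences for option in IUPAC[base]]
    return sequences


forward_primer = "AGGATTAGATACCCTGGTA"
reverse_primer = "CRRCACGAGCTGACGAC"

for length in (4, 9, 14, 19):
    print(length, "bp ->", allowed_errors(length), "allowed errors")

reverse_variants = expand_degenerate(reverse_primer)
print(len(reverse_variants), "concrete reverse primer sequences")
```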

In [15]:
#Create a viewable summary file so the data quality can be checked. Viewing the quality plots generated here helps determine trim settings.
!qiime demux summarize \
--i-data analysis/seqs_trimmed/trimmed_sequences.qza \
--o-visualization analysis/visualisations/trimmed_sequences.qzv

Saved Visualization to: analysis/visualisations/trimmed_sequences.qzv

In [18]:
# Load and display visualization for trimmed sequences
path_to_trimmed_sequences_qzv = 'analysis/visualisations/trimmed_sequences.qzv'
visualization_trimmed_sequences = Visualization.load(path_to_trimmed_sequences_qzv)
display(visualization_trimmed_sequences)
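
In [None]:
#A .qza or .qzv file is a zip archive whose top-level directory is named after the artefact's UUID and contains a metadata.yaml file. The sketch below reads that file with the standard library only, which can help when a visualisation will not load. The helper read_archive_metadata is hypothetical, and the simple "key: value" parsing is an assumption that holds only for flat YAML.

```python
import zipfile


def read_archive_metadata(path):
    """Return the key/value pairs stored in <uuid>/metadata.yaml of a .qza or .qzv file."""
    with zipfile.ZipFile(path) as archive:
        # The top-level metadata.yaml sits exactly one directory below the archive root.
        name = next(
            n for n in archive.namelist()
            if n.count("/") == 1 and n.endswith("/metadata.yaml")
        )
        text = archive.read(name).decode("utf-8")
    metadata = {}
    for line in text.splitlines():
        key, separator, value = line.partition(":")
        if separator:
            metadata[key.strip()] = value.strip()
    return metadata
```

#Example call, assuming the file created above exists: read_archive_metadata('analysis/visualisations/trimmed_sequences.qzv')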

In [None]:
#Denoising the data
#The trimmed sequences are now quality filtered, denoised and merged into amplicon sequence variants (ASVs) using the dada2 plugin within QIIME 2.

In [26]:
!qiime dada2 denoise-paired \
--i-demultiplexed-seqs analysis/seqs_trimmed/trimmed_sequences.qza \
--p-trunc-len-f 180 \
--p-trunc-len-r 160 \
--p-n-threads 0 \
--output-dir analysis/dada2out \
--verbose

Running external command line application(s). This may print messages to stdout and/or stderr.
The command(s) being run are below. These commands cannot be manually re-run as they will depend on temporary files that no longer exist.

Command: run_dada.R --input_directory /var/folders/cb/bxr92qgn10d978brzl0pdtfc0000gn/T/tmp23gn9uqa/forward --input_directory_reverse /var/folders/cb/bxr92qgn10d978brzl0pdtfc0000gn/T/tmp23gn9uqa/reverse --output_path /var/folders/cb/bxr92qgn10d978brzl0pdtfc0000gn/T/tmp23gn9uqa/output.tsv.biom --output_track /var/folders/cb/bxr92qgn10d978brzl0pdtfc0000gn/T/tmp23gn9uqa/track.tsv --filtered_directory /var/folders/cb/bxr92qgn10d978brzl0pdtfc0000gn/T/tmp23gn9uqa/filt_f --filtered_directory_reverse /var/folders/cb/bxr92qgn10d978brzl0pdtfc0000gn/T/tmp23gn9uqa/filt_r --truncation_length 180 --truncation_length_reverse 160 --trim_left 0 --trim_left_reverse 0 --max_expected_errors 2.0 --max_expected_errors_reverse 2.0 --truncation_quality_score 2 --min_overlap 12 --p
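
In [None]:
#The truncation lengths must leave enough overlap for the forward and reverse reads to merge. A minimal check, assuming amplicon lengths of 240 to 289 bp (read here from the "700 x 289 - 240 d" line of the MAFFT log in Section 3) and the --min_overlap 12 value shown in the run_dada.R command above. The helper merge_overlap is hypothetical.

```python
def merge_overlap(trunc_len_f, trunc_len_r, amplicon_length):
    """Number of bases shared by the truncated forward and reverse reads."""
    return trunc_len_f + trunc_len_r - amplicon_length


TRUNC_F, TRUNC_R = 180, 160  # --p-trunc-len-f and --p-trunc-len-r used above
MIN_OVERLAP = 12             # --min_overlap value from the run_dada.R command

# Shortest and longest sequence lengths assumed from the MAFFT log.
for amplicon_length in (240, 289):
    overlap = merge_overlap(TRUNC_F, TRUNC_R, amplicon_length)
    print(amplicon_length, "bp amplicon:", overlap, "bp overlap, enough:", overlap >= MIN_OVERLAP)
```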

In [22]:
#Generate summary files

In [27]:
!qiime metadata tabulate \
--m-input-file analysis/dada2out/denoising_stats.qza \
--o-visualization analysis/visualisations/16s_denoising_stats.qzv \
--verbose

Saved Visualization to: analysis/visualisations/16s_denoising_stats.qzv

In [28]:
path_to_16s_denoising_stats_qzv = 'analysis/visualisations/16s_denoising_stats.qzv'
visualization_16s_denoising_stats = Visualization.load(path_to_16s_denoising_stats_qzv)
display(visualization_16s_denoising_stats)

In [31]:
!qiime feature-table summarize \
--i-table analysis/dada2out/table.qza \
--m-sample-metadata-file data_files/metadata.tsv \
--o-visualization analysis/visualisations/16s_table.qzv \
--verbose

Saved Visualization to: analysis/visualisations/16s_table.qzv

In [32]:
path_to_16s_table_qzv = 'analysis/visualisations/16s_table.qzv'
visualization_16s_table = Visualization.load(path_to_16s_table_qzv)
display(visualization_16s_table)

In [33]:
!qiime feature-table tabulate-seqs \
--i-data analysis/dada2out/representative_sequences.qza \
--o-visualization analysis/visualisations/16s_representative_seqs.qzv \
--verbose

Saved Visualization to: analysis/visualisations/16s_representative_seqs.qzv

In [34]:
path_to_16s_representative_seqs_qzv = 'analysis/visualisations/16s_representative_seqs.qzv'
visualization_16s_representative_seqs = Visualization.load(path_to_16s_representative_seqs_qzv)
display(visualization_16s_representative_seqs)

In [None]:
#Section 2: Taxonomic Analysis
#Assign taxonomy
#Classify each Amplicon Sequence Variant (ASV) to the finest taxonomic level that the reference database supports.
#A classifier for the V5V6 region of the bacterial 16S rRNA gene has already been trained using the SILVA database.

In [36]:
!qiime feature-classifier classify-sklearn \
--i-classifier silva_138_16s_v5v6_classifier_2021-4.qza \
--i-reads analysis/dada2out/representative_sequences.qza \
--p-n-jobs 1 \
--output-dir analysis/taxonomy \
--verbose

Saved FeatureData[Taxonomy] to: analysis/taxonomy/classification.qza

In [None]:
#Generate a viewable summary file of the taxonomic assignments.

In [38]:
!qiime metadata tabulate \
--m-input-file analysis/taxonomy/classification.qza \
--o-visualization analysis/visualisations/taxonomy.qzv \
--verbose

Saved Visualization to: analysis/visualisations/taxonomy.qzv

In [None]:
#Filtering
#Remove features classified as Mitochondria or Chloroplast, then generate a viewable summary file of the new table to see the effect of filtering.
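
In [None]:
#The filter in the next cell passes --p-exclude Mitochondria,Chloroplast. The sketch below illustrates the idea as a substring match against each taxonomy string; treating the match as case-insensitive is an assumption, and the taxonomy strings and feature IDs are hypothetical examples, not data from this study.

```python
def keep_feature(taxon, exclude_terms):
    """True when none of the exclude terms occurs in the taxonomy string."""
    lowered = taxon.lower()
    return not any(term.lower() in lowered for term in exclude_terms)


# Hypothetical SILVA-style annotations, for illustration only.
example_taxonomy = {
    "asv_1": "d__Bacteria; p__Proteobacteria; c__Gammaproteobacteria",
    "asv_2": "d__Bacteria; p__Cyanobacteria; c__Cyanobacteriia; o__Chloroplast",
    "asv_3": "d__Bacteria; p__Proteobacteria; c__Alphaproteobacteria; o__Rickettsiales; f__Mitochondria",
}

exclude = "Mitochondria,Chloroplast".split(",")
kept = [asv for asv, taxon in example_taxonomy.items() if keep_feature(taxon, exclude)]
print(kept)
```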

In [40]:
!qiime taxa filter-table \
--i-table analysis/dada2out/table.qza \
--i-taxonomy analysis/taxonomy/classification.qza  \
--p-exclude Mitochondria,Chloroplast \
--o-filtered-table analysis/taxonomy/16s_table_filtered.qza \
--verbose

Saved FeatureTable[Frequency] to: analysis/taxonomy/16s_table_filtered.qza

In [42]:
!qiime feature-table summarize \
--i-table analysis/taxonomy/16s_table_filtered.qza \
--m-sample-metadata-file data_files/metadata.tsv \
--o-visualization analysis/visualisations/16s_table_filtered.qzv \
--verbose

Saved Visualization to: analysis/visualisations/16s_table_filtered.qzv

In [43]:
path_to_16s_table_filtered_qzv = 'analysis/visualisations/16s_table_filtered.qzv'
visualization_16s_table_filtered = Visualization.load(path_to_16s_table_filtered_qzv)
display(visualization_16s_table_filtered)

In [None]:
#Section 3: Building a phylogenetic tree

In [44]:
!qiime phylogeny align-to-tree-mafft-fasttree \
--i-sequences analysis/dada2out/representative_sequences.qza \
--o-alignment analysis/tree/aligned_16s_representative_seqs.qza \
--o-masked-alignment analysis/tree/masked_aligned_16s_representative_seqs.qza \
--o-tree analysis/tree/16s_unrooted_tree.qza \
--o-rooted-tree analysis/tree/16s_rooted_tree.qza \
--p-n-threads 1 \
--verbose

Running external command line application. This may print messages to stdout and/or stderr.
The command being run is below. This command cannot be manually re-run as it will depend on temporary files that no longer exist.

Command: mafft --preservecase --inputorder --thread 1 /var/folders/cb/bxr92qgn10d978brzl0pdtfc0000gn/T/qiime2/laurenstoeltje/data/2aca41e4-4608-4f6b-8f17-bd396521c188/data/dna-sequences.fasta

inputfile = orig
700 x 289 - 240 d
nthread = 1
nthreadpair = 1
nthreadtb = 1
ppenalty_ex = 0
stacksize: 8192 kb
generating a scoring matrix for nucleotide (dist=200) ... done
Gap Penalty = -1.53, +0.00, +0.00



Making a distance matrix ..
  601 / 700 (thread    0)
done.

Constructing a UPGMA tree (efffree=0) ... 
  690 / 700
done.

Progressive alignment 1/2... 
STEP   601 / 699 (thread    0)
Reallocating..done. *alloclen = 1580

done.

Making a distance matrix from msa.. 
  600 / 700 (thread    0)
done.

Constructing a UPGMA tree (efffree=1) ... 
  690 / 700
done.

Progressive

In [None]:
#Section 4: Basic visualisations and statistics
#ASV relative abundance bar charts

In [None]:
#Create bar charts to compare the relative abundance of ASVs across samples.

In [45]:
!qiime taxa barplot \
--i-table analysis/taxonomy/16s_table_filtered.qza \
--i-taxonomy analysis/taxonomy/classification.qza \
--m-metadata-file data_files/metadata.tsv \
--o-visualization analysis/visualisations/barchart.qzv \
--verbose

Saved Visualization to: analysis/visualisations/barchart.qzv

In [46]:
path_to_barchart_qzv = 'analysis/visualisations/barchart.qzv'
visualization_barchart = Visualization.load(path_to_barchart_qzv)
display(visualization_barchart)

In [47]:
#Rarefaction curves

!qiime diversity alpha-rarefaction \
--i-table analysis/taxonomy/16s_table_filtered.qza \
--i-phylogeny analysis/tree/16s_rooted_tree.qza \
--p-max-depth 9062 \
--m-metadata-file data_files/metadata.tsv \
--o-visualization analysis/visualisations/16s_alpha_rarefaction.qzv \
--verbose

Running external command line application. This may print messages to stdout and/or stderr.
The command being run is below. This command cannot be manually re-run as it will depend on temporary files that no longer exist.

Command:

faithpd -i /var/folders/cb/bxr92qgn10d978brzl0pdtfc0000gn/T/qiime2/laurenstoeltje/data/84666e8e-d24b-469c-9b40-21f24e6ff11f/data/feature-table.biom -t /var/folders/cb/bxr92qgn10d978brzl0pdtfc0000gn/T/qiime2/laurenstoeltje/data/7c7ad574-3873-4de6-aa66-64895eca559e/data/tree.nwk -o /var/folders/cb/bxr92qgn10d978brzl0pdtfc0000gn/T/q2-AlphaDiversityFormat-pbqv10gq

In [16]:
path_to_qzv = 'analysis/visualisations/16s_alpha_rarefaction.qzv'
visualization = Visualization.load(path_to_qzv)
display(visualization)

In [None]:
#Alpha and beta diversity analysis
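
In [None]:
#The core metrics in the next cell are computed on a table rarefied to --p-sampling-depth 5583, so samples with fewer reads than that are left out. A minimal sketch of rarefying one sample by subsampling reads without replacement; the counts, sample names and the helper rarefy are hypothetical, and this is not the implementation QIIME 2 uses.

```python
import random
from collections import Counter


def rarefy(counts, depth, seed=0):
    """Subsample one sample's feature counts to `depth` reads without replacement."""
    reads = [feature for feature, n in counts.items() for _ in range(n)]
    if len(reads) < depth:
        return None  # too shallow: the sample is dropped
    chosen = random.Random(seed).sample(reads, depth)
    return dict(Counter(chosen))


# Hypothetical counts, for illustration only.
table = {
    "sample_a": {"asv_1": 60, "asv_2": 30, "asv_3": 10},
    "sample_b": {"asv_1": 20, "asv_2": 5},
}
rarefied = {name: rarefy(counts, depth=50) for name, counts in table.items()}
print(rarefied)
```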

In [49]:
!qiime diversity core-metrics-phylogenetic \
  --i-phylogeny analysis/tree/16s_rooted_tree.qza \
  --i-table analysis/taxonomy/16s_table_filtered.qza \
  --p-sampling-depth 5583 \
  --m-metadata-file data_files/metadata.tsv \
  --output-dir analysis/diversity_metrics

Saved FeatureTable[Frequency] to: analysis/diversity_metrics/rarefied_table.qza
Saved SampleData[AlphaDiversity] to: analysis/diversity_metrics/faith_pd_vector.qza
Saved SampleData[AlphaDiversity] to: analysis/diversity_metrics/observed_features_vector.qza
Saved SampleData[AlphaDiversity] to: analysis/diversity_metrics/shannon_vector.qza
Saved SampleData[AlphaDiversity] to: analysis/diversity_metrics/evenness_vector.qza
Saved DistanceMatrix to: analysis/diversity_metrics/unweighted_unifrac_distance_matrix.qza
Saved DistanceMatrix to: analysis/diversity_metrics/weighted_unifrac_distance_matrix.qza
Saved DistanceMatrix to: analysis/diversity_metrics/jaccard_distance_matrix.qza
Saved DistanceMatrix to: analysis/diversity_metrics/bray_curtis_distance_matrix.qza
Saved PCoAResults to: analysis/diversity_metrics/unweighted_unifrac_pcoa_results.qza
Saved PCoAResults to: analysis/diversity_metrics/wei

In [None]:
cp analysis/diversity_metrics/*.qzv analysis/visualisations

In [None]:
#Faith Phylogenetic Diversity (a measure of community richness) and evenness metrics
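
In [None]:
#Faith PD needs the phylogenetic tree and is not reproduced here. The evenness metric can be sketched from counts alone: Pielou's evenness is the Shannon entropy divided by its maximum for the observed number of features. The log base 2 used below is an assumption about the QIIME 2 default (the ratio itself does not depend on the base), and the function names are hypothetical.

```python
import math


def shannon(counts):
    """Shannon entropy in bits (log base 2) of a vector of feature counts."""
    total = sum(counts)
    return -sum((n / total) * math.log2(n / total) for n in counts if n > 0)


def pielou_evenness(counts):
    """Shannon entropy divided by its maximum for the observed number of features."""
    observed = sum(1 for n in counts if n > 0)
    if observed < 2:
        return float("nan")  # evenness is undefined for fewer than two features
    return shannon(counts) / math.log2(observed)


# Hypothetical communities: one perfectly even, one dominated by a single feature.
print(pielou_evenness([10, 10, 10, 10]))
print(pielou_evenness([97, 1, 1, 1]))
```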

In [50]:
!qiime diversity alpha-group-significance \
  --i-alpha-diversity analysis/diversity_metrics/faith_pd_vector.qza \
  --m-metadata-file data_files/metadata.tsv \
  --o-visualization analysis/visualisations/faith-pd-group-significance.qzv

Saved Visualization to: analysis/visualisations/faith-pd-group-significance.qzv

In [51]:
path_to_faith_pd = 'analysis/visualisations/faith-pd-group-significance.qzv'
visualization_faith_pd = Visualization.load(path_to_faith_pd)
display(visualization_faith_pd)

In [52]:
!qiime diversity alpha-group-significance \
  --i-alpha-diversity analysis/diversity_metrics/evenness_vector.qza \
  --m-metadata-file data_files/metadata.tsv \
  --o-visualization analysis/visualisations/evenness-group-significance.qzv

Saved Visualization to: analysis/visualisations/evenness-group-significance.qzv

In [53]:
path_to_evenness = 'analysis/visualisations/evenness-group-significance.qzv'
visualization_evenness = Visualization.load(path_to_evenness)
display(visualization_evenness)

In [None]:
#PERMANOVA
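
In [None]:
#PERMANOVA compares the variation among groups with the variation within groups using only a distance matrix, and obtains a p-value by shuffling the group labels. Below is a minimal pure-Python sketch of the pseudo-F statistic and the permutation test. The distances are hypothetical, the function names are my own, and 999 permutations is assumed to match the beta-group-significance default. It is not the code QIIME 2 runs.

```python
import random


def pseudo_f(dist, groups):
    """PERMANOVA pseudo-F statistic from a square distance matrix and group labels."""
    n = len(groups)
    labels = sorted(set(groups))
    # Total sum of squares from all pairwise distances.
    ss_total = sum(dist[i][j] ** 2 for i in range(n) for j in range(i + 1, n)) / n
    # Within-group sum of squares, one term per group.
    ss_within = 0.0
    for label in labels:
        members = [i for i in range(n) if groups[i] == label]
        ss_within += sum(
            dist[i][j] ** 2 for a, i in enumerate(members) for j in members[a + 1:]
        ) / len(members)
    ss_among = ss_total - ss_within
    return (ss_among / (len(labels) - 1)) / (ss_within / (n - len(labels)))


def permanova(dist, groups, permutations=999, seed=0):
    """Return (pseudo-F, permutation p-value) for the observed grouping."""
    rng = random.Random(seed)
    observed = pseudo_f(dist, groups)
    shuffled = list(groups)
    at_least_as_large = 0
    for _ in range(permutations):
        rng.shuffle(shuffled)
        if pseudo_f(dist, shuffled) >= observed:
            at_least_as_large += 1
    return observed, (at_least_as_large + 1) / (permutations + 1)


# Hypothetical distances: samples 0-2 and 3-5 form two tight groups.
points = [0.0, 0.1, 0.2, 5.0, 5.1, 5.2]
dist = [[abs(a - b) for b in points] for a in points]
f_stat, p_value = permanova(dist, ["x", "x", "x", "y", "y", "y"])
print(f_stat, p_value)
```

#With only six samples there are few distinct label arrangements, so the p-value cannot become very small however large the pseudo-F is. This is worth keeping in mind when group sizes are small.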

In [61]:
path_to_unweighted_unifrac = 'analysis/visualisations/unweighted_unifrac_emperor.qzv'
visualization_unweighted_unifrac = Visualization.load(path_to_unweighted_unifrac)
display(visualization_unweighted_unifrac)

In [54]:
!qiime diversity beta-group-significance \
  --i-distance-matrix analysis/diversity_metrics/unweighted_unifrac_distance_matrix.qza \
  --m-metadata-file data_files/metadata.tsv \
  --m-metadata-column Genotype \
  --o-visualization analysis/visualisations/unweighted-unifrac-genotype-significance.qzv \
  --p-pairwise

Saved Visualization to: analysis/visualisations/unweighted-unifrac-genotype-significance.qzv

In [55]:
path_to_unweighted_unifrac_genotype = 'analysis/visualisations/unweighted-unifrac-genotype-significance.qzv'
visualization_unweighted_unifrac = Visualization.load(path_to_unweighted_unifrac_genotype)
display(visualization_unweighted_unifrac)

In [None]:
#Group significance test on unweighted UniFrac distances, grouped by Environment

In [None]:
!qiime diversity beta-group-significance \
  --i-distance-matrix analysis/diversity_metrics/unweighted_unifrac_distance_matrix.qza \
  --m-metadata-file data_files/metadata.tsv \
  --m-metadata-column Environment \
  --o-visualization analysis/visualisations/unweighted-unifrac-environment-significance.qzv \
  --p-pairwise

In [57]:
path_to_unweighted_unifrac_env = 'analysis/visualisations/unweighted-unifrac-environment-significance.qzv'
visualization_unweighted_unifrac = Visualization.load(path_to_unweighted_unifrac_env)
display(visualization_unweighted_unifrac)

In [None]:
#Results:
#Substantial decreases in bacterial alpha diversity were observed in all “sterile” anemone genotypes.
#Simpson’s and Shannon’s bacterial diversity also decreased in all anemones, regardless of genotype, with significant reductions in AIMS2 (Shannon’s only), AIMS3 (both Shannon’s and Simpson’s), and AIMS4 (Shannon’s only).
#A total of 13 bacterial families were found in at least three genotypes of control anemones and absent in at least three genotypes of “sterile” anemones.
#Conversely, there were five families that were present in the “sterile” anemones, yet absent in control anemones.
#PERMANOVA results by genotype confirmed that the bacterial community structure varied significantly with the interaction of sample type and environment.

#Analysis of these results suggests that, for each of the four anemone genotypes, bacterial diversity and community structure changed significantly, with alpha diversity reduced, when anemones were moved from unfiltered seawater into sterile seawater.