In [5]:
import qiime2

In [6]:
!qiime tools import \
    --type 'SampleData[PairedEndSequencesWithQuality]' \
    --input-path 'read-files/sample-manifest.tsv' \
    --output-path paired-end-demux.qza \
    --input-format PairedEndFastqManifestPhred33V2



Imported read-files/sample-manifest.tsv as PairedEndFastqManifestPhred33V2 to paired-end-demux.qza
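
The import above reads a tab-separated manifest. A minimal sketch of how such a file could be generated follows, assuming the PairedEndFastqManifestPhred33V2 header columns `sample-id`, `forward-absolute-filepath`, and `reverse-absolute-filepath`. The sample IDs and paths are hypothetical placeholders, not the files used in this notebook, and `write_manifest` is an illustrative helper rather than part of QIIME 2.

```python
import csv
import io


def write_manifest(samples, handle):
    """Write a paired-end manifest (one row per sample) to an open text handle."""
    writer = csv.writer(handle, delimiter="\t", lineterminator="\n")
    writer.writerow(
        ["sample-id", "forward-absolute-filepath", "reverse-absolute-filepath"]
    )
    for sample_id, (forward, reverse) in sorted(samples.items()):
        writer.writerow([sample_id, forward, reverse])


# Hypothetical samples; the manifest format expects absolute paths.
samples = {
    "sampleA": ("/data/reads/sampleA_R1.fastq.gz", "/data/reads/sampleA_R2.fastq.gz"),
    "sampleB": ("/data/reads/sampleB_R1.fastq.gz", "/data/reads/sampleB_R2.fastq.gz"),
}

buffer = io.StringIO()
write_manifest(samples, buffer)
manifest_text = buffer.getvalue()
print(manifest_text)
```

To write a real file, pass an open file handle (for example `open("read-files/sample-manifest.tsv", "w", newline="")`) instead of the in-memory buffer.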

In [8]:
# Denoising step. Note: denoise-single uses only the forward reads of the paired-end artifact.

!qiime dada2 denoise-single \
    --i-demultiplexed-seqs paired-end-demux.qza \
    --p-trunc-len 140 \
    --p-n-threads 2 \
    --output-dir dada2 --verbose

Running external command line application(s). This may print messages to stdout and/or stderr.
The command(s) being run are below. These commands cannot be manually re-run as they will depend on temporary files that no longer exist.

Command: run_dada_single.R /tmp/q2-SingleLanePerSampleSingleEndFastqDirFmt-y0l7b_jb /tmp/tmp8_8_mqns/output.tsv.biom /tmp/tmp8_8_mqns/track.tsv /tmp/tmp8_8_mqns 140 0 2.0 2 Inf independent consensus 1.0 2 1000000 NULL 16

R version 4.0.5 (2021-03-31) 
Loading required package: Rcpp
DADA2: 1.18.0 / Rcpp: 1.0.7 / RcppParallel: 5.1.4 
1) Filtering ....
2) Learning Error Rates
39056920 total bases in 278978 reads from 4 samples will be used for learning the error rates.
3) Denoise samples ....
4) Remove chimeras (method = consensus)
5) Report read numbers through the pipeline
6) Write output
Saved FeatureTable[Frequency] to: dada2/table.qza
Saved FeatureData[Sequence] to: dada2/representative_sequences.qza
Saved SampleData[DADA2Stats] to

In [9]:
!qiime metadata tabulate \
    --m-input-file dada2/denoising_stats.qza \
    --o-visualization dada2/denoising-stats.qzv

Saved Visualization to: dada2/denoising-stats.qzv
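
The denoising statistics are mainly useful for checking how many reads survive each DADA2 stage. The sketch below computes the percentage of input reads that remain after chimera removal. It assumes a TSV with `input` and `non-chimeric` columns and a `#q2:types` directive row, as in an exported stats file; the column names and all counts here are illustrative assumptions, not values from this run.

```python
import csv
import io

# Hypothetical excerpt of an exported denoising-stats table.
STATS_TSV = (
    "sample-id\tinput\tfiltered\tdenoised\tnon-chimeric\n"
    "#q2:types\tnumeric\tnumeric\tnumeric\tnumeric\n"
    "sampleA\t1000\t900\t880\t800\n"
    "sampleB\t2000\t1500\t1400\t1000\n"
)


def percent_retained(tsv_text):
    """Return {sample-id: percent of input reads left after chimera removal}."""
    reader = csv.DictReader(io.StringIO(tsv_text), delimiter="\t")
    result = {}
    for row in reader:
        if row["sample-id"].startswith("#"):
            continue  # skip the '#q2:types' directive row
        reads_in = int(row["input"])
        reads_out = int(row["non-chimeric"])
        result[row["sample-id"]] = 100.0 * reads_out / reads_in if reads_in else 0.0
    return result


retained = percent_retained(STATS_TSV)
for sample_id, pct in retained.items():
    print(f"{sample_id}: {pct:.1f}% of input reads retained")
```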

In [13]:
!qiime phylogeny align-to-tree-mafft-fasttree \
    --i-sequences dada2/representative_sequences.qza \
    --output-dir tree

Saved FeatureData[AlignedSequence] to: tree/alignment.qza
Saved FeatureData[AlignedSequence] to: tree/masked_alignment.qza
Saved Phylogeny[Unrooted] to: tree/tree.qza
Saved Phylogeny[Rooted] to: tree/rooted_tree.qza

In [14]:
!qiime empress tree-plot \
    --i-tree tree/rooted_tree.qza \
    --o-visualization tree/empress.qzv

Saved Visualization to: tree/empress.qzv

In [40]:
# Core diversity metrics (alpha and beta), rarefied to 10,000 sequences per sample

!qiime diversity core-metrics-phylogenetic \
    --i-table dada2/table.qza \
    --i-phylogeny tree/rooted_tree.qza \
    --p-sampling-depth 10000 \
    --m-metadata-file 'read-files/metadata.tsv' \
    --output-dir diversity

Usage: [94mqiime diversity core-metrics-phylogenetic[0m [OPTIONS]

  Applies a collection of diversity metrics (both phylogenetic and non-
  phylogenetic) to a feature table.

[1mInputs[0m:
  [94m[4m--i-table[0m ARTIFACT [32mFeatureTable[Frequency][0m
                          The feature table containing the samples over which
                          diversity metrics should be computed.     [35m[required][0m
  [94m[4m--i-phylogeny[0m ARTIFACT  Phylogenetic tree containing tip identifiers that
    [32mPhylogeny[Rooted][0m     correspond to the feature identifiers in the table.
                          This tree can contain tip ids that are not present
                          in the table, but all feature ids in the table must
                          be present in this tree.                  [35m[required][0m
[1mParameters[0m:
  [94m[4m--p-sampling-depth[0m INTEGER
    [32mRange(1, None)[0m        The total frequency that each sample shou
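
`--p-sampling-depth` is the total frequency each sample is rarefied to, and samples whose total frequency falls below it are dropped from the rarefied table. The sketch below shows one way to check which samples a depth of 10000 would keep. The per-sample totals are hypothetical, and `samples_retained` is an illustrative helper, not a QIIME 2 function.

```python
def samples_retained(totals, depth):
    """Split sample IDs into those kept and dropped at a given rarefaction depth."""
    kept = sorted(s for s, n in totals.items() if n >= depth)
    dropped = sorted(s for s, n in totals.items() if n < depth)
    return kept, dropped


# Hypothetical per-sample sequence totals (e.g. read off a feature-table summary).
totals = {"sampleA": 25310, "sampleB": 9874, "sampleC": 14002, "sampleD": 10000}

kept, dropped = samples_retained(totals, depth=10000)
print("kept:", kept)
print("dropped:", dropped)
```

Choosing the depth is a trade-off: a higher value keeps more sequences per sample but discards more samples.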

In [26]:
!qiime diversity alpha-group-significance \
    --i-alpha-diversity diversity/shannon_vector.qza \
    --m-metadata-file 'read-files/metadata.tsv' \
    --o-visualization diversity/alpha_groups.qzv

Saved Visualization to: diversity/alpha_groups.qzv
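
For reference, the Shannon index stored in `shannon_vector.qza` summarizes how evenly sequences are spread across features within a sample. A minimal sketch of the calculation on raw counts follows. The logarithm base of 2 is an assumption made for illustration (check the plugin version in use), and the counts are made up.

```python
import math


def shannon(counts, base=2.0):
    """Shannon entropy of a vector of non-negative feature counts."""
    total = sum(counts)
    if total == 0:
        return 0.0
    entropy = 0.0
    for count in counts:
        if count > 0:  # zero counts contribute nothing
            p = count / total
            entropy -= p * math.log(p, base)
    return entropy


# Hypothetical samples: two equally abundant features vs. one dominant feature.
even_sample = shannon([10, 10])
single_feature_sample = shannon([5, 0, 0])
print(even_sample, single_feature_sample)
```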

In [39]:
# PERMANOVA on weighted UniFrac distances via the adonis action

!qiime diversity adonis \
    --i-distance-matrix diversity/weighted_unifrac_distance_matrix.qza \
    --m-metadata-file read-files/metadata.tsv \
    --p-formula "treatment" \
    --p-n-jobs 2 \
    --o-visualization diversity/permanova.qzv

Saved Visualization to: diversity/permanova.qzv
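
To make the PERMANOVA result easier to interpret, the sketch below computes the one-way pseudo-F statistic and R² directly from a distance matrix, using the standard partition of squared distances into among-group and within-group sums of squares. It is not the adonis implementation and omits the permutation test that produces the p-value. The distance matrix and group labels are hypothetical.

```python
from itertools import combinations


def permanova_pseudo_f(dm, groups):
    """Return (pseudo-F, R^2) for a square distance matrix and one grouping factor."""
    n = len(groups)
    labels = sorted(set(groups))
    a = len(labels)
    # Total sum of squares: squared distances over all pairs, divided by N.
    ss_total = sum(dm[i][j] ** 2 for i, j in combinations(range(n), 2)) / n
    # Within-group sum of squares: same quantity computed per group.
    ss_within = 0.0
    for label in labels:
        members = [i for i, g in enumerate(groups) if g == label]
        ss_within += sum(dm[i][j] ** 2 for i, j in combinations(members, 2)) / len(members)
    ss_among = ss_total - ss_within
    f_stat = (ss_among / (a - 1)) / (ss_within / (n - a))
    r_squared = ss_among / ss_total
    return f_stat, r_squared


# Hypothetical distances: samples 0-1 and 2-3 are close within groups, far between.
dm = [
    [0, 1, 3, 3],
    [1, 0, 3, 3],
    [3, 3, 0, 1],
    [3, 3, 1, 0],
]
groups = ["control", "control", "treated", "treated"]

f_stat, r_squared = permanova_pseudo_f(dm, groups)
print(f"pseudo-F = {f_stat:.2f}, R^2 = {r_squared:.3f}")
```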

In [11]:
!qiime picrust2 full-pipeline --help


Usage: [94mqiime picrust2 full-pipeline[0m [OPTIONS]

  QIIME 2 plugin for default 16S PICRUSt2 pipeline

[1mInputs[0m:
  [94m[4m--i-table[0m ARTIFACT [32mFeatureTable[Frequency][0m
                       The feature table containing sequence abundances per
                       sample.                                      [35m[required][0m
  [94m[4m--i-seq[0m ARTIFACT [32mFeatureData[Sequence][0m
                       Sequences (e.g. ASVs or representative OTUs)
                       corresponding to the abundance table given.  [35m[required][0m
[1mParameters[0m:
  [94m--p-threads[0m INTEGER  Number of threads/processes to use during workflow.
    [32mRange(1, None)[0m                                                [35m[default: 1][0m
  [94m--p-hsp-method[0m TEXT [32mChoices('mp', 'emp_prob', 'pic', 'scp',[0m
    [32m'subtree_average')[0m Which hidden-state prediction method to use.
                                                    

In [26]:
# Run the PICRUSt2 pipeline on the DADA2 feature table and its representative sequences (ASVs)

!qiime picrust2 full-pipeline \
    --i-table dada2/table.qza \
    --i-seq dada2/representative_sequences.qza \
    --output-dir q2-picrust2 \
    --p-placement-tool sepp \
    --p-threads 6 \
    --p-hsp-method pic \
    --p-max-nsti 2 \
    --verbose


This is the set of poorly aligned input sequences to be excluded: 7e879359d8a9e9b3c08b121533609d54





All ASVs were below the max NSTI cut-off of 2.0 and so all were retained for downstream analyses.

All ASVs were below the max NSTI cut-off of 2.0 and so all were retained for downstream analyses.


Saved FeatureTable[Frequency] to: q2-picrust2/ko_metagenome.qza
Saved FeatureTable[Frequency] to: q2-picrust2/ec_metagenome.qza
Saved FeatureTable[Frequency] to: q2-picrust2/pathway_abundance.qza
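
The log above reports that no ASV exceeded the `--p-max-nsti 2` cut-off. As an illustration of what that filter does, the sketch below drops ASVs whose NSTI value is above the threshold. The NSTI values and ASV names are hypothetical, the treatment of values exactly at the cut-off is an assumption, and `filter_by_nsti` is an illustrative helper, not a PICRUSt2 function.

```python
def filter_by_nsti(nsti, max_nsti=2.0):
    """Keep ASVs with NSTI at or below the cut-off; report the excluded IDs."""
    kept = {asv: value for asv, value in nsti.items() if value <= max_nsti}
    excluded = sorted(set(nsti) - set(kept))
    return kept, excluded


# Hypothetical NSTI values per ASV.
nsti = {"asv1": 0.03, "asv2": 0.41, "asv3": 2.75}

kept_asvs, excluded_asvs = filter_by_nsti(nsti, max_nsti=2.0)
print("kept:", sorted(kept_asvs))
print("excluded:", excluded_asvs)
```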

In [27]:
# Summarize the PICRUSt2 pathway abundance table
!qiime feature-table summarize \
    --i-table q2-picrust2/pathway_abundance.qza \
    --o-visualization q2-picrust2/pathway_abundance.qzv 

Saved Visualization to: q2-picrust2/pathway_abundance.qzv

In [28]:
!qiime tools export \
    --input-path q2-picrust2/pathway_abundance.qza \
    --output-path pathabun_exported 

Exported q2-picrust2/pathway_abundance.qza as BIOMV210DirFmt to directory pathabun_exported

In [29]:
!biom convert \
    -i pathabun_exported/feature-table.biom \
    -o pathabun_exported/feature-table.biom.tsv \
    --to-tsv
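
The TSV written by `biom convert` starts with a `# Constructed from biom file` comment line, followed by a header row beginning with `#OTU ID`. A minimal sketch for loading such a table with the standard library follows. The pathway IDs, sample names, and abundances are hypothetical, and `read_biom_tsv` is an illustrative helper.

```python
import csv
import io

# Hypothetical excerpt mimicking pathabun_exported/feature-table.biom.tsv.
EXPORTED_TSV = (
    "# Constructed from biom file\n"
    "#OTU ID\tsampleA\tsampleB\n"
    "PWY-0001\t12.5\t0.0\n"
    "PWY-0002\t3.0\t7.25\n"
)


def read_biom_tsv(handle):
    """Parse a biom-converted TSV into {feature_id: {sample_id: abundance}}."""
    header = None
    table = {}
    for row in csv.reader(handle, delimiter="\t"):
        if not row:
            continue
        if row[0].startswith("#OTU ID"):
            header = row[1:]  # sample IDs
            continue
        if row[0].startswith("#"):
            continue  # leading comment line
        if header is None:
            raise ValueError("data row found before the '#OTU ID' header")
        table[row[0]] = dict(zip(header, (float(v) for v in row[1:])))
    return table


pathways = read_biom_tsv(io.StringIO(EXPORTED_TSV))
print(pathways["PWY-0002"])
```

For the real file, replace the in-memory string with `open("pathabun_exported/feature-table.biom.tsv", newline="")`.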