# QIIME2 analysis of Qiita ID 11884
### Natural history of the infant gut microbiome and impact of antibiotic treatment on bacterial strain diversity and stability (DIABIMMUNE antibiotics)

#### Abstract

The goal of the DIABIMMUNE antibiotics cohort is to study the effect of repeated antibiotic treatments on the developing infant gut microbiome. Despite widespread use of antibiotics in children, the effect of antibiotic exposure on the developing infant gut microbiome has remained underexplored. Yassour et al. present a longitudinal study capturing how the gut microbiome responds to and recovers from antibiotic perturbations. Antibiotic-treated children had less stable and less diverse communities. Antibiotic resistance genes within the guts of these children peaked after antibiotic treatment but generally returned rapidly to baseline. Delivery mode (vaginal versus Cesarean) also had strong long-term effects on microbial diversity. These data give insight into the consequences of early life factors such as birth mode and antibiotic treatment.

The gut microbial community is dynamic during the first three years of life, before stabilizing to an adult-like state. However, little is known about the impact of environmental factors on the developing human gut microbiome. Here, we report a longitudinal study of the gut microbiome based on DNA sequence analysis of monthly stool samples and clinical information from 39 children, approximately half of whom received multiple courses of antibiotics during the first three years of life. Whereas the gut microbiome of most children born by vaginal delivery was dominated by Bacteroides species, the four children born by Cesarean section and approximately 20% of vaginally born children lacked Bacteroides in the first six to eighteen months of life. Longitudinal sampling, coupled with whole-genome shotgun sequencing, allowed detection of strain-level variation as well as the abundance of antibiotic resistance genes. The microbiota of antibiotic-treated children was less diverse in terms of both bacterial species and strains, with some species often dominated by single strains. In addition, we observed short-term composition changes between consecutive samples from children treated with antibiotics. Antibiotic resistance genes carried on microbial chromosomes showed a peak in abundance after antibiotic treatment followed by a sharp decline, whereas some genes carried on mobile elements persisted longer after antibiotic therapy ended. Our results highlight the value of dense longitudinal studies with high-resolution strain profiles for studying the establishment and response to perturbation of the infant gut microbiome.


### 1.1 “Parameters” that specify the input data locations

In [2]:
# Input data locations
%env INPUT_BIOM_TABLE_PATH=../../data/DIABIMMUNE-Qiita-11884/DIABIMMUNE_month.biom
%env INPUT_REPRESENTATIVE_SEQUENCES_PATH=../../data/DIABIMMUNE-Qiita-11884/72835_reference-hit.seqs.fa
%env INPUT_SAMPLE_METADATA_PATH=../../data/DIABIMMUNE-Qiita-11884/11884_20190508-173103-added-month-abx.txt
%env INPUT_SAMPLE_METADATA_TIME_PATH=../../data/DIABIMMUNE-Qiita-11884/metadata-added-month-abx-lowtime-rm.txt
# Rarefaction depth used in the "Diversity Analyses" section
%env RAREFACTION_DEPTH=4000
# Feature classifier (from QIIME 2, downloaded 5/31/2019)
%env FEATURE_CLASSIFIER_QZA_PATH=../../data/data-assests/gg-13-8-99-515-806-nb-classifier.qza
# Output directory
%env OUTPUT_DIRECTORY=../../data/DIABIMMUNE-Qiita-11884/q2-analysis


env: INPUT_BIOM_TABLE_PATH=../../data/DIABIMMUNE-Qiita-11884/DIABIMMUNE_month.biom
env: INPUT_REPRESENTATIVE_SEQUENCES_PATH=../../data/DIABIMMUNE-Qiita-11884/72835_reference-hit.seqs.fa
env: INPUT_SAMPLE_METADATA_PATH=../../data/DIABIMMUNE-Qiita-11884/11884_20190508-173103-added-month-abx.txt
env: INPUT_SAMPLE_METADATA_TIME_PATH=../../data/DIABIMMUNE-Qiita-11884/metadata-added-month-abx-lowtime-rm.txt
env: RAREFACTION_DEPTH=4000
env: FEATURE_CLASSIFIER_QZA_PATH=../../data/data-assests/gg-13-8-99-515-806-nb-classifier.qza
env: OUTPUT_DIRECTORY=../../data/DIABIMMUNE-Qiita-11884/q2-analysis
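
Because every later cell depends on these variables, it can help to check them once before running anything expensive. The following is a minimal sketch, not part of the original notebook: the helper names `find_missing_inputs` and `parse_rarefaction_depth` are hypothetical, and the list of variables simply mirrors the `%env` lines above.

```python
import os

# Environment variables from the parameters cell that must point at existing files.
INPUT_VARIABLES = [
    "INPUT_BIOM_TABLE_PATH",
    "INPUT_REPRESENTATIVE_SEQUENCES_PATH",
    "INPUT_SAMPLE_METADATA_PATH",
    "INPUT_SAMPLE_METADATA_TIME_PATH",
    "FEATURE_CLASSIFIER_QZA_PATH",
]


def find_missing_inputs(environ, names=INPUT_VARIABLES):
    """Return (name, path) pairs whose variable is unset or whose file does not exist."""
    missing = []
    for name in names:
        path = environ.get(name)
        if not path or not os.path.isfile(path):
            missing.append((name, path))
    return missing


def parse_rarefaction_depth(environ):
    """Read RAREFACTION_DEPTH as a positive integer, raising ValueError otherwise."""
    depth = int(environ.get("RAREFACTION_DEPTH", ""))
    if depth <= 0:
        raise ValueError("RAREFACTION_DEPTH must be positive")
    return depth
```

In a notebook cell, `find_missing_inputs(os.environ)` would return an empty list when all input files are in place.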


### 1.2 Setting up output and environment 

In [2]:
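# Each `!` command runs in its own subshell, so this activation does not persist
# to later cells (and fails here); activate qiime2-2019.7 before launching Jupyter.
!source activate qiime2-2019.7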

/bin/sh: activate: No such file or directory
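
Since the activation above does not take effect inside the kernel, one alternative is to prepare the output directory and confirm that the `qiime` executable is reachable from Python. This is a sketch under assumptions: `prepare_environment` is a hypothetical helper, and it presumes the QIIME 2 environment was activated before the notebook server started.

```python
import os
import shutil


def prepare_environment(output_directory, executable="qiime"):
    """Create the output directory and report whether the executable is on PATH."""
    os.makedirs(output_directory, exist_ok=True)
    return shutil.which(executable) is not None


# Example use in a notebook cell (assumes OUTPUT_DIRECTORY was set via %env):
#   if not prepare_environment(os.environ["OUTPUT_DIRECTORY"]):
#       print("qiime not found on PATH")
```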


### 1.3 Importing Data

In [6]:
!qiime tools import \
    --type "FeatureTable[Frequency]" \
    --input-path $INPUT_BIOM_TABLE_PATH \
    --output-path $OUTPUT_DIRECTORY/table.qza

Imported ../../data/DIABIMMUNE-Qiita-11884/DIABIMMUNE_month.biom as BIOMV210DirFmt to ../../data/DIABIMMUNE-Qiita-11884/q2-analysis/table.qza


In [7]:
!qiime tools import \
    --type "FeatureData[Sequence]" \
    --input-path $INPUT_REPRESENTATIVE_SEQUENCES_PATH \
    --output-path $OUTPUT_DIRECTORY/rep-seqs.qza

Imported ../../data/DIABIMMUNE-Qiita-11884/72835_reference-hit.seqs.fa as DNASequencesDirectoryFormat to ../../data/DIABIMMUNE-Qiita-11884/q2-analysis/rep-seqs.qza


### 1.4 Summarize the (filtered) table and representative sequences

In [8]:
!qiime feature-table summarize \
    --i-table $OUTPUT_DIRECTORY/table.qza \
    --o-visualization $OUTPUT_DIRECTORY/table.qzv \
    --m-sample-metadata-file $INPUT_SAMPLE_METADATA_PATH

Saved Visualization to: ../../data/DIABIMMUNE-Qiita-11884/q2-analysis/table.qzv


In [9]:
!qiime feature-table tabulate-seqs \
    --i-data $OUTPUT_DIRECTORY/rep-seqs.qza \
    --o-visualization $OUTPUT_DIRECTORY/rep-seqs.qzv

Saved Visualization to: ../../data/DIABIMMUNE-Qiita-11884/q2-analysis/rep-seqs.qzv


### 1.5 Generate phylogenetic trees

In [2]:
!qiime fragment-insertion sepp \
    --i-representative-sequences $OUTPUT_DIRECTORY/rep-seqs.qza \
    --o-tree $OUTPUT_DIRECTORY/insertion-tree.qza \
    --o-placements $OUTPUT_DIRECTORY/tree-placements.qza

Saved Phylogeny[Rooted] to: ../../data/DIABIMMUNE-Qiita-11884/q2-analysis/insertion-tree.qza
Saved Placements to: ../../data/DIABIMMUNE-Qiita-11884/q2-analysis/tree-placements.qza


### 1.6 Diversity Analyses (all core-metrics & beta-diversity aitchison)

In [3]:
!qiime diversity core-metrics-phylogenetic \
    --i-phylogeny $OUTPUT_DIRECTORY/insertion-tree.qza \
    --i-table $OUTPUT_DIRECTORY/table.qza \
    --p-sampling-depth $RAREFACTION_DEPTH \
    --m-metadata-file $INPUT_SAMPLE_METADATA_PATH \
    --output-dir $OUTPUT_DIRECTORY/core-metrics-results

Saved FeatureTable[Frequency] to: ../../data/DIABIMMUNE-Qiita-11884/q2-analysis/core-metrics-results/rarefied_table.qza
Saved SampleData[AlphaDiversity] % Properties('phylogenetic') to: ../../data/DIABIMMUNE-Qiita-11884/q2-analysis/core-metrics-results/faith_pd_vector.qza
Saved SampleData[AlphaDiversity] to: ../../data/DIABIMMUNE-Qiita-11884/q2-analysis/core-metrics-results/observed_otus_vector.qza
Saved SampleData[AlphaDiversity] to: ../../data/DIABIMMUNE-Qiita-11884/q2-analysis/core-metrics-results/shannon_vector.qza
Saved SampleData[AlphaDiversity] to: ../../data/DIABIMMUNE-Qiita-11884/q2-analysis/core-metrics-results/evenness_vector.qza
Saved DistanceMatrix % Properties('phylogenetic') to: ../../data/DIABIMMUNE-Qiita-11884/q2-analysis/core-metrics-results/unweighted_unifrac_distance_matrix.qza
Saved DistanceMatrix % Properties('phylogenetic') to: ../../data/DIABIMMUNE-Qiita-11884/q2-analysis/core-metrics-results/weighted_un
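
The `--p-sampling-depth` option rarefies every sample to the same number of reads before the core metrics are computed, and samples with fewer reads than the depth are dropped. The sketch below illustrates that idea in plain Python; it is not the q2-diversity implementation, and the `rarefy` function and its dict-based table layout are assumptions made for illustration.

```python
import random


def rarefy(counts, depth, seed=None):
    """Subsample a {feature: count} dict to exactly `depth` reads without replacement.

    Returns None when the sample has fewer than `depth` reads, mirroring the
    way samples below the sampling depth are dropped.
    """
    total = sum(counts.values())
    if total < depth:
        return None
    # Expand counts into one entry per read, then draw `depth` reads.
    reads = [feature for feature, n in counts.items() for _ in range(n)]
    rng = random.Random(seed)
    rarefied = {}
    for feature in rng.sample(reads, depth):
        rarefied[feature] = rarefied.get(feature, 0) + 1
    return rarefied
```

With `RAREFACTION_DEPTH=4000`, a sample holding 8,010 reads would be reduced to 4,000, while a sample holding 100 reads would be excluded.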

In [4]:
!qiime diversity beta \
    --i-table $OUTPUT_DIRECTORY/table.qza \
    --p-metric aitchison \
    --p-pseudocount 1 \
    --o-distance-matrix $OUTPUT_DIRECTORY/core-metrics-results/aitchison_distance_matrix.qza 

!qiime diversity pcoa \
    --i-distance-matrix $OUTPUT_DIRECTORY/core-metrics-results/aitchison_distance_matrix.qza \
    --o-pcoa $OUTPUT_DIRECTORY/core-metrics-results/aitchison_pcoa_results.qza

!qiime emperor plot \
    --i-pcoa $OUTPUT_DIRECTORY/core-metrics-results/aitchison_pcoa_results.qza \
    --m-metadata-file $INPUT_SAMPLE_METADATA_PATH \
    --o-visualization $OUTPUT_DIRECTORY/core-metrics-results/aitchisons_emperor.qzv

Saved DistanceMatrix to: ../../data/DIABIMMUNE-Qiita-11884/q2-analysis/core-metrics-results/aitchison_distance_matrix.qza
Saved PCoAResults to: ../../data/DIABIMMUNE-Qiita-11884/q2-analysis/core-metrics-results/aitchison_pcoa_results.qza
Saved Visualization to: ../../data/DIABIMMUNE-Qiita-11884/q2-analysis/core-metrics-results/aitchisons_emperor.qzv
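
The Aitchison distance requested above with `--p-metric aitchison` is the Euclidean distance between centered log-ratio (CLR) transformed samples, and the pseudocount keeps zero counts from producing undefined logarithms. The following is a small illustrative sketch in plain Python, not the code QIIME 2 runs; the function names `clr` and `aitchison_distance` are chosen here for illustration.

```python
import math


def clr(counts, pseudocount=1):
    """Centered log-ratio transform of a list of counts after adding a pseudocount."""
    logs = [math.log(c + pseudocount) for c in counts]
    mean_log = sum(logs) / len(logs)
    return [value - mean_log for value in logs]


def aitchison_distance(x, y, pseudocount=1):
    """Euclidean distance between the CLR transforms of two count vectors."""
    cx, cy = clr(x, pseudocount), clr(y, pseudocount)
    return math.sqrt(sum((a - b) ** 2 for a, b in zip(cx, cy)))
```

Without a pseudocount the distance is unchanged when a sample is multiplied by a constant, which is why the metric is commonly applied to tables that have not been rarefied; adding a pseudocount makes that invariance only approximate.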


In [5]:
!qiime diversity beta-phylogenetic \
    --i-table $OUTPUT_DIRECTORY/core-metrics-results/rarefied_table.qza \
    --p-metric generalized_unifrac \
    --i-phylogeny $OUTPUT_DIRECTORY/insertion-tree.qza \
    --p-alpha 1 \
    --o-distance-matrix $OUTPUT_DIRECTORY/core-metrics-results/gUniFrac_alpha1_distance_matrix.qza 

!qiime diversity pcoa \
    --i-distance-matrix $OUTPUT_DIRECTORY/core-metrics-results/gUniFrac_alpha1_distance_matrix.qza \
    --o-pcoa $OUTPUT_DIRECTORY/core-metrics-results/gUniFrac_alpha1_pcoa_results.qza

!qiime emperor plot \
    --i-pcoa $OUTPUT_DIRECTORY/core-metrics-results/gUniFrac_alpha1_pcoa_results.qza \
    --m-metadata-file $INPUT_SAMPLE_METADATA_PATH \
    --o-visualization $OUTPUT_DIRECTORY/core-metrics-results/gUniFrac_alpha1_emperor.qzv

!qiime diversity beta-phylogenetic \
    --i-table $OUTPUT_DIRECTORY/core-metrics-results/rarefied_table.qza \
    --p-metric generalized_unifrac \
    --i-phylogeny $OUTPUT_DIRECTORY/insertion-tree.qza \
    --p-alpha 0.5 \
    --o-distance-matrix $OUTPUT_DIRECTORY/core-metrics-results/gUniFrac_alpha5_distance_matrix.qza 

!qiime diversity pcoa \
    --i-distance-matrix $OUTPUT_DIRECTORY/core-metrics-results/gUniFrac_alpha5_distance_matrix.qza \
    --o-pcoa $OUTPUT_DIRECTORY/core-metrics-results/gUniFrac_alpha5_pcoa_results.qza

!qiime emperor plot \
    --i-pcoa $OUTPUT_DIRECTORY/core-metrics-results/gUniFrac_alpha5_pcoa_results.qza \
    --m-metadata-file $INPUT_SAMPLE_METADATA_PATH \
    --o-visualization $OUTPUT_DIRECTORY/core-metrics-results/gUniFrac_alpha5_emperor.qzv

!qiime diversity beta-phylogenetic \
    --i-table $OUTPUT_DIRECTORY/core-metrics-results/rarefied_table.qza \
    --p-metric generalized_unifrac \
    --i-phylogeny $OUTPUT_DIRECTORY/insertion-tree.qza \
    --p-alpha 0 \
    --o-distance-matrix $OUTPUT_DIRECTORY/core-metrics-results/gUniFrac_alpha0_distance_matrix.qza 

!qiime diversity pcoa \
    --i-distance-matrix $OUTPUT_DIRECTORY/core-metrics-results/gUniFrac_alpha0_distance_matrix.qza \
    --o-pcoa $OUTPUT_DIRECTORY/core-metrics-results/gUniFrac_alpha0_pcoa_results.qza

!qiime emperor plot \
    --i-pcoa $OUTPUT_DIRECTORY/core-metrics-results/gUniFrac_alpha0_pcoa_results.qza \
    --m-metadata-file $INPUT_SAMPLE_METADATA_PATH \
    --o-visualization $OUTPUT_DIRECTORY/core-metrics-results/gUniFrac_alpha0_emperor.qzv


Saved DistanceMatrix % Properties('phylogenetic') to: ../../data/DIABIMMUNE-Qiita-11884/q2-analysis/core-metrics-results/gUniFrac_alpha1_distance_matrix.qza
Saved PCoAResults to: ../../data/DIABIMMUNE-Qiita-11884/q2-analysis/core-metrics-results/gUniFrac_alpha1_pcoa_results.qza
Saved Visualization to: ../../data/DIABIMMUNE-Qiita-11884/q2-analysis/core-metrics-results/gUniFrac_alpha1_emperor.qzv
Saved DistanceMatrix % Properties('phylogenetic') to: ../../data/DIABIMMUNE-Qiita-11884/q2-analysis/core-metrics-results/gUniFrac_alpha5_distance_matrix.qza
Saved PCoAResults to: ../../data/DIABIMMUNE-Qiita-11884/q2-analysis/core-metrics-results/gUniFrac_alpha5_pcoa_results.qza
Saved Visualization to: ../../data/DIABIMMUNE-Qiita-11884/q2-analysis/core-metrics-results/gUniFrac_alpha5_emperor.qzv
Saved DistanceMatrix % Properties('phylogenetic') to: ../../data/DIABIMMUNE-Qiita-11884/q2-analysis/core-metrics-results/gUniFrac_alpha0_distance
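
The three runs above differ only in `--p-alpha`, which controls how strongly abundant lineages are weighted. As a rough illustration, the sketch below evaluates the generalized UniFrac formula for two samples given, for each branch, its length and the proportion of each sample descending from it. It assumes those per-branch proportions are already computed and is not the implementation QIIME 2 uses; the function name and argument layout are illustrative.

```python
def generalized_unifrac(branch_lengths, prop_a, prop_b, alpha):
    """Generalized UniFrac from per-branch lengths and descendant proportions.

    Each branch is weighted by length * (pA + pB) ** alpha and contributes
    |pA - pB| / (pA + pB) to the weighted average.
    """
    numerator = 0.0
    denominator = 0.0
    for length, pa, pb in zip(branch_lengths, prop_a, prop_b):
        total = pa + pb
        if total == 0:
            continue  # a branch absent from both samples contributes nothing
        weight = length * total ** alpha
        numerator += weight * abs(pa - pb) / total
        denominator += weight
    return numerator / denominator if denominator else 0.0
```

With `alpha=1` the expression reduces to normalized weighted UniFrac, while `alpha=0` weights every observed branch by its length alone, so rare lineages count as much as dominant ones; `alpha=0.5` sits between the two.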

### 1.7 Taxonomic Analysis

In [6]:
!qiime feature-classifier classify-sklearn \
    --i-classifier $FEATURE_CLASSIFIER_QZA_PATH \
    --i-reads $OUTPUT_DIRECTORY/rep-seqs.qza \
    --o-classification $OUTPUT_DIRECTORY/taxonomy.qza

Saved FeatureData[Taxonomy] to: ../../data/DIABIMMUNE-Qiita-11884/q2-analysis/taxonomy.qza


### 1.7.2 Summarize the Taxonomic Classifications

In [7]:
!qiime metadata tabulate \
    --m-input-file $OUTPUT_DIRECTORY/taxonomy.qza \
    --o-visualization $OUTPUT_DIRECTORY/taxonomy.qzv

Saved Visualization to: ../../data/DIABIMMUNE-Qiita-11884/q2-analysis/taxonomy.qzv
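
Because the classifier used here was trained on Greengenes 13_8, the taxonomy strings are expected to follow the Greengenes convention of rank prefixes separated by semicolons (for example `k__Bacteria; p__Bacteroidetes; ...`). A minimal sketch for splitting such a string into ranks is shown below; `parse_greengenes_taxon` is a hypothetical helper and the example string is illustrative rather than taken from this dataset.

```python
RANK_PREFIXES = {
    "k": "kingdom", "p": "phylum", "c": "class", "o": "order",
    "f": "family", "g": "genus", "s": "species",
}


def parse_greengenes_taxon(taxon):
    """Split 'k__Bacteria; p__Firmicutes; ...' into a {rank: name} dict.

    Ranks with an empty name (for example 'g__') are left out.
    """
    ranks = {}
    for field in taxon.split(";"):
        prefix, sep, name = field.strip().partition("__")
        if sep and name and prefix in RANK_PREFIXES:
            ranks[RANK_PREFIXES[prefix]] = name
    return ranks
```

Applied to an exported taxonomy table, this makes it straightforward to group features by genus, for instance to follow Bacteroides over time.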


### 1.8.1 Compositional Tensor Factorization (CTF)

In [38]:
!qiime feature-table filter-features \
    --i-table $OUTPUT_DIRECTORY/table.qza \
    --p-min-frequency 10 \
    --o-filtered-table $OUTPUT_DIRECTORY/table-filt.qza

Saved FeatureTable[Frequency] to: ../../data/DIABIMMUNE-Qiita-11884/q2-analysis/table-filt.qza
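
The `--p-min-frequency 10` filter keeps only features whose total count, summed over all samples, is at least 10. The sketch below reproduces that rule on a small in-memory table; the nested-dict layout and the `filter_features` name are assumptions for illustration and not how QIIME 2 stores or filters tables.

```python
def filter_features(table, min_frequency):
    """Drop features whose total count across all samples is below `min_frequency`.

    `table` maps sample ID -> {feature ID: count}.
    """
    totals = {}
    for counts in table.values():
        for feature, n in counts.items():
            totals[feature] = totals.get(feature, 0) + n
    keep = {feature for feature, total in totals.items() if total >= min_frequency}
    return {
        sample: {f: n for f, n in counts.items() if f in keep}
        for sample, counts in table.items()
    }
```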


In [3]:
!qiime gemelli ctf \
    --i-table $OUTPUT_DIRECTORY/table-filt.qza \
    --m-sample-metadata-file $INPUT_SAMPLE_METADATA_PATH \
    --p-individual-id-column subjectid \
    --p-state-column month \
    --m-feature-metadata-file $OUTPUT_DIRECTORY/taxonomy.qza \
    --p-min-sample-count 2000 \
    --p-min-feature-count 0 \
    --p-max-iterations-rptm 25 \
    --p-n-initializations 25 \
    --p-max-iterations-als 25 \
    --output-dir $OUTPUT_DIRECTORY/ctf-results \
    --p-n-components 4 \
    --verbose

Saved PCoAResults % Properties('biplot') to: ../../data/DIABIMMUNE-Qiita-11884/q2-analysis/ctf-results/subject_biplot.qza
Saved PCoAResults % Properties('biplot') to: ../../data/DIABIMMUNE-Qiita-11884/q2-analysis/ctf-results/state_biplot.qza
Saved DistanceMatrix to: ../../data/DIABIMMUNE-Qiita-11884/q2-analysis/ctf-results/distance_matrix.qza
Saved SampleData[SampleTrajectory] to: ../../data/DIABIMMUNE-Qiita-11884/q2-analysis/ctf-results/state_subject_ordination.qza
Saved FeatureData[FeatureTrajectory] to: ../../data/DIABIMMUNE-Qiita-11884/q2-analysis/ctf-results/state_feature_ordination.qza
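
CTF treats the data as a tensor indexed by subject (`subjectid`), state (`month`), and feature, rather than as a flat sample-by-feature table. The sketch below shows only that regrouping step, together with the `--p-min-sample-count` style read-depth filter, on in-memory dicts. It is a conceptual illustration and not gemelli's implementation; `build_tensor` and its data layout are hypothetical, and if a subject has several samples in one state this sketch keeps only the last one seen.

```python
def build_tensor(table, metadata, min_sample_count=0):
    """Arrange samples into a nested dict tensor[subject][state][feature] = count.

    `table` maps sample ID -> {feature ID: count}; `metadata` maps
    sample ID -> (subject, state). Samples below `min_sample_count` total
    reads are skipped, and missing (subject, state) pairs stay absent.
    """
    tensor = {}
    for sample, counts in table.items():
        if sum(counts.values()) < min_sample_count:
            continue
        subject, state = metadata[sample]
        tensor.setdefault(subject, {})[state] = dict(counts)
    return tensor
```

The gaps left by missing subject and state combinations are the reason a factorization that tolerates missing entries is needed for longitudinal cohorts like this one, where not every child was sampled every month.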
