# Functional redundancy

This notebook examines functional redundancy in the context of microbial diversity measurements.
**Objective:** compare taxonomic and functional diversity across the same samples to evaluate _functional redundancy_ in microbial communities.

- Metadata: `sample_meta_data` (done in A)
- Diversity analysis: alpha = `core-metrics-results` (done in F), beta = `core-metrics-results-bd` (done in G)
- Metagenome content predicted by PICRUSt2 (done in I)


#### Notebook overview 

[1. Setup](#setup)<br>
&nbsp;&nbsp;&nbsp;&nbsp;[1.1 _Data Import_](#import_data)<br>
[2. Functional redundancy calculation](#function)<br>
&nbsp;&nbsp;&nbsp;&nbsp;[2.1 _Generate Plots_](#generate_plots)<br>
[3. Clean data](#clean_data)<br>
[4. Generate Statistics](#statistics)<br>
[Note](#note)<br>



<a id='setup'></a>
## 1. Setup

In [2]:
import os
import qiime2 as q2
import pandas as pd
from qiime2 import Visualization



# do not increase this value!
n_jobs = 3
    
%matplotlib inline

In [3]:
data_dir = 'project_data'
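
Before running the steps below, it can help to confirm that the inputs from the earlier notebooks are in place. The following is a minimal sketch: the two file names are the ones used by the commands in this notebook, while the helper `missing_inputs` is a hypothetical name introduced here for illustration.

```python
import os

# Same value as the notebook's data_dir.
data_dir = 'project_data'

# File names taken from the commands in this notebook.
REQUIRED_INPUTS = ['pathway_abundance.qza', 'sample_meta_data.tsv']


def missing_inputs(directory, required=REQUIRED_INPUTS):
    """Return the required file names that are not present in `directory`."""
    return [name for name in required
            if not os.path.isfile(os.path.join(directory, name))]


missing = missing_inputs(data_dir)
if missing:
    print('Missing input files:', ', '.join(missing))
else:
    print('All input files found.')
```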

<a id='function'></a>
## 2. Functional redundancy calculation

Next, we look at predicted pathway information to compare taxonomic and functional diversity patterns.
We run the non-phylogenetic `core-metrics` pipeline on the `pathway_abundance.qza` table, which contains PICRUSt2-predicted pathway counts.

In [5]:
! qiime diversity core-metrics \
  --i-table $data_dir/pathway_abundance.qza \
  --m-metadata-file $data_dir/sample_meta_data.tsv \
  --p-sampling-depth 100000 \
  --p-n-jobs $n_jobs \
  --output-dir $data_dir/core-metrics-picrust2  

Saved FeatureTable[Frequency] to: project_data/core-metrics-picrust2/rarefied_table.qza
Saved SampleData[AlphaDiversity] to: project_data/core-metrics-picrust2/observed_features_vector.qza
Saved SampleData[AlphaDiversity] to: project_data/core-metrics-picrust2/shannon_vector.qza
Saved SampleData[AlphaDiversity] to: project_data/core-metrics-picrust2/evenness_vector.qza
Saved DistanceMatrix to: project_data/core-metrics-picrust2/jaccard_distance_matrix.qza
Saved DistanceMatrix to: project_data/core-metrics-picrust2/bray_curtis_distance_matrix.qza
Saved PCoAResults to: project_data/core-metrics-picrust2/jaccard_pcoa_results.qza
Saved PCoAResults to: project_data/core-metrics-picrust2/bray_curtis_pcoa_results.qza
Saved Visualization to: project_data/core-metrics-picrust2/jaccard_emperor.qzv
Saved Visualization to: project_data/core-metrics-picrust2/bray_curtis_emperor.qzv

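To make the idea of functional redundancy concrete, the sketch below computes Bray-Curtis dissimilarity by hand for two samples, once from taxon counts and once from pathway counts. The sample names and all counts are hypothetical toy values, not results from this dataset, and the function is a plain-Python illustration rather than the implementation QIIME 2 uses.

```python
def bray_curtis(u, v):
    """Bray-Curtis dissimilarity between two count vectors of equal length."""
    numerator = sum(abs(a - b) for a, b in zip(u, v))
    denominator = sum(a + b for a, b in zip(u, v))
    return numerator / denominator


# Hypothetical toy data: the two samples share no taxa ...
taxa = {
    'sample_1': [10, 10, 0, 0],
    'sample_2': [0, 0, 10, 10],
}
# ... but carry nearly the same pathway counts.
pathways = {
    'sample_1': [6, 4, 5, 5],
    'sample_2': [4, 6, 5, 5],
}

taxonomic_distance = bray_curtis(taxa['sample_1'], taxa['sample_2'])
functional_distance = bray_curtis(pathways['sample_1'], pathways['sample_2'])

print(f'taxonomic Bray-Curtis:  {taxonomic_distance:.2f}')   # 1.00
print(f'functional Bray-Curtis: {functional_distance:.2f}')  # 0.10
```

In this toy case the samples are maximally different taxonomically yet almost identical functionally, which is the pattern one would expect if different taxa perform the same functions.
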
## Hypothesis 1 : Drinks

In [5]:
! qiime diversity adonis \
  --i-distance-matrix $data_dir/core-metrics-picrust2/bray_curtis_distance_matrix.qza \
  --m-metadata-file $data_dir/sample_meta_data.tsv \
  --p-formula "NUT_alcohol_frequency*NUT_drinks_per_session" \
  --o-visualization $data_dir/core-metrics-picrust2/adonis-bc-h1.qzv

Saved Visualization to: project_data/core-metrics-picrust2/adonis-bc-h1.qzv

In [4]:
Visualization.load(f'{data_dir}/core-metrics-picrust2/adonis-bc-h1.qzv')
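
The `--p-formula` values in this notebook use `*`, which in R-style model formulas means crossing: every main effect plus every interaction between the listed factors. The sketch below spells out which terms such a formula requests. `expand_crossing` is a hypothetical helper written for this illustration; it only handles formulas made of factors joined by `*` and is not part of QIIME 2.

```python
from itertools import combinations


def expand_crossing(formula):
    """Expand an R-style crossing 'a*b*c' into main effects and interactions."""
    factors = [name.strip() for name in formula.split('*')]
    terms = []
    # Order 1 gives main effects, order 2 pairwise interactions, and so on.
    for order in range(1, len(factors) + 1):
        for combo in combinations(factors, order):
            terms.append(':'.join(combo))
    return terms


terms = expand_crossing('NUT_alcohol_frequency*NUT_drinks_per_session')
for term in terms:
    print(term)
```

A formula with k crossed factors expands to 2^k - 1 terms, so the six-factor formula used for Hypothesis 4 requests 63 terms. This is worth keeping in mind when reading the adonis tables, since high-order interactions need many samples per factor combination to be estimated.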

## Hypothesis 2 : Milk products

In [7]:
! qiime diversity adonis \
  --i-distance-matrix $data_dir/core-metrics-picrust2/bray_curtis_distance_matrix.qza \
  --m-metadata-file $data_dir/sample_meta_data.tsv \
  --p-formula "NUT_milk_cheese_frequency*NUT_milk_substitute_frequency*NUT_vitamin_d_supplement_frequency" \
  --o-visualization $data_dir/core-metrics-picrust2/adonis-bc-h2.qzv

Saved Visualization to: project_data/core-metrics-picrust2/adonis-bc-h2.qzv

In [8]:
Visualization.load(f'{data_dir}/core-metrics-picrust2/adonis-bc-h2.qzv')

## Hypothesis 3 : Mediterranean

In [9]:
! qiime diversity adonis \
  --i-distance-matrix $data_dir/core-metrics-picrust2/bray_curtis_distance_matrix.qza \
  --m-metadata-file $data_dir/sample_meta_data.tsv \
  --p-formula "NUT_olive_oil*NUT_seafood_frequency*NUT_vegetable_frequency*NUT_fruit_frequency*NUT_whole_grain_frequency" \
  --o-visualization $data_dir/core-metrics-picrust2/adonis-bc-h3.qzv

Saved Visualization to: project_data/core-metrics-picrust2/adonis-bc-h3.qzv

In [10]:
Visualization.load(f'{data_dir}/core-metrics-picrust2/adonis-bc-h3.qzv')

## Hypothesis 4 : Poultry & Meat

In [None]:
! qiime diversity adonis \
  --i-distance-matrix $data_dir/core-metrics-picrust2/bray_curtis_distance_matrix.qza \
  --m-metadata-file $data_dir/sample_meta_data.tsv \
  --p-formula "NUT_whole_eggs*NUT_poultry_frequency*NUT_high_fat_red_meat_frequency*NUT_meat_eggs_frequency*NUT_red_meat_frequency*NUT_vitamin_b_supplement_frequency" \
  --o-visualization $data_dir/core-metrics-picrust2/adonis-bc-h4.qzv

In [4]:
Visualization.load(f'{data_dir}/core-metrics-picrust2/adonis-bc-h4.qzv')