# Statistics / Plotting Analysis
Alongside the microbiome classification, we can also run statistical analyses on the `Feature Table` data generated in the denoising step. These analyses include:
- Rarefaction depth
    - Alpha Rarefaction
    - Beta Rarefaction
- Diversity metrics
    - Core Diversity Metrics
    - Alpha Diversity
    - Beta Diversity


## Rarefaction Depth
As we begin performing more analyses of the samples in our feature table, an important parameter that needs to be defined is the even sampling (i.e. rarefaction) depth at which diversity metrics are computed. Because most diversity metrics are sensitive to differences in sampling depth between samples, it is common to randomly subsample the counts from each sample to a specific value. Samples with fewer sequences than this value are dropped from the analysis, so choosing it is tricky. We recommend making your choice by reviewing the information presented in the feature table summary file. Choose a value that is as high as possible (so you retain more sequences per sample) while excluding as few samples as possible.

Refer back to `asv-table.qzv` to choose an even sampling depth.
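
The trade-off described above can be sketched in a few lines of Python. This is only an illustration: the sample names and per-sample totals below are hypothetical stand-ins for the values you would read from `asv-table.qzv`, and `depth_summary` is a helper defined here, not part of QIIME 2.

```python
# Hypothetical per-sample sequence totals, as one might read them from asv-table.qzv.
sample_totals = {
    "S1": 15200,
    "S2": 12100,
    "S3": 10400,
    "S4": 9800,
    "S5": 3100,
}


def depth_summary(totals, depth):
    """Return (samples kept, sequences kept) when rarefying to `depth`."""
    # Samples with fewer sequences than the chosen depth are dropped.
    kept = [name for name, total in totals.items() if total >= depth]
    # Every retained sample contributes exactly `depth` sequences.
    return len(kept), len(kept) * depth


for depth in (3000, 9800, 10000, 12000):
    n_samples, n_seqs = depth_summary(sample_totals, depth)
    print(f"depth={depth:>6}  samples kept={n_samples}  sequences kept={n_seqs}")
```

A higher depth keeps more sequences per sample but drops more samples, so the printed table makes it easy to spot the depth where the loss of samples starts to outweigh the gain.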


### Alpha Rarefaction
After choosing an even sampling depth, it’s also helpful to see whether your diversity metrics appear to have stabilized at that depth of coverage. You can do this for alpha diversity using an alpha rarefaction plot.

In [None]:
qiime diversity alpha-rarefaction \
  --i-table asv-table.qza \
  --p-metrics shannon \
  --m-metadata-file sample-metadata.tsv \
  --p-max-depth 10000 \
  --o-visualization shannon-rarefaction-plot.qzv
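
To build intuition for what an alpha rarefaction plot shows, here is a minimal Python sketch of the idea, not the QIIME 2 implementation. The feature counts are hypothetical, the helpers `rarefy` and `shannon` are defined here for illustration, and the use of log base 2 and ten repeats per depth are choices made for this example.

```python
import math
import random


def rarefy(counts, depth, rng):
    """Subsample `depth` reads without replacement from a feature-count dict."""
    pool = [feature for feature, n in counts.items() for _ in range(n)]
    if depth > len(pool):
        raise ValueError("depth exceeds the number of reads in the sample")
    rarefied = {}
    for feature in rng.sample(pool, depth):
        rarefied[feature] = rarefied.get(feature, 0) + 1
    return rarefied


def shannon(counts):
    """Shannon entropy in bits (log base 2 is an assumption of this sketch)."""
    total = sum(counts.values())
    return -sum((n / total) * math.log2(n / total) for n in counts.values() if n > 0)


# Hypothetical sample: a few dominant features plus a tail of rare ones (1000 reads).
sample = {"ASV1": 500, "ASV2": 300, "ASV3": 150, "ASV4": 40, "ASV5": 8, "ASV6": 2}

rng = random.Random(42)
curve = []
for depth in (10, 50, 100, 500, 1000):
    # Average over several random draws to smooth the curve.
    values = [shannon(rarefy(sample, depth, rng)) for _ in range(10)]
    curve.append((depth, sum(values) / len(values)))

for depth, mean_h in curve:
    print(f"depth={depth:>5}  mean Shannon={mean_h:.3f}")
```

If the mean value stops changing as the depth grows, the metric has likely stabilized, which is the same pattern to look for in `shannon-rarefaction-plot.qzv`.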

### Beta Rarefaction
Similarly, you can evaluate whether your beta diversity metrics appear stable at the depth you have selected.

In [None]:
qiime diversity beta-rarefaction \
  --i-table asv-table.qza \
  --p-metric braycurtis \
  --p-clustering-method nj \
  --p-sampling-depth 10000 \
  --m-metadata-file sample-metadata.tsv \
  --o-visualization braycurtis-rarefaction-plot.qzv
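
The same stability question can be illustrated for a beta diversity metric. The sketch below repeatedly rarefies two hypothetical samples and reports how much the Bray-Curtis distance between them varies at each depth. It is a conceptual illustration under assumed data, not what `beta-rarefaction` computes internally; `rarefy` and `bray_curtis` are helpers written for this example.

```python
import random
import statistics


def rarefy(counts, depth, rng):
    """Subsample `depth` reads without replacement from a feature-count dict."""
    pool = [feature for feature, n in counts.items() for _ in range(n)]
    rarefied = {}
    for feature in rng.sample(pool, depth):
        rarefied[feature] = rarefied.get(feature, 0) + 1
    return rarefied


def bray_curtis(a, b):
    """Bray-Curtis dissimilarity: sum|a_i - b_i| / sum(a_i + b_i)."""
    features = set(a) | set(b)
    numerator = sum(abs(a.get(f, 0) - b.get(f, 0)) for f in features)
    denominator = sum(a.get(f, 0) + b.get(f, 0) for f in features)
    return numerator / denominator


# Two hypothetical samples with 1000 reads each.
sample_a = {"ASV1": 600, "ASV2": 300, "ASV3": 100}
sample_b = {"ASV1": 200, "ASV2": 300, "ASV4": 500}

rng = random.Random(7)
stability = {}
for depth in (20, 200, 1000):
    distances = [
        bray_curtis(rarefy(sample_a, depth, rng), rarefy(sample_b, depth, rng))
        for _ in range(10)
    ]
    # A small spread means the distance is stable at this depth.
    stability[depth] = (statistics.mean(distances), statistics.pstdev(distances))

for depth, (mean_d, spread) in stability.items():
    print(f"depth={depth:>5}  mean distance={mean_d:.3f}  spread={spread:.3f}")
```

A large spread at the depth you selected suggests that conclusions drawn from a single rarefied table may not be robust.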

## Computing Diversity Metrics
The next step that we’ll work through is computing a series of common diversity metrics on our feature table. We’ll do this using the `q2-diversity` plugin’s `core-metrics-phylogenetic` action. This action is another QIIME 2 pipeline, this time combining over ten different actions into a single action.

### Core Diversity Metrics
`core-metrics-phylogenetic` requires a feature table, a rooted phylogenetic tree, and sample metadata as input. It also requires the sampling depth at which the analysis will be performed, as discussed above. In the interest of retaining as many of the samples as possible, we’ll set our sampling depth to 10,000 for this analysis.

In [None]:
qiime diversity core-metrics-phylogenetic \
  --i-phylogeny rooted_tree.qza \
  --i-table asv-table.qza \
  --p-sampling-depth 10000 \
  --m-metadata-file sample-metadata.tsv \
  --output-dir diversity-metrics-phylogenetic

Check the output directory `diversity-metrics-phylogenetic` to see all the results.

*   diversity-metrics-phylogenetic/rarefied_table.qza
*   diversity-metrics-phylogenetic/faith_pd_vector.qza
*   diversity-metrics-phylogenetic/observed_features_vector.qza
*   diversity-metrics-phylogenetic/shannon_vector.qza
*   diversity-metrics-phylogenetic/evenness_vector.qza
*   diversity-metrics-phylogenetic/unweighted_unifrac_distance_matrix.qza
*   diversity-metrics-phylogenetic/weighted_unifrac_distance_matrix.qza
*   diversity-metrics-phylogenetic/jaccard_distance_matrix.qza
*   diversity-metrics-phylogenetic/bray_curtis_distance_matrix.qza
*   diversity-metrics-phylogenetic/unweighted_unifrac_pcoa_results.qza
*   diversity-metrics-phylogenetic/weighted_unifrac_pcoa_results.qza
*   diversity-metrics-phylogenetic/jaccard_pcoa_results.qza
*   diversity-metrics-phylogenetic/bray_curtis_pcoa_results.qza
*   diversity-metrics-phylogenetic/unweighted_unifrac_emperor.qzv
*   diversity-metrics-phylogenetic/weighted_unifrac_emperor.qzv
*   diversity-metrics-phylogenetic/jaccard_emperor.qzv
*   diversity-metrics-phylogenetic/bray_curtis_emperor.qzv

As you can see, this pipeline produces many results, including the rarefied table (`rarefied_table.qza`), alpha diversity metrics (Faith's PD, observed features, Shannon, evenness), and beta diversity metrics (Jaccard, Bray-Curtis, unweighted UniFrac, weighted UniFrac), along with the PCoA results and Emperor plots for each beta diversity metric. Many other diversity metrics are available. For further reading, see this forum post: https://forum.qiime2.org/t/alpha-and-beta-diversity-explanations-and-commands/2282.
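
To make a few of these metrics concrete, the following sketch computes the non-phylogenetic ones on two hypothetical samples. The formulas are the textbook definitions; the log base and the handling of single-feature samples are assumptions of this example, and Faith's PD and UniFrac are left out because they require the phylogenetic tree.

```python
import math


def observed_features(counts):
    """Richness: the number of features with a non-zero count."""
    return sum(1 for n in counts.values() if n > 0)


def shannon(counts):
    """Shannon entropy in bits (log base 2 is an assumption of this sketch)."""
    total = sum(counts.values())
    return -sum((n / total) * math.log2(n / total) for n in counts.values() if n > 0)


def pielou_evenness(counts):
    """Pielou's evenness: Shannon entropy divided by its maximum, log(S)."""
    richness = observed_features(counts)
    if richness < 2:
        return float("nan")  # undefined when only one feature is present
    return shannon(counts) / math.log2(richness)


def jaccard(a, b):
    """Jaccard distance on presence/absence: 1 - |shared| / |union|."""
    present_a = {f for f, n in a.items() if n > 0}
    present_b = {f for f, n in b.items() if n > 0}
    return 1 - len(present_a & present_b) / len(present_a | present_b)


# Hypothetical rarefied samples (100 reads each).
sample_x = {"ASV1": 50, "ASV2": 50}             # two features, perfectly even
sample_y = {"ASV1": 90, "ASV2": 5, "ASV3": 5}   # three features, one dominant

for name, sample in (("x", sample_x), ("y", sample_y)):
    print(
        f"sample {name}: observed={observed_features(sample)}  "
        f"Shannon={shannon(sample):.3f}  evenness={pielou_evenness(sample):.3f}"
    )
print(f"Jaccard distance x vs y: {jaccard(sample_x, sample_y):.3f}")
```

Sample `y` has more features than sample `x` but lower evenness, which shows why richness and evenness are reported as separate vectors.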

### Alpha Diversity Significance
Next, we’ll look for general patterns, by comparing different categorical groupings of samples to see if there is some relationship to richness and/or evenness.

To start with, we’ll examine ‘observed features’:

In [None]:
qiime diversity alpha-group-significance \
  --i-alpha-diversity diversity-metrics-phylogenetic/observed_features_vector.qza \
  --m-metadata-file sample-metadata.tsv \
  --o-visualization diversity-metrics-phylogenetic/alpha-group-sig-obs-feats.qzv
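
The group comparison in this visualization is, as far as we are aware, based on the Kruskal-Wallis test, a rank-based test that does not assume normally distributed values. The sketch below computes the H statistic by hand for two hypothetical groups of observed-feature counts. The group names and values are invented, no tie correction is applied, and the p-value formula used here is valid only for two groups (one degree of freedom).

```python
import math


def kruskal_wallis_h(groups):
    """Kruskal-Wallis H statistic using average ranks (no tie correction)."""
    pooled = sorted(value for group in groups for value in group)
    # Assign each distinct value the mean of the ranks it occupies.
    rank_of = {}
    i = 0
    while i < len(pooled):
        j = i
        while j < len(pooled) and pooled[j] == pooled[i]:
            j += 1
        rank_of[pooled[i]] = (i + 1 + j) / 2  # mean of ranks i+1 .. j
        i = j
    n = len(pooled)
    rank_term = sum(sum(rank_of[v] for v in group) ** 2 / len(group) for group in groups)
    return 12 / (n * (n + 1)) * rank_term - 3 * (n + 1)


# Hypothetical observed-feature counts for two sample groups.
gut = [120, 135, 150, 160]
soil = [200, 210, 250, 260]

h_stat = kruskal_wallis_h([gut, soil])
# Chi-square survival function with 1 degree of freedom (two groups only).
p_value = math.erfc(math.sqrt(h_stat / 2))
print(f"H = {h_stat:.3f}, p = {p_value:.4f}")
```

Because the test works on ranks, only the ordering of the values matters: here every soil sample ranks above every gut sample, which gives the largest H possible for two groups of four.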

### Beta Diversity Significance
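Similar to alpha diversity, we can test whether beta diversity differs between groups of samples. The command below tests the Bray-Curtis distances against the `group` metadata column. The pipeline above computes only a few beta diversity metrics; run `qiime diversity beta --help` or `qiime diversity beta-phylogenetic --help` to see the others. To test another distance matrix, replace the input file in this command: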

In [None]:
qiime diversity beta-group-significance \
  --i-distance-matrix diversity-metrics-phylogenetic/bray_curtis_distance_matrix.qza \
  --m-metadata-file sample-metadata.tsv \
  --m-metadata-column group \
  --o-visualization diversity-metrics-phylogenetic/bray-curtis-group-significance.qzv
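
By default, and to the best of our knowledge, this action runs a PERMANOVA test, which asks whether distances between groups are larger than distances within groups. The following is a minimal sketch of that idea on a hypothetical six-sample distance matrix. The functions `pseudo_f` and `permanova`, the labels, and the distances are all invented for illustration and are not the QIIME 2 implementation.

```python
import random
from itertools import combinations


def pseudo_f(dist, labels):
    """PERMANOVA pseudo-F statistic from a square distance matrix and group labels."""
    n = len(labels)
    groups = sorted(set(labels))
    ss_total = sum(dist[i][j] ** 2 for i, j in combinations(range(n), 2)) / n
    ss_within = 0.0
    for group in groups:
        members = [i for i, label in enumerate(labels) if label == group]
        ss_within += sum(dist[i][j] ** 2 for i, j in combinations(members, 2)) / len(members)
    ss_among = ss_total - ss_within
    return (ss_among / (len(groups) - 1)) / (ss_within / (n - len(groups)))


def permanova(dist, labels, permutations=999, seed=0):
    """Return (pseudo-F, permutation p-value) by shuffling the group labels."""
    rng = random.Random(seed)
    observed = pseudo_f(dist, labels)
    shuffled = list(labels)
    hits = 0
    for _ in range(permutations):
        rng.shuffle(shuffled)
        # Small tolerance so that equal statistics are not missed to rounding.
        if pseudo_f(dist, shuffled) >= observed - 1e-12:
            hits += 1
    return observed, (hits + 1) / (permutations + 1)


# Hypothetical data: distance 0.2 within a group, 0.8 between groups.
labels = ["A", "A", "A", "B", "B", "B"]
dist = [
    [0.0 if i == j else (0.2 if labels[i] == labels[j] else 0.8) for j in range(6)]
    for i in range(6)
]

f_stat, p_value = permanova(dist, labels)
print(f"pseudo-F = {f_stat:.2f}, p = {p_value:.3f}")
```

Even though the two groups are perfectly separated, the p-value stays near 0.1, because six samples allow only 20 distinct ways to split the labels and two of them reproduce the observed grouping. Small studies therefore limit how small a permutation p-value can get.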

Congratulations, you have successfully completed the statistical analysis. Now continue to the next section of the tutorial, [Enrichment Analysis](6_Enrichment_Analysis.ipynb),

or go back to the main tutorial page: [Main Page](1_Metagenomics_Workshop_Module.ipynb).