Examination of microbial diversity patterns

This method has now been published in PNAS!

A null model for microbial diversification
Timothy J. Straub and Olga Zhaxybayeva
PNAS July 3, 2017 vol. 114 no. 27 E5414-E5423


General steps in the analysis pipeline

  1. Run the pipeline on empirical data. See pipeline/ for details.
    a. Fast alignment-free similarity comparisons of all vs all protein sequences using afree
    b. Clustering on similarity using MCL to generate gene families
    c. Filtering on gene families to generate a near single-copy core
    d. Furthest-neighbor clustering within gene families using Mothur
  2. Simulate neutral gene families based on empirical data. See sim/ for details.
    a. Simulate phylogenies using a Moran model
    b. Generate DNA sequences based on these phylogenies using Seq-Gen
    c. Cluster the simulated sequences with Mothur
  3. Compare simulated and empirical data using R. See statistics.R for details.
    a. Load in both empirical and simulated clustering results
    b. Compare empirical gene family clustering patterns to simulated null model
    c. Investigate gene families that are significantly different from null model
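
The clustering and comparison steps above can be illustrated with a small, self-contained Python sketch. It is not the code in pipeline/, sim/, or statistics.R: it reimplements furthest-neighbor (complete-linkage) clustering in plain Python rather than calling Mothur, and it uses a simple upper-tail empirical p-value as an assumed stand-in for whatever test statistics.R actually applies. The function names, the toy distance matrix, and the simulated cluster counts are all hypothetical.

```python
from itertools import combinations


def furthest_neighbor_clusters(dist, cutoff):
    """Complete-linkage (furthest-neighbor) clustering of items 0..n-1.

    dist is a symmetric n x n distance matrix (list of lists). Two clusters
    are merged only while the largest pairwise distance between their
    members is <= cutoff, so every pair inside a cluster is within cutoff.
    """
    clusters = [[i] for i in range(len(dist))]

    def linkage(a, b):
        # Furthest-neighbor distance: the maximum over all cross pairs.
        return max(dist[i][j] for i in a for j in b)

    while len(clusters) > 1:
        # Find the pair of clusters with the smallest furthest-neighbor distance.
        (x, y), d = min(
            (((x, y), linkage(clusters[x], clusters[y]))
             for x, y in combinations(range(len(clusters)), 2)),
            key=lambda item: item[1],
        )
        if d > cutoff:
            break  # no remaining pair can be merged within the cutoff
        clusters[x] = clusters[x] + clusters[y]
        del clusters[y]  # safe: combinations guarantees x < y
    return clusters


def upper_tail_p(observed, simulated):
    """Empirical p-value: share of null values >= observed (with +1 correction).

    ASSUMPTION: an upper-tail test is used here only for illustration.
    """
    extreme = sum(1 for s in simulated if s >= observed)
    return (1 + extreme) / (1 + len(simulated))


# Hypothetical pairwise distances for four sequences in one gene family.
dist = [
    [0.00, 0.01, 0.02, 0.10],
    [0.01, 0.00, 0.02, 0.11],
    [0.02, 0.02, 0.00, 0.12],
    [0.10, 0.11, 0.12, 0.00],
]

empirical = furthest_neighbor_clusters(dist, cutoff=0.03)
print(empirical)  # [[0, 1, 2], [3]]

# Hypothetical cluster counts from nine simulated neutral gene families.
null_counts = [2, 3, 3, 4, 3, 2, 3, 3, 2]
print(upper_tail_p(4, null_counts))  # 0.2
```

In this sketch, a gene family whose observed cluster count falls far in the tail of the simulated counts would be a candidate for closer inspection, in the spirit of step 3c.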