Provides Jupyter notebooks to analyse the poreFUME output
Switch branches/tags
Nothing to show
Clone or download
Fetching latest commit…
Cannot retrieve the latest commit at this time.
Permalink
Type Name Latest commit message Commit time
Failed to load latest commit information.
README.md
analyzeResistome.ipynb
calculateResistome.ipynb
paths.py
run.sh

README.md

poreFUME_paper

This repo allows to reproduce the analysis in the poreFUME paper. Run run.sh to setup the analysis, this will:

  1. Clone into poreFUME.
  2. Download the input nanopore, PacBio and Sanger sequence data files and raw nanopore data from ENA
  3. Download the processed data files by poreFUME, these are not strictly necessary, but allow the user to skip the poreFUME pipeline itself
  4. Run install.sh of poreFUME which takes care of the daligner, DAZZ_DB, POA, nanocorrect dependencies.

Next you can check out the calculateResistome.ipynb notebook to run poreFUME and some auxiliary analysis (ie. sequence identity to the Sanger set) or directly go to analyzeResistome.ipynb to reproduce the figures in the paper

Dependencies

Minimal Python 2.7, pandas, numpy, biopython as described in the poreFUME install document.
poreFUME makes use of the CARD database. So when using please cite McArthur et al. 2013. The Comprehensive Antibiotic Resistance Database. Antimicrobial Agents and Chemotherapy, 57, 3348-3357. Furthermore Nanocorrect and Nanopolish are used, which can be cited by Loman NJ, Quick J, Simpson JT: A complete bacterial genome assembled de novo using only nanopore sequencing data. Nat Methods 2015, 12:733–735.

poreFUME

Check the poreFUME README and poreFUME install documents for more details and specific dependencies of poreFUME.