Skip to content

Bacterial wg-GWAS for the detection of E. coli BSI and commensal genetic determinants

License

Notifications You must be signed in to change notification settings

jburgaya/2022_ecoli_commensal

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

26 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Snakemake workflow for wg-GWAS on E. coli commensal and BSI

Snakemake pipeline to reproduce the wg-GWAS analysis on the genetic determinants of E. coli bloodstream infections (BSI) and commensal isolates. Users wishing to replicate this study should place the data containig the phenotype and covariates within the data/ directory. Further details can be found in the preprint.

Input genomes

The input assemblies can be found under the following bioproject accessions:

Phenotype Bioproject accession
BSI PRJEB39260, PRJEB35745
Commensal PRJEB38489, PRJEB44819, PRJEB44872, PRJEB39252, PRJEB55584

The fasta files should be placed inside the data/fastas directory, and the annotated gff files in the data/gffs directory.

Usage

To make a dry run of the analyis:

snakemake --use-conda --cores 36 -n -p

Snakemake will install the appropriate packages for each step as conda environments when running it without the -n flag.

A symbolic link to a directory containing the eggnog-mapper database should be placed in data/eggnog-mapper, as well as one to the unzipped fasta file from uniref50 (data/uniref50.fasta).

Author

Judit Burgaya (BurgayaVentura.Judit@mh-hannover.de)

About

Bacterial wg-GWAS for the detection of E. coli BSI and commensal genetic determinants

Topics

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published