Snakemake workflow for wg-GWAS on E. coli commensal and BSI

Snakemake pipeline to reproduce the wg-GWAS analysis on the genetic determinants of E. coli bloodstream infections (BSI) and commensal isolates. Users wishing to replicate this study should place the data containig the phenotype and covariates within the data/ directory. Further details can be found in the preprint.

Input genomes

The input assemblies can be found under the following bioproject accessions:

Phenotype	Bioproject accession
BSI	PRJEB39260, PRJEB35745
Commensal	PRJEB38489, PRJEB44819, PRJEB44872, PRJEB39252, PRJEB55584

The fasta files should be placed inside the data/fastas directory, and the annotated gff files in the data/gffs directory.

Usage

To make a dry run of the analyis:

snakemake --use-conda --cores 36 -n -p

Snakemake will install the appropriate packages for each step as conda environments when running it without the -n flag.

A symbolic link to a directory containing the eggnog-mapper database should be placed in data/eggnog-mapper, as well as one to the unzipped fasta file from uniref50 (data/uniref50.fasta).

Author

Judit Burgaya (BurgayaVentura.Judit@mh-hannover.de)

Name		Name	Last commit message	Last commit date
Latest commit History 26 Commits
albi		albi
config		config
out/figures		out/figures
workflow		workflow
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Snakemake workflow for wg-GWAS on E. coli commensal and BSI

Input genomes

Usage

Author

About

Releases

Packages

Contributors 2

Languages

License

jburgaya/2022_ecoli_commensal

Folders and files

Latest commit

History

Repository files navigation

Snakemake workflow for wg-GWAS on E. coli commensal and BSI

Input genomes

Usage

Author

About

Topics

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Contributors 2

Languages

Packages