Skip to content
/ LazyB Public

Economic Genome Assembly from Low Coverage Illumina and Nanopore Data

Notifications You must be signed in to change notification settings

TGatter/LazyB

Repository files navigation

MuCHSALSA

For an efficient implementation of this prototype please visit: https://github.com/0x002A/MuCHSALSA/

Lazy B

The Bradypus is a lazy but very effective animal.

Introduction

This is a preliminary version of the tool under rapid development.

Gatter, Thomas, et al. "Economic genome assembly from low coverage Illumina and Nanopore data." 20th International Workshop on Algorithms in Bioinformatics (WABI 2020). Schloss Dagstuhl-Leibniz-Zentrum für Informatik, 2020.

Please contact us with any issues and suggestions.

Requirements

Following tools must be installed:

  • jellyfish 2
  • bbduk
  • Abyss 2 (using 'abyss-pe')
  • minimap2
  • modified version of racon (racon_mod in this project, compiled for generic linux)

Usage

call the script lazyB_pipeline.sh as follows:

/path/lazyB\_pipeline.sh \[k-mer-size-filter\] \[k-mer-size-assembly\] \[name\] \[illumina-inputfile-1\] \[illumina-inputfile-2\]] \[nanopore-inputfile\] \[output-folder\]

example:

/path/lazyB\_pipeline.sh 50 90 Scerevisiae illumina\_paired\_end\_1.fa illumina\_paired\_end\_2.fa nanopore\_data.fastq output-folder

Note: Parts of the pipeline run on 8 cores. To change the number of available cores, set the value manually in lazyB_pipeline.sh.

[k-mer-size-filter] specifies the k-mer size for k-mer counting in raw illumina data. Reads with highly abundant k-mers are removed from the data. Starting at k=50 is recommended.

[k-mer-size-assembly] specifies the k-mer size during illumina assembly (here using Abyss). Starting at k=90 is recommended.

Comment out the following line in lazyB_pipeline.sh to disable the experimental polishing step.

$SCRIPTPATH/racon_prokrast -u -t $CORES $TMP/temp_1.query.fa $TMP/temp_1.align.paf $TMP/temp_1.target.fa > $OUT/03.assembly.fa

About

Economic Genome Assembly from Low Coverage Illumina and Nanopore Data

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published