Skip to content

Commit

Permalink
Add MRSA Nanopore assembly tutorial to AMR learning pathway
Browse files Browse the repository at this point in the history
  • Loading branch information
bebatut committed Feb 29, 2024
1 parent baa4209 commit 96393a7
Show file tree
Hide file tree
Showing 6 changed files with 840 additions and 891 deletions.
6 changes: 2 additions & 4 deletions learning-pathways/amr-gene-detection.md
Original file line number Diff line number Diff line change
Expand Up @@ -32,10 +32,8 @@ pathway:
tutorials:
- name: mrsa-illumina
topic: assembly
# - name: mrsa-nanopore
# topic: assembly
# - name: hybrid-assembly
# topic: assembly
- name: mrsa-nanopore
topic: assembly

- section: "Module: Genome annotation"
description: |
Expand Down
32 changes: 32 additions & 0 deletions topics/assembly/tutorials/mrsa-nanopore/tutorial.bib
Original file line number Diff line number Diff line change
Expand Up @@ -76,6 +76,38 @@ @article{Li_2016
doi={10.1093/bioinformatics/btw152}
}

@article{Li_2009,
author = {Li, H. and Durbin, R.},
journal = {Bioinformatics},
number = {14},
year = {2009},
month = {5},
pages = {1754--1760},
publisher = {Oxford University Press (OUP)},
title = {Fast and accurate short read alignment with {Burrows}-{Wheeler} transform},
volume = {25},
}

@article{Li_2010,
author = {Li, Heng and Durbin, Richard},
journal = {Bioinformatics},
number = {5},
year = {2010},
month = {1},
pages = {589--595},
publisher = {Oxford University Press (OUP)},
title = {Fast and accurate long-read alignment with {Burrows}--{Wheeler} transform},
volume = {26},
}

@misc{Li2013Aligning,
author = {Li, Heng},
year = {2013},
title = {Aligning sequence reads, clone sequences and assembly contigs with {BWA}-{MEM}},
howpublished = {http://arxiv.org/abs/1303.3997},
}




@article{lin2016assembly,
Expand Down
10 changes: 5 additions & 5 deletions topics/assembly/tutorials/mrsa-nanopore/tutorial.md
Original file line number Diff line number Diff line change
@@ -1,7 +1,7 @@
---
layout: tutorial_hands_on

title: Genome Assembly of MRSA using Oxford Nanopore MinION (and Illumina data if available)
title: Genome Assembly of MRSA from Oxford Nanopore MinION data (and optionally Illumina data)
zenodo_link: 'https://zenodo.org/record/10669812'
questions:
- How to check the quality of the MinION data (together with Illumina data)?
Expand Down Expand Up @@ -40,9 +40,9 @@ contributions:

follow_up_training:
- type: "internal"
topic_name: visualisation
topic_name: genome-annotation
tutorials:
- jbrowse
- amr-gene-detection
- type: "internal"
topic_name: galaxy-interface
tutorials:
Expand Down Expand Up @@ -311,7 +311,7 @@ For each run, **NanoPlot** generates 5 outputs:
> > <solution-title></solution-title>
> > 1. 3,400 bp to 5,451 bp, a 60.3% increase
> > 2. A -2.0% decrease is not a very significant decrease. Our data was quite good to start with and didn't have many short reads which were removed (23.4%)
> > 3. The coverage is a measure of how many reads 'cover' on average a single base in the genome. If you divide the total reads by the genome size, you will get a number how many times your genomes could theoretically be covered by reads.
> > 3. The coverage is a measure of how many reads 'cover' on average a single base in the genome. If you divide the total reads by the genome size, you will get a number how many times your genomes could theoretically be "covered" by reads.
> > 4. Before $$ \frac{621,945,741}{2,900,000} = 214.4$$ and after $$ \frac{609,657,642}{2,900,000} = 210.2$$. This is *not* a very big decrease in coverage, so no cause for concern. Generally in sequencing experiments you have some minimum coverage you expect to see based on how much of your sample you sequenced. If it falls below that threshold it might be cause for concern.
> >
> {: .solution}
Expand Down Expand Up @@ -432,7 +432,7 @@ Several tools exist for polishing, e.g. Racon ({% cite Vaser_2017 %}). Here we w

**Polypolish** needs as input the assembly but also SAM/BAM files where each read has been aligned to *all* possible locations (not just a single best location). Errors in repeats will be covered by short-read alignments, and **Polypolish** can therefore fix those errors.

To get the SAM/BAM files, we need to map the short reads on the assembly. We will use **BWA-MEM** ({% cite Li_2009, Li_2010, Li2013Aligning %}).
To get the SAM/BAM files, we need to map the short reads on the assembly. We will use **BWA-MEM** ({% cite Li_2009 %}, {% cite Li_2010 %}, {% cite Li2013Aligning %}).

We need to set up **BWA-MEM** so it aligns each read to all possible locations, not just the best location. This option does not work with paired-end alignment. We will then need to align forward and reverse read files separately, instead of aligning both read files with a single BWA-MEM run as usually recommended.

Expand Down
Original file line number Diff line number Diff line change
@@ -0,0 +1,46 @@
- doc: Test outline for Genome-Assembly-of-MRSA-using-Oxford-Nanopore-MinION-(and-Illumina-data-if-available)
job:
Illumina reverse raw reads:
class: File
location: https://zenodo.org/record/10669812/files/DRR187567.fastq.bz2
filetype: fastqsanger.bz2
Illumina forward raw reads:
class: File
location: https://zenodo.org/record/10669812/files/DRR187559_1.fastqsanger.bz2
filetype: fastqsanger.bz2
Nanopore raw reads:
class: File
location: https://zenodo.org/record/10669812/files/DRR187559_2.fastqsanger.bz2
filetype: fastqsanger.bz2
outputs:
nanoplot_before_filtering:
asserts:
has_text:
text: "number_of_reads"
has_text:
text: "91288"
has_n_lines:
n: 24
nanoplot_after_filtering:
asserts:
has_text:
text: "number_of_reads"
has_n_lines:
n: 24
flye_assembly_info:
asserts:
has_text:
text: "contig_1"
has_n_lines:
n: 2
quast_report_before_polishing:
asserts:
has_text:
text: "Flye"

quast_report_after_polishing:
asserts:
has_text:
text: "Flye"
has_text:
text: "Polypolish"
Loading

0 comments on commit 96393a7

Please sign in to comment.