# Read Alignment

## Introduction
In NGS, sequence alignment is the process of determining the most likely origin of each observed DNA sequencing read within the reference genome sequence.

![NGS Read Alignment](images/alignment.png "NGS Read Alignment")

### Why align?
Aligning NGS data against a reference genome supports several typical inferences:

* Variation from the reference: differences between the reads and the reference may have functional consequences.
* Transcript abundance: the number of aligned reads within genes can be used to quantify expression.
* Ab initio transcript discovery: a pileup of RNA-seq reads can provide evidence for an exon that was previously missed.

![Uses of Read Alignment](images/alignment-uses.png "Uses of Read Alignment")

## Learning outcomes
On completion of the tutorial, you should be able to:

* Perform read alignment using standard tools (BWA-MEM).
* Mark duplicate reads and understand their effect on analysis results.
* Visualise read alignments using IGV (Integrative Genomics Viewer).

If there is time you will learn how to:

* Merge the results from multiple alignments and understand when it is appropriate to perform a merge.

## Tutorial sections
This tutorial comprises the following sections:   
 1. [Performing read alignment](alignment.ipynb)   
 2. [Alignment visualisation](visualisation.ipynb)   
 
There is also an additional (optional) section:

 3. [Alignment workflows](workflows.ipynb)    

## Authors
This tutorial was written by [Jacqui Keane](https://github.com/jacquikeane) based on material from [Thomas Keane](https://github.com/tk2), [Vivek Iyer](https://github.com/vviyer) and [Victoria Offord](https://github.com/vo1).

## Running the commands from this tutorial
You can follow this tutorial by typing all the commands you see into a terminal window. This is similar to the "Command Prompt" window on MS Windows systems, which allows the user to type DOS commands to manage files.

To get started, open a new terminal on your computer and type the command below:

In [None]:
cd /home/manager/course_data/read_alignment

Now you can follow the instructions in the tutorial from here.

## Let’s get started!
This tutorial assumes that you have samtools, bwa, Picard tools and IGV installed on your computer. These are already installed on the VM you are using. To check that these are installed, you can run the following commands:

In [None]:
samtools --help

In [None]:
bwa

In [None]:
picard -h

In [None]:
igv

The first three commands should print the help messages for samtools, bwa and Picard tools respectively. The final command should launch the genome viewer IGV. You can close IGV for now; we will use it later in this tutorial to visualise alignments.
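If you would rather confirm that all four tools are available without reading each help message, the following minimal sketch reports whether each executable can be found on your `PATH`. It is not part of the original course material. It assumes the executables are named `samtools`, `bwa`, `picard` and `igv`, as in the commands above, and `check_tool` is a hypothetical helper name introduced here for illustration.

```shell
# Hypothetical helper: report whether a command is available on the PATH.
# "command -v" is a POSIX shell built-in that succeeds if the name resolves.
check_tool() {
    if command -v "$1" >/dev/null 2>&1; then
        echo "$1: found"
    else
        echo "$1: NOT found on PATH"
    fi
}

# Tool names are assumed to match the commands used in this tutorial.
for tool in samtools bwa picard igv; do
    check_tool "$tool"
done
```

Any tool reported as not found is either not installed or installed in a location that is not on your `PATH`. This check only confirms that the executable can be located, not that it runs correctly.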

To get started with the tutorial, head to the first section: [Performing read alignment](alignment.ipynb)