reference-guided aligner for next-generation sequencing technologies
Switch branches/tags
Nothing to show
Clone or download
Latest commit 5c25216 Jun 27, 2014
Failed to load latest commit information.
demo modified: Sep 18, 2012
src Version 2.2.30 Jun 27, 2014
README The announcement of the bug fix in README. Mar 27, 2014


MOSAIK 2.2        Release Distribution Documentation                  2013-09-20
Wan-Ping Lee & Michael Stromberg    Marth Lab, Boston College Biology Department


1. 2014-03-26 
   A bug causing incorrect bases of reverse complement alignments has been fixed.
   Please check any version greater than 2.2.19 for the fix.

2. Version 2.2 is about four-fold faster than previous by improving the Smith-
Waterman (SW) strategy and using SIMD SW [1]. This is also the version that we 
use on 1000 Genomes Project [2] Phase-III datasets.


MOSAIK is a stable, sensitive and open-source program for mapping second and 
third-generation sequencing reads to a reference genome. Uniquely among current 
mapping tools, MOSAIK can align reads generated by all the major sequencing 
technologies, including Illumina, Applied Biosystems SOLiD, Roche 454, 
Ion Torrent and Pacific BioSciences SMRT.


MOSAIK is open source software, licensed under the GPL 2.0+, and is provided 
free-of-charge to all users. All we ask in return is that you acknowledge its 
use in publications, talks, etc.

We ask that you cite this paper if you use MOSAIK in work that leads to 


To download MOSAIK, please use git to download the most recent development tree.

    % git clone

Or, download it from google code webpage:

If you encounter issues with the development HEAD, or simply wish to obtain the
most recent stable revision (2.2.3) then use:

        % git checkout 915f6feca4d321a79c29de34f6512f92b751b457


MOSAIK requires g++ and the standard C and C++ development libraries.
Just enter the src directory, glance at the README file, and type "make".

If static libraries are not installed, please modify includes/,
deleting "-static".

Running the demo:

To see a full list of the available command line options, enter the bin
directory and run any of the program without any extra parameters.

Included in this release is an Illumina data set featuring reads E.Coli.

To test MOSAIK with this data set, just enter the demo directory and 
type Build, and Align to run the appropriate scripts.


> cd demo
> ./
> ./

The resulting bam file (read.mka.bam) will be found in the
demo/fastq directory.