Simultaneous detection of SNPs and Indels using a 16-genotype probabilistic model


16GT

16GT is a variant caller that uses a 16-genotype probabilistic model to unify SNP and indel calling in a single algorithm. 16GT is easy to use: the default parameters fit most use cases with the human genome. For the detailed parameters of each module, run the module to print its usage information.

Quick start

Inputs: genome.fa, alignments.bam (plus dbSNP.vcf.gz for the filtering step). Outputs: variants.vcf, variants.filtered.vcf

0. Install

git clone https://github.com/aquaskyline/16GT
cd 16GT
make
# Tested on Ubuntu 14.04 and CentOS 6.7 with GCC 4.7.2

1. Build reference index

git clone https://github.com/aquaskyline/SOAP3-dp.git
cd SOAP3-dp
make SOAP3-Builder
make BGS-Build
soap3-dp-builder genome.fa
BGS-Build genome.fa.index

2. Convert BAM to SNAPSHOT

bam2snapshot -i genome.fa.index -b alignments.bam -o output/prefix

3. Call variants

snapshotSnpcaller -i genome.fa.index -o output/prefix
perl txt2vcf.pl output/prefix.txt sampleName genome.fa > variants.vcf
perl filterVCF.pl variants.vcf dbSNP.vcf.gz > variants.filtered.vcf
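
The conversion, calling, and filtering commands in steps 2 and 3 can be chained in a small wrapper. The following is a minimal sketch and not part of 16GT: the function names (run, run_to, call_variants), the DRY_RUN switch, and the assumption that the 16GT binaries are on PATH and the Perl scripts are in the working directory are all illustrative. The commands and flags themselves are copied from the steps above.

```shell
#!/usr/bin/env bash
# Hypothetical wrapper around Quick start steps 2 and 3 (not shipped with 16GT).
set -eu   # stop at the first failing step or unset argument

# Print a command, then execute it unless DRY_RUN=1.
run() {
  echo "+ $*"
  if [ "${DRY_RUN:-0}" != "1" ]; then "$@"; fi
}

# Same as run, but redirect stdout to the file given as the first argument.
run_to() {
  local out=$1; shift
  echo "+ $* > $out"
  if [ "${DRY_RUN:-0}" != "1" ]; then "$@" > "$out"; fi
}

# Arguments: index, BAM, output prefix, sample name, reference FASTA, dbSNP VCF.
call_variants() {
  local index=$1 bam=$2 prefix=$3 sample=$4 ref=$5 dbsnp=$6
  run bam2snapshot -i "$index" -b "$bam" -o "$prefix"
  run snapshotSnpcaller -i "$index" -o "$prefix"
  run_to variants.vcf perl txt2vcf.pl "$prefix.txt" "$sample" "$ref"
  run_to variants.filtered.vcf perl filterVCF.pl variants.vcf "$dbsnp"
}
```

Running `DRY_RUN=1 call_variants genome.fa.index alignments.bam output/prefix sampleName genome.fa dbSNP.vcf.gz` prints the four commands without executing them, which is a cheap way to check the arguments before a long run.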

Exome variant calling

Inputs: genome.fa, alignments.bam, region.bed. Outputs: region.bin, variants.vcf

# Pass -bed or -gff to match the format of the region file
RegionIndexBuilder genome.fa.index region.bed region.bin -bed
bam2snapshot -i genome.fa.index -b alignments.bam -o output/prefix -e region.bin
snapshotSnpcaller -i genome.fa.index -o output/prefix -e region.bin
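
The three exome commands share the same region index, so they can be wrapped in one function that derives the .bin name and checks the region format once. This is a minimal sketch under stated assumptions: the names run and call_exome and the DRY_RUN switch are hypothetical, the binaries are assumed to be on PATH, and the format flag is read as a choice between -bed and -gff.

```shell
#!/usr/bin/env bash
# Hypothetical wrapper around the exome commands (not shipped with 16GT).
set -eu   # stop at the first failing step or unset argument

# Print a command, then execute it unless DRY_RUN=1.
run() {
  echo "+ $*"
  if [ "${DRY_RUN:-0}" != "1" ]; then "$@"; fi
}

# Arguments: index, region file, region format (bed or gff), BAM, output prefix.
call_exome() {
  local index=$1 regions=$2 format=$3 bam=$4 prefix=$5
  # Derive the region index name, e.g. region.bed -> region.bin.
  local bin="${regions%.*}.bin"
  # Assumption: the builder accepts exactly one of -bed or -gff.
  case "$format" in
    bed|gff) ;;
    *) echo "region format must be bed or gff, got: $format" >&2; return 1 ;;
  esac
  run RegionIndexBuilder "$index" "$regions" "$bin" "-$format"
  run bam2snapshot -i "$index" -b "$bam" -o "$prefix" -e "$bin"
  run snapshotSnpcaller -i "$index" -o "$prefix" -e "$bin"
}
```

For example, `DRY_RUN=1 call_exome genome.fa.index region.bed bed alignments.bam output/prefix` prints the three commands with region.bin filled in, without running them.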

License

GPLv3