Skip to content
jewmanchue edited this page Jun 9, 2014 · 10 revisions

Description

XPEHH is a popular tool for examining the differences in haplotype decay between two populations. For further reading follow the link: [Sabeti et al. 2007(http://www.nature.com/nature/journal/v449/n7164/full/nature06250.html). The standard implementation of XPEHH requires phased and imputed genotypes.

Usage statement for xpEHH:

INFO: help
INFO: description:
     xpEHH estimates haplotype decay between the target and background populations.  Haplotypes are integrated
     until EHH in the target and background is less than 0.05. The score is the itegrated EHH (target) / integrated EHH (background).
     xpEHH does NOT integrate over genetic distance, as genetic maps are not availible for most non-model organisms.
Output : 4 columns :
     1. seqid
     2. position
     3. alllele frequency
     4. EHH-alt
     5. EHH-ref
     6. xpEHH

INFO: xpEHH  --target 0,1,2,3,4,5,6,7 --background 11,12,13,16,17,19,22 --file my.vcf

INFO: required: t,target     -- argument: a zero based comma separated list of target individuals corrisponding to VCF columns
INFO: required: b,background -- argument: a zero based comma separated list of background individuals corrisponding to VCF columns
INFO: required: f,file       -- argument: a properly formatted phased VCF file
INFO: required: y,type       -- argument: type of genotype likelihood: PL, GL or GP
INFO: optional: r,region     -- argument: a tabix compliant genomic range: seqid or seqid:start-end

INFO: version 1.1.0 ; date: April 2014 ; author: Zev Kronenberg; email : zev.kronenberg@utah.edu

Running xpEHH with provided data:

cd samples 
bgzip scaffold612.vcf
tabix -p vcf scaffold612.vcf.gz

bin/xpEHH --target  1,20,25,29,30,38,43,46  --background \
2,3,4,5,6,7,21,22,22,23,24,26,26,28,31,32,33,34,35,36,37,39,40,41,42,44,45 \
--file samples/scaffold612.phased.vcf --type GP > t.txt

 R --vanilla < bin/plotXPEHH.R --args t.txt 

The resulting plot:

xpehh

While this plot does not look as impressive as the associations tools' output (pFst, wcFst & pFst) it reflects reality.