Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

should I keep unphased sites? #18

Closed
hhu1 opened this issue Feb 26, 2016 · 1 comment
Closed

should I keep unphased sites? #18

hhu1 opened this issue Feb 26, 2016 · 1 comment
Labels

Comments

@hhu1
Copy link

hhu1 commented Feb 26, 2016

I am using MSMC over genomes sequenced by Complete Genomics. Based on the Schiffels & Durbin 2014 paper, unphased sites would introduce bias for population split analysis. However, when I looked into run_shapeit.sh tool, it seems that phasing was performed only on SNVs present in shapeit2 reference panel. Afterwards, both phased and unphased sites were merged into the same vcf file.

My question is, should I keep the unphased sites (those not present in the reference phasing panel) in my vcf file? If not, should I somehow fix the mask file to reflect the fact that only sites present in the reference panel are callable?

Thanks very much for your suggestions,

Hao Hu

@stschiff
Copy link
Owner

stschiff commented Mar 1, 2016

I always keep unphased sites, I can then decide when running MSMC to remove them using the —skipAmbiguous flag.

Best wishes,
Stephan

On 26 Feb 2016, at 21:25, hhu1 notifications@github.com wrote:

I am using MSMC over genomes sequenced by Complete Genomics. Based on the Schiffels & Durbin 2014 paper, unphased sites would introduce bias for population split analysis. However, when I looked into run_shapeit.sh tool, it seems that phasing was performed only on SNVs present in shapeit2 reference panel. Afterwards, both phased and unphased sites were merged into the same vcf file.

My question is, should I keep the unphased sites (those not present in the reference phasing panel) in my vcf file? If not, should I somehow fix the mask file to reflect the fact that only sites present in the reference panel are callable?

Thanks very much for your suggestions,

Hao Hu


Reply to this email directly or view it on GitHub #18.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Projects
None yet
Development

No branches or pull requests

2 participants