bayes error filter removes all INDELs #70

Open
ian1roberts opened this issue Sep 4, 2012 · 0 comments

@ian1roberts
Collaborator

Maybe I've misunderstood how the filter works, so this might be an error in how I'm applying it rather than a real issue.
Essentially, MiSeq amplicon data processed through the GATK pipeline, calling with glm=BOTH, generates a VCF that contains some nice INDELs.

After filtering like this:

vcf_filter.py --site-quality=99 --genotype-quality=99 --eblr=-10  gatk_raw.vcf sq mgq eb > gatk_filtered.vcf

the output contains no INDELs. I thought about it, and wondered whether it's appropriate to apply this filter to INDELs in tumour samples, where the distribution of allele depths isn't expected to be negative binomial. So I added an argument to the filter that excludes variants typed as INDEL from it.
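To see which filter is responsible, one option is to tally INDEL records by their FILTER value in both the raw and the filtered file. The sketch below is only an illustration: `indel_filter_counts` is a hypothetical helper, the VCF lines are toy data rather than real output, and "INDEL" is approximated as any record where an ALT allele differs in length from REF (symbolic alleles such as `<DEL>` would need separate handling).

```python
from collections import Counter


def indel_filter_counts(vcf_lines):
    """Count INDEL records per FILTER value in an iterable of VCF text lines."""
    counts = Counter()
    for line in vcf_lines:
        # Skip header lines and blank lines.
        if line.startswith('#') or not line.strip():
            continue
        fields = line.rstrip('\n').split('\t')
        ref, alts, filt = fields[3], fields[4].split(','), fields[6]
        # Simplified INDEL test: some ALT allele has a different length than REF.
        if any(len(alt) != len(ref) for alt in alts):
            counts[filt] += 1
    return counts


# Toy records for illustration only (not taken from the data in this issue).
toy_vcf = [
    '##fileformat=VCFv4.1',
    '#CHROM\tPOS\tID\tREF\tALT\tQUAL\tFILTER\tINFO',
    'chr1\t100\t.\tA\tG\t120\tPASS\t.',
    'chr1\t200\t.\tAT\tA\t150\teb\t.',
    'chr1\t300\t.\tC\tCGG\t130\tPASS\t.',
]
counts = indel_filter_counts(toy_vcf)
print(dict(counts))
```

Running the same tally on an open file handle for gatk_raw.vcf and for gatk_filtered.vcf would show whether the INDELs disappear entirely or carry a particular filter label.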

I hacked the ErrorBiasFilter to add an extra argparse option:

parser.add_argument('--no-indels', type=bool, default=True,
                    help='Apply filter to SNPs only')

and in __init__

self.no_indels = args.no_indels

and in __call__

if self.no_indels and record.is_indel:
    return None
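One thing worth checking before committing: argparse's `type=bool` calls `bool()` on the raw string, so `--no-indels False` still evaluates to True and the option can never be switched off. The sketch below puts the three pieces together with a string-to-bool converter instead. It is a self-contained illustration, not the PyVCF code: `str2bool`, `FakeRecord` and `ErrorBiasSketch` are hypothetical stand-ins, the error-bias test itself is replaced by a placeholder, and it assumes that returning None from a filter means the record is left unflagged.

```python
import argparse


def str2bool(value):
    """Parse a command-line string into a bool (hypothetical helper)."""
    text = value.strip().lower()
    if text in ('1', 'true', 't', 'yes', 'y'):
        return True
    if text in ('0', 'false', 'f', 'no', 'n'):
        return False
    raise argparse.ArgumentTypeError('expected a boolean, got %r' % value)


class FakeRecord:
    """Minimal stand-in for a VCF record; only what the sketch needs."""

    def __init__(self, ref, alts):
        self.REF = ref
        self.ALT = alts

    @property
    def is_indel(self):
        # Simplified: any ALT allele whose length differs from REF.
        return any(len(alt) != len(self.REF) for alt in self.ALT)


class ErrorBiasSketch:
    """Hypothetical stand-in for the error-bias filter, not PyVCF's class."""

    @classmethod
    def customize(cls, parser):
        parser.add_argument('--no-indels', type=str2bool, default=True,
                            help='Apply filter to SNPs only')

    def __init__(self, args):
        self.no_indels = args.no_indels

    def __call__(self, record):
        # Skip INDELs entirely when the option is on (assumed: None = not flagged).
        if self.no_indels and record.is_indel:
            return None
        # Placeholder for the real test: pretend every tested record fails.
        return 0.0


def build_filter(argv):
    parser = argparse.ArgumentParser()
    ErrorBiasSketch.customize(parser)
    return ErrorBiasSketch(parser.parse_args(argv))


snp = FakeRecord('A', ['G'])
deletion = FakeRecord('AT', ['A'])
snps_only = build_filter([])                           # default: INDELs skipped
all_variants = build_filter(['--no-indels', 'false'])  # INDELs tested too
print(snps_only(deletion), snps_only(snp), all_variants(deletion))
```

With a converter like this the default stays True, as in the patch above, while `--no-indels false` restores the original behaviour of testing every variant.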

I wanted to know whether this is useful before I commit, or whether I've misunderstood how this filter should work. Thanks, Ian
