This program takes a vcf filename as a parameter and for each variant with a PASS Filter value and an existing Gene prints:
- The total number of variants (mutations) represented in the file together with a breakdown of this total by mutation type (SNV/INS/DEL/MNV).
- A list of genes within which at least one variant has been found, the total number of variants found in that gene, and a breakdown by variant type.
Variants that do not have a Gene specified are excluded from this list!
python toyvcf/variant_counter.py data/testData.vcf
python -m unittest discover -s <path_to_tests> -p '*_tests.py'
Sample program output for running on data/testData.vcf is located in data/testData.output
Written by Sergei Iakhnin on July 22, 2014