Skip to content

A python package for determining inversion genotype in Anopheles coluzzii and gambiae

License

Notifications You must be signed in to change notification settings

sanjaynagi/compkaryo

 
 

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

15 Commits
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

How to use compkaryo

Compkaryo (short for "computational karyotyping") is a Python package for taking a VCF file generated from arm 2R or 2L of Anopheles coluzzii and/or gambiae, and returning predictions for karyotypes of up to six inversions on those arms. Available inversions are: 2La; 2Rb; 2Rc; 2Ru; and for An. gambiae, 2Rd and 2Rj. The method behind these predictions is described in our publication, Love et al 2019 DOI: https://doi.org/10.1534/g3.119.400445.

Compkaryo does not currently do any filtering of the VCF. The file should be filtered to mask low-quality genotypes (we recommend masking everything with a GQ less than 20) before running compkaryo. Including low-quality genotypes in the calculation may significantly skew the results.

Compkaryo requires Python 3 and depends on numpy and scikit-allel.

Usage:

python compkaryo.py vcf inversion

Options:

-h, --help Print the help message

-o, --out Specify an output file. If no output file is given, output prints to standard out.

-s, --samples Specify sample names to include. Names can be given as a text file, with one name per line, or as a space-separated list on the command line.

-t, --totals Output the total number of sites matching each possible inversion genotype (0, 1, 2).

Compkaryo currently operates on one inversion at a time. Use a loop to analyze multiple inversions.

About

A python package for determining inversion genotype in Anopheles coluzzii and gambiae

Resources

License

Stars

Watchers

Forks

Packages

No packages published

Languages

  • Python 100.0%