Analysis of human genetic data

This repository contains python and R utilities for analysis and downstream visualization of genetic data:

parse_manifest
parse_vep_output
worldFreq

`parse_manifest`

This is a utility for parsing the MEGA manifest provided by Illumina (https://www.illumina.com/science/consortia/human-consortia/multi-ethnic-genotyping-consortium.html). The MEGA platform enables high imputation accuracy for multi-ethnic cohorts and was developed by the PAGE Study (www.pagestudy.org). To read about trans and multi-ethnic GWAS using the MEGA platform, check out our preprint on biorxiv: (http://www.biorxiv.org/cgi/content/short/188094v1).

This is a general tool that can be used with other array platforms including MEGA, MEGA-ex, and GSA.

`parse_vep_output`

Script for selecting annotations of interest from a VEP VCF file and simplifying into a tab-delimited TXT file.

Usage: parse_vep_output.py <VCF file> <annotation type>

`worldFreq`

Analysis and visualization of minor allele frequency or carrier frequency for a large number of globally representative populations. worldFreq was used for studying COL27A1.pG697R carrier frequencies in Figure 4A in Belbin GM et al "Genetic identification of a common collagen disease in Puerto Ricans via identity-by-descent mapping in a health system" (2017). (https://elifesciences.org/articles/25060/figures) And also for Figure5, HLA-B star 57:01 allele frequency in Wojcik et al, "Genetic diversity turns a new PAGE in our understanding of complex traits" (2017). (http://www.biorxiv.org/content/early/2017/09/15/188094).

Frequency data from the eLife paper is provided in example-data.tsv but this code will work with any tab-delimited file of allele frequency data from PLINK.

Name		Name	Last commit message	Last commit date
Latest commit History 19 Commits
Figure4A_Belbin_KENNY_2017.pdf		Figure4A_Belbin_KENNY_2017.pdf
Figure5_Wojcik_et_al_2017.pdf		Figure5_Wojcik_et_al_2017.pdf
README.md		README.md
example-data.tsv		example-data.tsv
parse_manifest_refseq.py		parse_manifest_refseq.py
parse_vep_output.py		parse_vep_output.py
worldFreq.R		worldFreq.R

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Figure4A_Belbin_KENNY_2017.pdf

Figure4A_Belbin_KENNY_2017.pdf

Figure5_Wojcik_et_al_2017.pdf

Figure5_Wojcik_et_al_2017.pdf

README.md

README.md

example-data.tsv

example-data.tsv

parse_manifest_refseq.py

parse_manifest_refseq.py

parse_vep_output.py

parse_vep_output.py

worldFreq.R

worldFreq.R

Repository files navigation

Analysis of human genetic data

`parse_manifest`

`parse_vep_output`

`worldFreq`

About

Releases

Packages

Languages

epsorokin/clinical_genetics

Folders and files

Latest commit

History

Repository files navigation

Analysis of human genetic data

parse_manifest

parse_vep_output

worldFreq

About

Resources

Stars

Watchers

Forks

Languages

`parse_manifest`

`parse_vep_output`

`worldFreq`