Workflow for ancestry-unbiased genetic signatures
1.0
Leslie A Smith, James A Cahill, Kiley Graim
-- gnomAD VCF parsing is done in leslie-smith1112/gnomad_vcf_parsing. It is kept as a separate directory for now to keep dependencies simple.
-- Files for pipeline:
   gnomAD file: https://gnomad.broadinstitute.org/downloads (gnomAD v2.1.1)
   HumanBase networks: https://hb.flatironinstitute.org/download (full networks; mammary epithelium, uterine endometrium, thyroid gland)
   Disease expression matrices and clinical info (cBioPortal): https://www.cbioportal.org/datasets (brca_tcga_pan_can_atlas_2018.tar.gz, ucec_tcga_pan_can_atlas_2018.tar.gz, thca_tcga_pan_can_atlas_2018.tar.gz)
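Before running the pipeline, it can help to confirm that the cBioPortal archives named above have been downloaded. The following is a minimal sketch, not part of the pipeline itself: the archive names come from the list above, while the function name missing_archives and the "data" download directory are hypothetical and should be adapted to the local layout.

```python
from pathlib import Path

# Archive names as listed in this README; where they are stored is an assumption.
CBIOPORTAL_ARCHIVES = [
    "brca_tcga_pan_can_atlas_2018.tar.gz",
    "ucec_tcga_pan_can_atlas_2018.tar.gz",
    "thca_tcga_pan_can_atlas_2018.tar.gz",
]


def missing_archives(data_dir):
    """Return the listed cBioPortal archives that are not present in data_dir."""
    data_dir = Path(data_dir)
    return [name for name in CBIOPORTAL_ARCHIVES if not (data_dir / name).is_file()]


if __name__ == "__main__":
    # "data" is a hypothetical download directory, not one defined by the pipeline.
    missing = missing_archives("data")
    if missing:
        print("Missing archives:", ", ".join(missing))
    else:
        print("All cBioPortal archives found.")
```

The same check could be extended to the gnomAD and HumanBase downloads once their local file names are known.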
-- Scripts for figures used in the paper are in the ./figure_scripts directory.