Contributors: Janne Pott and Holger Kirsten
Last Updated: 03/31/2022
Supporting code for the following paper:
- Pott J, Garcia T, Hauck SM, Petrera A, Wirkner K, Loeffler M, et al. (2022) Genetically regulated gene expression and proteins revealed discordant effects. PLoS ONE 17(5): e0268815. https://doi.org/10.1371/journal.pone.0268815
We provide the main scripts used in the locus-wide association study (LWAS) of Olink proteins in LIFE-Adult so that other researchers can reproduce our results, starting from the summary statistics (for protein ~ SNP, gene expression ~ SNP, and protein ~ gene expression associations in LIFE).
Complete data sets, including genetic data of LIFE-Adult participants, cannot be made publicly available due to ethical and legal restrictions, as they are sufficient to identify study participants, which is not covered by the informed consent. However, access to the LIFE-Adult data is possible via project agreements addressed to:
- LIFE Research Center for Civilization Diseases, Medical Faculty, University of Leipzig, Leipzig, Germany
- E-mail: info@life.uni-leipzig.de
- Homepage: https://www.uniklinikum-leipzig.de/einrichtungen/life/kontakt (currently only available in German)
You will need to customize a source file, indicating:
- path to R library (please use R Version 4.x, all necessary packages are listed in the source file)
- path to Python and Conda (e.g. Anaconda)
- path to MetaXcan
- path to summary-gwas-imputation
- path to PLINK
- path to 1000 Genomes Phase 3 EUR data
- path to GTEx v8 Data
- path to CAD summary statistics (van der Harst et al.)
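Before running the scripts, it can help to confirm that every location listed above actually exists on your system. The following is a minimal sketch in Python, not part of the repository: the dictionary `REQUIRED_PATHS`, its labels, the placeholder paths, and the helper `find_missing` are all hypothetical and must be adapted to the entries in your own source file.

```python
from pathlib import Path

# Hypothetical placeholders: replace each value with the location on your system.
# The labels mirror the list above; they are not names used by the repository.
REQUIRED_PATHS = {
    "R library": "/path/to/R/library",
    "Python / Conda": "/path/to/anaconda",
    "MetaXcan": "/path/to/MetaXcan",
    "summary-gwas-imputation": "/path/to/summary-gwas-imputation",
    "PLINK": "/path/to/plink",
    "1000 Genomes Phase 3 EUR": "/path/to/1000G_EUR",
    "GTEx v8": "/path/to/GTEx_v8",
    "CAD summary statistics": "/path/to/CAD_summary_statistics",
}


def find_missing(paths):
    """Return the labels whose configured path does not exist on disk."""
    return [
        label
        for label, location in paths.items()
        if not Path(location).expanduser().exists()
    ]


if __name__ == "__main__":
    missing = find_missing(REQUIRED_PATHS)
    for label in missing:
        print(f"Missing: {label} -> {REQUIRED_PATHS[label]}")
    if not missing:
        print("All configured paths exist.")
```

The check only tests that each path exists; it does not verify tool versions or the contents of the data directories.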
- Harmonization calls to lift from hg19 to hg38
- Hierarchical FDR for combined, males and females
- Lead SNPs: sex-interaction test --> Figure S4
- Lead SNPs: comparison with effect on gene expression (GTEx v8 data) --> Figure 2
- Circular Plot --> Figure 1
- Locus level: Co-localization of pQTLs and eQTLs (GTEx v8 data) --> Figure S6
- Locus level: Correlation of protein levels and genetically regulated gene expression using MetaXcan
- Causal analyses: Test causality of gene expression on protein levels
- Causal analyses: Test causality of protein levels on CAD (van der Harst data) --> Figure S8
- Causal analyses: Test mediating effect of protein levels on GE-CAD relation
- Overview of all tissue-specific results (combined setting only)
- Main Tables & Figure 3
- Supplemental Tables
- Figures S5 and S7
- PLINK calls to generate association statistics (individual level data)
- Priority Pruning with LIFE-Adult data (individual level data)
- Annotation Pipeline (not yet published from GenStatLeipzig Group)
- Figures:
  - Figure 4: DAGs MR Mediation (generated in PowerPoint, based on Table S12)
  - Figure S1: Sample Overview (individual level data)
  - Figure S2: Flowchart of Analysis Plan (generated in PowerPoint)
  - Figure S3: Genetic PC Plot (individual level data)