Contributor: Janne Pott
Last Updated: 22/01/2024
Supporting code for the following paper:
- Pott J, Kheirkhah A, Gadin JR, Kleber ME, Delgado GE, Kirsten H, et al. Sex and statin-related genetic associations at the PCSK9 gene locus – results of genome-wide association meta-analysis. Biology of Sex Differences. 2023 (under review)
This project is an extension to our previous publications:
- Pott J, Burkhardt R, Beutner F, Horn K, Teren A, Kirsten H, et al. Genome-wide meta-analysis identifies novel loci of plaque burden in carotid artery. Atherosclerosis. 2017;259:32--40. DOI.
- Pott J, Gadin J, Theusch E, Kleber ME, Delgado GE, Kirsten H, et al. Meta-GWAS of PCSK9 levels detects two novel loci at APOB and TM6SF2. Hum Mol Genet 2021. DOI.
We are providing the main scripts used in the GWAMA of PCSK9 levels in four European Cohorts (LIFE-Adult, LIFE-Heart, LURIC, TwinGene, KORA-F3, and GCKD), stratified by sex and statin treatment, to empower other researchers to reproduce our results, starting from the summary statistics. Data can be found on zenodo (LINK tba)
You will need to customize a source file, indicating
-
R library and R packages: all scripts were run under R Version 4.x. All necessary packages are listed in the source file. Additional function not published in any R package are listed in the directory 'helperFunctions'.
-
Path to tools other than R:
-
Path to downloaded data sets used throughout the analyses:
- 1000 Genomes Phase 3 EUR data
- GTEx v8 data
- GTEx v8 data - sex stratified
- Summary statistics for lipid - sex-stratified, publication: Kanoni et al. (2022)
- Summary statistics for lipids - sex-combined, publication: Graham et al. (2021)
- Summary statistics for coronary artery disease, publication: van der Harst et al. (2018)
- Summary statistics for sleep duration, publication: Dashti et al. (2019)
R scripts staring with 0x:
- Get summary statistics as uploaded to zenodo (documentary, uses GWAS pipeline output, you will not need to rerun this when you downloaded the zenodo data)
- Define associated loci
- Fine-mapping of PCSK9 locus (GCTA conditional joint analyses)
- Interaction Tests
- Co-localization
- Preparation of data
- Run within PCSK9 data
- Run against eQTLs
- Run against other GWAS traits
- Mendelian Randomization
- Preparation of UKBB data (GWAS for stratified LDLC)
- PCSK9 on LDL-C using UKBB data for LDLC summary statistics
- PCSK9 on LDL-C using GLGC data
- Look-up of sex-biased gene expression and sex-biased eQTLs
R scripts staring with MF:
- Heatmap of independent SNPs at PCSK9 gene loci
- Interaction scatter plot of SNP estimates
- Forest plot of causal estimates per subgroup
- Interaction scatter plot of causal estimates
R scripts staring with MT:
- Summary of independent SNPs in the PCSK9 GWAS
- Results of the SNP interaction analysis
- Results of the MR analysis
- Results of the MR interaction analysis
R script staring with ST:
- Description of Studies (as received from participating studies) --> not included here
- Sample Sizes, SNP Numbers, genomic inflation factor
$\lambda$ , and LDSC heritability results per phenotype - Overview of associated loci
- Annotation of associated SNPs
- GCTA COJO results
- Interaction results
- Colocalization results
- Mendelian Randomozation results