# LDSC-SEG on TWAS Meta-Analysis Results for OUD

**Author**: Jesse Marks<br>
**NIH Project**: [Harnessing Knowledge of Gene Function in Brain Tissue for Discovering Biology Underlying Heroin Addiction](https://reporter.nih.gov/search/RC99reuHhEW0n_3WuFPU6g/project-details/10116351) <br>
**Charge Code**:0218755.001.002.001 ~0215889.001.001~<br>
**GitHub Issue**:  [Opioid Use Disorder TWAS Meta-analysis (Uniform Processing) #183](https://github.com/RTIInternational/bioinformatics/issues/183)<br>

**Overview**:<br>
Differential gene expression (DGE) analysis commonly uses RNA-seq (RNA sequencing) data as a primary source of information. RNA-seq is a powerful and widely used technique for measuring gene expression levels and identifying differentially expressed genes between different biological conditions. It provides a comprehensive view of gene expression and allows researchers to compare expression patterns across different conditions, treatments, or stages of development. It has become a valuable tool for studying gene regulation, identifying disease-associated genes, and understanding biological processes at the transcriptomic level.

We received nucleus accumbens (NAc) RNA-seq data from the New York Genome Center (NYGC). The GitHub issue  [NYGC NAc RNA-seq data processing and QC #189](https://github.com/RTIInternational/bioinformatics/issues/189) describes the process of retrieving, processing, and QC'ing these nucleus accumbens (NAc) data.


**Description**:<br>
This notebook details the process used to test for heritability enrichment in the significantly differentially expressed genes from our DEG meta-analysis for 47 phenotypes.


We performed Stratified LD Score Regression (S-LDSC) analyses using the LD score regression approach described in [Finucane et al. 2018](https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5896795/) on specifically expressed genes (LDSC-SEG).
LDSC-SEG applies stratified LD score regression to test whether disease heritability is enriched in the regions surrounding genes with the highest specific expression in a given tissue.
This approach helps to interpret GWAS signal by leveraging gene expression data.



The meta-analysis results are published in [THIS](https://github.com/RTIInternational/bioinformatics/issues/189#issuecomment-1545939252) GitHub comment.
The file name is `hic_NAc_Seney_published_meta_analysis_results_20230427.txt`.
These results are from the meta-analysis with Seney et al. published summary stats (Batch 1, Batch 2, Seney et al.):
96 significant DEGs (FDR < 0.05)


We will filter the results to obtain two sets of significantly expressed genes:

- List of genes with a Benjamini-Hochberg FDR <0.05
- List of genes with a Benjamini-Hochberg FDR <0.10


Overall, this notebook will address the biological question of whether significantly differentially expressed genes (and their proximal regions) from the opioid use disorder DEG meta-analysis results showed enrichment for genetic signal associated with opioid overdose deaths or any related phenotypes from the following EUR-specific studies:

___

<details>
    <summary>phenotype list</summary>
    
`sleep studies`:
* Insomnia (Jansen et al., 2019 Nat Genet [30804565](https://pubmed.ncbi.nlm.nih.gov/30804565/))
* Insomnia (Lane et al., 2019 Nat Genet [30804566](https://pubmed.ncbi.nlm.nih.gov/30804566/))
* LongSleepDur (Dashti et al., 2019 Nat Commun [30846698](https://pubmed.ncbi.nlm.nih.gov/30846698/))
* ShortSleepDur (Dashti et al., 2019 Nat Commun [30846698](https://pubmed.ncbi.nlm.nih.gov/30846698/))
* sleepDuration (Dashti et al., 2019 Nat Commun [30846698](https://pubmed.ncbi.nlm.nih.gov/30846698/))
* Sleepdur (Jansen et al., 2019 Nat Genet [30804565](https://pubmed.ncbi.nlm.nih.gov/30804565/))
    
`41 other studies:`
* Age of Initiation  (Liu et al., 2019 Nat Genet [30643251](https://pubmed.ncbi.nlm.nih.gov/30643251/))
* Alcohol Dependence (Walters et al., 2018 Nat Neurosci [30482948](https://pubmed.ncbi.nlm.nih.gov/30482948))
* Alcohol Drinks per Week (DPW) (Liu et al., 2019 Nat Genet [30643251]())
* Alzheimer's Disease (Lambert et al., 2013 Nat Genet [24162737](https://pubmed.ncbi.nlm.nih.gov/24162737))
* Amyotrophic Lateral Sclerosis (Rheenen et al., 2016 Nat Genet [27455348](https://pubmed.ncbi.nlm.nih.gov/27455348))
* Anorexia Nervosa (Watson et al., 2019 Nat Genet [31308545](https://pubmed.ncbi.nlm.nih.gov/31308545))
* Attention Deficit Hyperactivity Disorder (Demontis et al., 2019 Nat Genet [30478444]())
* Autism Spectrum Disorders (Grove et al., 2019 Nat Genet [30804558](https://pubmed.ncbi.nlm.nih.gov/30804558))
* Bipolar Disorder (Stahl et al., 2019 Nat Genet [31043756](https://pubmed.ncbi.nlm.nih.gov/31043756))
* Cannabis Use Disorder (CUD) (Demontis et al., 2019 Nat Neurosci [31209380](https://pubmed.ncbi.nlm.nih.gov/31209380))
* Childhood IQ (Benyamin et al., 2014 Mol Psychiatry [23358156](https://pubmed.ncbi.nlm.nih.gov/23358156))
* Cigarettes Per Day (Liu et al., 2019 Nat Genet [30643251](https://pubmed.ncbi.nlm.nih.gov/30643251/))
* College Completion (Rietveld et al., 2013 Science [23722424](https://pubmed.ncbi.nlm.nih.gov/23722424))
* Cotinine Levels (Ware et al., 2016 Sci Rep [26833182](https://pubmed.ncbi.nlm.nih.gov/26833182/))
* Fagerstrom Test for Nicotine Dependence (FTND) (Quach et al., 2020 Nat Commun [33144568](https://pubmed.ncbi.nlm.nih.gov/33144568/))
* Heaviness of Smoking Index (HSI) (Quach et al., 2020 Nat Commun [33144568](https://pubmed.ncbi.nlm.nih.gov/33144568/))
* Intelligence (Sniekers et al., 2017 Nat Genet [28530673](https://pubmed.ncbi.nlm.nih.gov/28530673))
* Lifetime Cannabis Use (Ever vs. Never) (Pasman et al., 2018 Nat Neurosci [30150663](https://pubmed.ncbi.nlm.nih.gov/30150663))
* Major Depressive Disorder (Howard et al., 2018 Nat Commun [29662059](https://pubmed.ncbi.nlm.nih.gov/29662059))
* Mean Accumbens Volume (Hibar et al., 2015 Nature [25607358](https://pubmed.ncbi.nlm.nih.gov/25607358/))
* Mean Caudate Volume (Hibar et al., 2015 Nature [25607358](https://pubmed.ncbi.nlm.nih.gov/25607358/))
* Mean Hippocampus Volume (Hibar et al., 2015 Nature [25607358](https://pubmed.ncbi.nlm.nih.gov/25607358/))
* Mean Pallidum Volume (Hibar et al., 2015 Nature [25607358](https://pubmed.ncbi.nlm.nih.gov/25607358/))
* Mean Putamen Volume (Hibar et al., 2015 Nature [25607358](https://pubmed.ncbi.nlm.nih.gov/25607358/))
* Mean Thalamus Volume (Hibar et al., 2015 Nature [25607358](https://pubmed.ncbi.nlm.nih.gov/25607358/))
* Neo-conscientiousness (de Moor et al., 2012 Mol Psychiatry [21173776](https://pubmed.ncbi.nlm.nih.gov/21173776))
* Neo-openness to Experience (de Moor et al., 2012 Mol Psychiatry [21173776](https://pubmed.ncbi.nlm.nih.gov/21173776))
* Neuroticism (Okbay et al., 2016 Nat Genet [27089181]())
* Opioid Addiction: GENOA GWAS meta-analysis
* Opioid Addiction: gSEM OA GWAS meta-analysis (i.e., GENOA, MVP-SAGE-YP, PGC-SUD, and Partners Health)
* Parkinson's Disease (Sanchez et al., 2009 Nat Genet [19915575](https://pubmed.ncbi.nlm.nih.gov/19915575))
* Post-traumatic Stress Disorder (Nievergelt et al., 2019 Nat Commun [31594949](https://pubmed.ncbi.nlm.nih.gov/31594949))
* Psychiatric Genetics Consortium Cross-disorder GWAS (Schizophrenia, Bipolar Disorder, MDD, ASD and ADHD) (Cross-Disorder Group of the Psychiatric Genomics Consortium, 2013 Lancet [23453885](https://pubmed.ncbi.nlm.nih.gov/23453885))
* Schizophrenia (Ripke et al., 2014 Nature [25056061](https://pubmed.ncbi.nlm.nih.gov/25056061))
* Smoking Cessation (Liu et al., 2019 Nat Genet [30643251](https://pubmed.ncbi.nlm.nih.gov/30643251/))
* Smoking Initiation (Liu et al., 2019 Nat Genet [30643251](https://pubmed.ncbi.nlm.nih.gov/30643251/))
* Subjective Well Being (Okbay et al., 2016 Nat Genet [27089181](https://pubmed.ncbi.nlm.nih.gov/27089181))
* Total Intracranial Volume (ICV) (Hibar et al., 2015 Nature [25607358](https://pubmed.ncbi.nlm.nih.gov/25607358/))
* Years of Schooling (Okbay et al., 2016 Nature [27225129](https://pubmed.ncbi.nlm.nih.gov/27225129))
</details><br><br>

Download file from GitHub https://github.com/RTIInternational/bioinformatics/files/11465503/hic_NAc_Seney_published_meta_analysis_results_20230427.txt

There was an issue with the file missing a column header. Caryn instructed Jesse to add in the header manually.

```css
"It looks like I have the Ensembl id’s as the row names so it didn’t have a column name but you could add Gene before SYMBOL to fix that."
```

In [11]:
sed -i.bak '1s/^/"Gene"\t/' hic_NAc_Seney_published_meta_analysis_results_20230427.txt
sed -i.bak 's/"//g' hic_NAc_Seney_published_meta_analysis_results_20230427.txt

In [12]:
%%bash

head ~/Downloads/hic_NAc_Seney_published_meta_analysis_results_20230427.txt

Gene	SYMBOL	pvalue	overall.eff.direction	p.adjust.bonferroni	p.adjust.fdr
ENSG00000000003	TSPAN6	0.435375129889047	+	1	0.801291554092373
ENSG00000000419	DPM1	0.48850757113392	+	1	0.845234634361694
ENSG00000000457	SCYL3	0.681123215633089	+	1	0.979037097735917
ENSG00000000460	C1orf112	0.0516570130885042	+	1	0.353211250563386
ENSG00000000938	FGR	0.742674561619552	-	1	1
ENSG00000000971	CFH	0.139491690296594	-	1	0.500835281920593
ENSG00000001036	FUCA2	0.619340681287323	+	1	0.941892016492497
ENSG00000001084	GCLC	0.711857457254194	-	1	0.996942214950086
ENSG00000001167	NFYA	0.929884555311393	+	1	1


## Create FDR filtered files
Only need the "Gene" column.

In [22]:
%%bash

meta=~/Downloads/hic_NAc_Seney_published_meta_analysis_results_20230427


# create FDR filtered file: 0.05
# use last column "p.adjust.fdr"
> ${meta}_fdr0.05_genes.txt
tail -n +2 ${meta}.txt | \
    awk '$NF < 0.05 {print $1}' >> ${meta}_fdr0.05_genes.txt


# create FDR filtered file: 0.10
# use last column "p.adjust.fdr"
> ${meta}_fdr0.10_genes.txt
tail -n +2 ${meta}.txt | \
    awk '$NF < 0.10 {print $1}' >> ${meta}_fdr0.10_genes.txt


wc -l ${meta}.txt
wc -l ${meta}_fdr0.10_genes.txt
wc -l ${meta}_fdr0.05_genes.txt

   13419 /Users/jmarks/Downloads/hic_NAc_Seney_published_meta_analysis_results_20230427.txt
     240 /Users/jmarks/Downloads/hic_NAc_Seney_published_meta_analysis_results_20230427_fdr0.10_genes.txt
      96 /Users/jmarks/Downloads/hic_NAc_Seney_published_meta_analysis_results_20230427_fdr0.05_genes.txt


## Create Gene Coordinate File
One of the requirements of running [partioned heritability analysis with LDSC software](https://github.com/bulik/ldsc/wiki/LD-Score-Estimation-Tutorial#partitioned-ld-scores) is to provide a gene coordinate file.
With columns:
```css
GENE, CHR, START, END
```
where START and END are base pair coordinates of TSS and TES (Transcription Start Site and Transcription End Site, respectively).
This file can contain more genes than are in the gene set. 
It's used to define the genomic regions or intervals corresponding to individual genes.
It allows researchers to assess the contribution of specific gene regions or functional elements to the overall heritability of a trait.

By associating genetic variants with these defined gene regions, partitioned heritability analysis can estimate the proportion of trait heritability explained by each gene or functional element. This analysis helps identify which specific genes or genomic regions are significantly associated with the trait of interest, providing insights into the genetic architecture of the trait and potential biological mechanisms involved.

In summary, the gene coordinate file is a crucial input in partitioned heritability analysis as it defines the boundaries of genes and enables the investigation of the genetic contribution of different genomic regions to complex traits or diseases.

### Use GENCODE v40 IDS annotation file
We will download the GENCODE v40 annotation file to capture the TSS and TES for our gene coordinate file.

Note that our meta-analysis results are in in gencode v40 hg38.
But the LD-scores for LDSC are in build hg19. 
Thus, we will have to use the [gencode GRCh37_mapping](https://ftp.ebi.ac.uk/pub/databases/gencode/Gencode_human/release_40/GRCh37_mapping/) to get the TSS and TES in GRCh37 coordinates.

In [None]:
# download gencode v40 annotation file
wget https://ftp.ebi.ac.uk/pub/databases/gencode/Gencode_human/release_40/GRCh37_mapping/gencode.v40lift37.annotation.gtf.gz

In [10]:
%%bash

gunzip --to-stdout ~/Downloads/gencode.v40lift37.annotation.gtf.gz | head

#description: evidence-based annotation of the human genome, version 30 (Ensembl 96), mapped to GRCh37 with gencode-backmap
#provider: GENCODE
#contact: gencode-help@ebi.ac.uk
#format: gtf
#date: 2019-04-02
chr1	HAVANA	gene	11869	14409	.	+	.	gene_id "ENSG00000223972.5_2"; gene_type "transcribed_unprocessed_pseudogene"; gene_name "DDX11L1"; level 2; havana_gene "OTTHUMG00000000961.2_2"; remap_status "full_contig"; remap_num_mappings 1; remap_target_status "overlap";
chr1	HAVANA	transcript	11869	14409	.	+	.	gene_id "ENSG00000223972.5_2"; transcript_id "ENST00000456328.2_1"; gene_type "transcribed_unprocessed_pseudogene"; gene_name "DDX11L1"; transcript_type "processed_transcript"; transcript_name "DDX11L1-202"; level 2; transcript_support_level 1; tag "basic"; havana_gene "OTTHUMG00000000961.2_2"; havana_transcript "OTTHUMT00000362751.1_1"; remap_num_mappings 1; remap_status "full_contig"; remap_target_status "overlap";
chr1	HAVANA	exon	11869	12227	.	+	.	gene_id "ENSG00000223972.5_2"; tr

<br><br>

get the start and end positions for each gene
https://www.gencodegenes.org/pages/data_format.html

| column-number |              content             |                                      values/format                                     |
|:-------------:|:--------------------------------:|:--------------------------------------------------------------------------------------:|
| 1             | chromosome name                  | chr{1,2,3,4,5,6,7,8,9,10,11,12,13,14,15,16,17,18,19,20,21,22,X,Y,M} or GRC accession a |
| 2             | annotation source                | {ENSEMBL,HAVANA}                                                                       |
| 3             | feature type                     | {gene,transcript,exon,CDS,UTR,start_codon,stop_codon,Selenocysteine}                   |
| 4             | genomic start location           | integer-value (1-based)                                                                |
| 5             | genomic end location             | integer-value                                                                          |
| 6             | score(not used)                  | .                                                                                      |
| 7             | genomic strand                   | {+,-}                                                                                  |
| 8             | genomic phase (for CDS features) | {0,1,2,.}                                                                              |

as well as gene_id in column 9.

We want 

```css
GENE CHR START END

```
for our coordinate file, so columns: 9, 1, 4, 5

In [6]:
import gzip

annfile = "/Users/jmarks/Downloads/gencode.v40lift37.annotation.gtf.gz"
outfile = "/Users/jmarks/Downloads/gencode.v40lift37.coordinate_file.txt"

with gzip.open(annfile, 'rt') as annfile, open(outfile, 'w') as outfile:
    for _ in range(5):
        next(annfile)
    line = annfile.readline()
    outfile.write("GENE\tCHR\tSTART\tEND\n")

    while line:
        sl = line.split("\t")
        chrom = sl[0]
        #if chrom[:3] == "chr": # some instances of GL000204.1 genes
        if sl[2] == "gene" and chrom not in ("chrX", "chrY", "chrM"): # LDSC only contains LDscores for autosomes
            #chrom = chrom.split("chr")[1] # remove "chr" prefix
            start = sl[3]
            end = sl[4]
            gencode = sl[8].split(";")[0] # remove all additional info after ;
            gencode = gencode.split(" ")[1].strip('"') # remove "gene_id" portion, and double quotes
            gencode = gencode.split(".")[0] # remove suffix <ENSG...>
            outline = f"{gencode}\t{chrom}\t{start}\t{end}\n" # gene, chr, start-, and end-genomic position
            outfile.write(outline)
        line = annfile.readline()

Note, another option could have been to use Bryan Quach's 
GENCODE data munging R functions. See https://github.com/bryancquach/omixjutsu
`subset_gencode_gtf`

## munge sumstats
Summary statisics should be in [Summary Statistics File Format](https://github.com/bulik/ldsc/wiki/Summary-Statistics-File-Format).

For our analysis,some of the sumstats were already munged, so we will just use them when we can.
Others we will have to download an munge. 

Note that we did this for another analysis, so nothing to do here.
See the following notebook for notes on processing: https://github.com/RTIInternational/jaamarks_notebooks/blob/master/heroin/develop/oud_twas_stratified_ldsc/20230426_oud_stratified_ldsc_on_twas_meta.ipynb

# Stratified LDSC
https://github.com/bulik/ldsc/wiki/LD-Score-Estimation-Tutorial#partitioned-ld-scores

## Download munged sumstats

In [None]:
mkdir sumstats && cd sumstats

aws s3 cp s3://rti-shared/ldsc/data/gscan_liu2019/munged/AgeOfInitiation.txt.munged.merged.txt.gz .
aws s3 cp s3://rti-shared/ldsc/data/alcohol_dependence_walters2018_nat_neurosci/munged/pgc_alcdep.eur_discovery.aug2018_release.txt.munged.merged.txt.gz .
aws s3 cp s3://rti-shared/ldsc/data/gscan_liu2019/munged/DrinksPerWeek.txt.munged.merged.txt.gz .
aws s3 cp s3://rti-shared/ldsc/data/alzheimers_disease_lambert2013_nat_genet/munged/alzheimers_disease_lambert2013_nat_genet.sumstats.gz .
aws s3 cp s3://rti-shared/ldsc/data/amyotrophic_lateral_sclerosis_rheenen2016_nat_genet/munged/amyotrophic_lateral_sclerosis_rheenen2016_nat_genet.sumstats.gz .
aws s3 cp s3://rti-shared/ldsc/data/anorexia_watson2019_nat_genet/munged/anorexia_watson2019_workflow_ready.txt.munged.merged.txt.gz .
aws s3 cp s3://rti-shared/ldsc/data/adhd_demontis2018_nat_genet/munged/daner_meta_filtered_NA_iPSYCH23_PGC11_sigPCs_woSEX_2ell6sd_EUR_Neff_70.meta.munged.merged.txt.gz .
aws s3 cp s3://rti-shared/ldsc/data/autism_spectrum_disorder_grove2019_nat_genet/munged/iPSYCH-PGC_ASD_Nov2017.munged.merged.txt.gz .
aws s3 cp s3://rti-shared/ldsc/data/bipolar_disorder_stahl2019_nat_genet/munged/daner_PGC_BIP32b_mds7a_0416a.munged.merged.txt.gz .
aws s3 cp s3://rti-shared/ldsc/data/brain_volume_hibar2015_nature/munged/ENIGMA2_MeanAccumbens_Combined_GenomeControlled_Jan23.tbl.sumstats.gz . # Mean Accumbens Volume (Hibar et al., 2015 Nature 25607358)
#10

aws s3 cp s3://rti-shared/ldsc/data/brain_volume_hibar2015_nature/munged/ENIGMA2_MeanAmygdala_Combined_GenomeControlled_Jan23.tbl.sumstats.gz .
aws s3 cp s3://rti-shared/ldsc/data/brain_volume_hibar2015_nature/munged/ENIGMA2_MeanCaudate_Combined_GenomeControlled_Jan23.tbl.sumstats.gz . # Mean Caudate Volume (Hibar et al., 2015 Nature 25607358)
aws s3 cp s3://rti-shared/ldsc/data/brain_volume_hibar2015_nature/munged/ENIGMA2_MeanHippocampus_Combined_GenomeControlled_Jan23.tbl.sumstats.gz . # Mean Hippocampus Volume (Hibar et al., 2015 Nature 25607358)
aws s3 cp s3://rti-shared/ldsc/data/brain_volume_hibar2015_nature/munged/ENIGMA2_MeanPallidum_Combined_GenomeControlled_Jan23.tbl.sumstats.gz . # Mean Pallidum Volume (Hibar et al., 2015 Nature 25607358)
aws s3 cp s3://rti-shared/ldsc/data/brain_volume_hibar2015_nature/munged/ENIGMA2_MeanPutamen_Combined_GenomeControlled_Jan23.tbl.sumstats.gz . # Mean Putamen Volume (Hibar et al., 2015 Nature 25607358)
aws s3 cp s3://rti-shared/ldsc/data/brain_volume_hibar2015_nature/munged/ENIGMA2_MeanThalamus_Combined_GenomeControlled_Jan23.tbl.sumstats.gz . # Mean Thalamus Volume (Hibar et al., 2015 Nature 25607358)
aws s3 cp s3://rti-shared/ldsc/data/brain_volume_hibar2015_nature/munged/ENIGMA2_ICV_Combined_GenomeControlled_Jan23.tbl.sumstats.gz .
aws s3 cp s3://rti-shared/ldsc/data/cannabis_use_disorder_demontis2019_nat_neurosci/munged/CUD_GWAS_iPSYCH_June2019.munged.merged.txt.gz .
aws s3 cp s3://rti-shared/ldsc/data/childhood_intelligence_benyamin2014_mol_psych/munged/CHIC_Summary_Benyamin2014.sumstats.gz . # Childhood IQ s3://rti-shared/gwas_publicly_available_sumstats/childhood_intelligence_benyamin2014_mol_psych/raw/CHIC_Summary_Benyamin2014.txt.gz
aws s3 cp s3://rti-shared/ldsc/data/gscan_liu2019/munged/CigarettesPerDay.txt.munged.merged.txt.gz .
#20

aws s3 cp s3://rti-shared/ldsc/data/educational_attainment_rietveld2013_science/munged/SSGAC_College_Rietveld2013_publicrelease.sumstats.gz . # College completion
aws s3 cp s3://rti-shared/ldsc/data/cotinine_levels_ware2016_sci_rep/munged/cotinine_ware2016_workflow_ready.txt.munged.merged.txt.gz .
aws s3 cp s3://rti-shared/ldsc/data/depressive_symptoms_okbay2016/munged/DS_Full.txt.munged.merged.txt.gz .
aws s3 cp s3://rti-shared/ldsc/data/nicotine_dependence_quach2020_nat_commun/munged/ftnd_wave3_eur_quach2020_workflow_ready.txt.munged.merged.txt.gz .
aws s3 cp s3://rti-shared/ldsc/data/ukb_hsi/munged/ukb_gwa_003_workflow_ready.txt.munged.merged.txt.gz .
aws s3 cp s3://rti-shared/ldsc/data/intelligence_sniekers2017_nat_genet/munged/intelligence_sniekers2017_nat_genet_sumstats_formatted.sumstats.gz .
aws s3 cp s3://rti-shared/ldsc/data/lifetime_cannabis_use_pasman2018_nat_neurosci/munged/cannabis_icc_ukb_workflow_ready.txt.munged.merged.txt.gz .
aws s3 cp s3://rti-shared/ldsc/data/major_depressive_disorder_howard2018_nat_commun/munged/pgc_ukb_depression_gwas_workflow_ready.txt.munged.merged.txt.gz .
aws s3 cp s3://rti-shared/ldsc/data/personality_demoor_2012_mol_psych/munged/GPC-1.NEO-CONSCIENTIOUSNESS.full.with_header.h19.sumstats.gz . # Neo-conscientiousness (de Moor et al., 2012 Mol Psychiatry 21173776)
aws s3 cp s3://rti-shared/ldsc/data/personality_demoor_2012_mol_psych/munged/GPC-1.NEO-OPENNESS.full.with_header.hg19.sumstats.gz .# Neo-openness to Experience (de Moor et al., 2012 Mol Psychiatry 21173776)
#30

aws s3 cp s3://rti-shared/ldsc/data/neuroticism_okbay2016_nat_genet/munged/neuroticism_okbay2016_nat_genet.sumstats.gz .
aws s3 cp s3://rti-shared/ldsc/data/opioid_addiction_gaddis_mathur2022_sci_rep/munged/cats+coga+decode+kreek+odb+uhs+vidus+yale-penn.ea.chrall.maf_gt_0.01.rsq_gt_0.8.sumstats_formatted.sumstats.gz .
aws s3 cp s3://rti-heroin/rti-midas-data/studies/ngc/GenomicSEM/results/29/gSEM/final/munged/genomicSEM_GWAS.oaALL.MVP1_MVP2_YP_SAGE.PGC.Song.table.sumstats.gz .
aws s3 cp s3://rti-shared/ldsc/data/parkinsons_disease_sanchez2009_nat_genet/munged/parkinsons_disease_sanchez2009_nat_genet.sumstats.gz .
aws s3 cp s3://rti-shared/ldsc/data/ptsd_nievergelt2019_nat_commun/munged/pts_eur_freeze2_overall.results.munged.merged.txt.gz .
aws s3 cp s3://rti-shared/ldsc/data/cross_disorder_gwas_pgc2013_lancet/munged/pgc.cross.full.2013-03.hg19.sumstats.gz . # pgc needs a liftover from hg18 (s3://rti-shared/gwas_publicly_available_sumstats/cross_disorder_gwas_pgc2013_lancet/raw/)
aws s3 cp s3://rti-shared/ldsc/data/schizophrenia_ripke2014_nature/munged/daner_natgen_pgc_eur.munged.merged.txt.gz .
aws s3 cp s3://rti-shared/ldsc/data/gscan_liu2019/munged/SmokingCessation.txt.munged.merged.txt.gz .
aws s3 cp s3://rti-shared/ldsc/data/gscan_liu2019/munged/SmokingInitiation.txt.munged.merged.txt.gz .
aws s3 cp s3://rti-shared/ldsc/data/subjective_wellbeing_okbay2016_nat_genet/munged/SWB_Full.sumstats.gz . # Subjective Well Being (Okbay et al., 2016 Nat Genet 27089181)
#40

aws s3 cp s3://rti-shared/ldsc/data/years_schooling_okbay2022_nat_genet/munged/EA4_additive_excl_23andMe.sumstats.gz . # Years of Education (Okbay et al., 2022 Nature Genetics  35361970)
aws s3 cp s3://rti-shared/gwas_publicly_available_sumstats/insomnia_jansen2019_nat_genet/raw/Insomnia-Jansen_2019.sumstats.gz .
aws s3 cp s3://rti-shared/gwas_publicly_available_sumstats/insomnia_lane2019_nat_genet/raw/Insomnia-Lane_2019.sumstats.gz .
aws s3 cp s3://rti-shared/gwas_publicly_available_sumstats/long_sleep_duration_dashti2019_nat_commun/raw/LongSleepDur-Dashti_2019.sumstats.gz .
aws s3 cp s3://rti-shared/gwas_publicly_available_sumstats/short_sleep_duration_dashti2019_nat_commun/raw/ShortSleepDur-Dashti_2019.sumstats.gz .
aws s3 cp s3://rti-shared/gwas_publicly_available_sumstats/sleep_duration_dashti2019_nat_commun/raw/SleepDuration-Dashti_2019.sumstats.gz .
aws s3 cp s3://rti-shared/gwas_publicly_available_sumstats/sleep_duration_jansen2019_nat_genet/raw/Sleepdur-Jansen_2019.sumstats.gz .
#47

## Download LD-scores
And other necessary files for sLDSC

In [None]:
# make dir for gene coordinate BED file and geneset, logs, and 1000g files
# make dir for logs
mkdir {gene_files,logs,1000g} 

cd 1000g
# download files needed for partitioned heritability analysis
aws s3 cp s3://rti-shared/ldsc/ld_score_reference/1000G/LDSCORE-1000G_Phase3_baseline_ldscores.tar .
aws s3 cp s3://rti-shared/ldsc/ld_score_reference/1000G/LDSCORE-1000G_Phase3_frq.tar .
aws s3 cp s3://rti-shared/ldsc/ld_score_reference/1000G/LDSCORE-1000G_Phase3_plinkfiles.tar .
aws s3 cp s3://rti-shared/ldsc/ld_score_reference/1000G/LDSCORE-weights_hm3_no_hla.tar .
#wget https://storage.googleapis.com/broad-alkesgroup-public/LDSCORE/1000G_phase3_baseline_ldscores.tgz
#wget https://storage.googleapis.com/broad-alkesgroup-public/LDSCORE/1000G_Phase3_plinkfiles.tgz
#wget https://storage.googleapis.com/broad-alkesgroup-public/LDSCORE/1000G_Phase3_frq.tgz
#wget https://storage.googleapis.com/broad-alkesgroup-public/LDSCORE/weights_hm3_no_hla.tgz


# extract files
tar -xvf LDSCORE-1000G_Phase3_baseline_ldscores.tar
tar -xvf LDSCORE-1000G_Phase3_plinkfiles.tar
tar -xvf LDSCORE-1000G_Phase3_frq.tar
tar -xvf LDSCORE-weights_hm3_no_hla.tar

## Start sLDSC

In [None]:
# interactive session
docker run -it -v $PWD:/data/ \
    rtibiocloud/ldsc:v1.0.1_0bb574e bash

In [None]:
date=20230718
window=100000

for fdr in {"0.05","0.10"}; do # loop through each BED file
    coord_file=/data/gene_files/gencode.v40lift37.coordinate_file.txt 
    geneset_file=/data/gene_files/hic_NAc_Seney_published_meta_analysis_results_20230427_fdr${fdr}_genes.txt

    # store processing files for each meta in separate dir
    mkdir -p /data/{annotations_ldscores,results}/fdr$fdr/

    for j in {1..22}; do # loop through each chromosome
        # create annotation files
        python /opt/ldsc/make_annot.py \
            --gene-set-file $geneset_file \
            --gene-coord-file $coord_file \
            --windowsize $window \
            --bimfile /data/1000g/1000G_EUR_Phase3_plink/1000G.EUR.QC.$j.bim \
            --annot-file /data/annotations_ldscores/fdr$fdr/meta_fdr${fdr}genes_window${window}_chr$j.annot.gz | \
        tee -a /data/logs/sldsc_${date}.log

        # compute LD scores
        python /opt/ldsc/ldsc.py \
            --l2 \
            --thin-annot \
            --ld-wind-cm 1 \
            --print-snps /data/1000g/1000G_EUR_Phase3_baseline/print_snps.txt \
            --bfile /data/1000g/1000G_EUR_Phase3_plink/1000G.EUR.QC.$j \
            --annot /data/annotations_ldscores/fdr$fdr/meta_fdr${fdr}genes_window${window}_chr$j.annot.gz \
            --out /data/annotations_ldscores/fdr$fdr/meta_fdr${fdr}genes_window${window}_chr$j | \
        tee -a /data/logs/sldsc_${date}.log
    done # end chr loop

    for trait in {"insomnia_jansen","insomnia_lane","long_sleep_duration_dashti","short_sleep_duration_dashti","sleep_duration_dashti","sleep_duration_jansen","adhd","age_of_initiation","alcohol_dependence","alzheimers_disease","amyotrophic_lateral_sclerosis","anorexia","autism","bipolar","brain_volume_mean_accumbens","brain_volume_mean_amygdala","brain_volume_mean_caudate","brain_volume_mean_hippocampus","brain_volume_mean_pallidum","brain_volume_mean_putamen","brain_volume_mean_thalamus","brain_volume_total_intracranial","cannabis_use_disorder","childhood_intelligence","cigarettes_per_day","college_completion","cotinine_levels","cross_disorder","depressive_symptoms","drinks_per_week","ftnd","heaviness_smoking_index","intelligence","lifetime_cannabis_use","major_depressive_disorder","neo_conscientiousness","neo_openness","neuroticism","opioid_addiction_144","opioid_addiction_gsem","parkinsons","ptsd","schizophrenia","smoking_cessation","smoking_initiation","subjective_wellbeing","years_of_education"}; do # loop through all traits
        case $trait in  # use sumstats files that corresponds to the trait name for the h2 estimate
            "insomnia_jansen") stats=/data/sumstats/Insomnia-Jansen_2019.sumstats.gz ;;
            "insomnia_lane") stats=/data/sumstats/Insomnia-Lane_2019.sumstats.gz ;;
            "long_sleep_duration_dashti") stats=/data/sumstats/LongSleepDur-Dashti_2019.sumstats.gz ;;
            "short_sleep_duration_dashti") stats=/data/sumstats/ShortSleepDur-Dashti_2019.sumstats.gz ;;
            "sleep_duration_dashti") stats=/data/sumstats/SleepDuration-Dashti_2019.sumstats.gz ;;
            "sleep_duration_jansen") stats=/data/sumstats/Sleepdur-Jansen_2019.sumstats.gz ;;
            "adhd") stats=/data/sumstats/daner_meta_filtered_NA_iPSYCH23_PGC11_sigPCs_woSEX_2ell6sd_EUR_Neff_70.meta.munged.merged.txt.gz ;;
            "age_of_initiation") stats=/data/sumstats/AgeOfInitiation.txt.munged.merged.txt.gz ;;
            "alcohol_dependence") stats=/data/sumstats/pgc_alcdep.eur_discovery.aug2018_release.txt.munged.merged.txt.gz ;;
            "alzheimers_disease") stats=/data/sumstats/alzheimers_disease_lambert2013_nat_genet.sumstats.gz ;;
            "amyotrophic_lateral_sclerosis") stats=/data/sumstats/amyotrophic_lateral_sclerosis_rheenen2016_nat_genet.sumstats.gz ;;
            "anorexia") stats=/data/sumstats/anorexia_watson2019_workflow_ready.txt.munged.merged.txt.gz ;;
            "autism") stats=/data/sumstats/iPSYCH-PGC_ASD_Nov2017.munged.merged.txt.gz ;;
            "bipolar") stats=/data/sumstats/daner_PGC_BIP32b_mds7a_0416a.munged.merged.txt.gz ;;
            "brain_volume_mean_accumbens") stats=/data/sumstats/ENIGMA2_MeanAccumbens_Combined_GenomeControlled_Jan23.tbl.sumstats.gz ;;
            "brain_volume_mean_amygdala") stats=/data/sumstats/ENIGMA2_MeanAmygdala_Combined_GenomeControlled_Jan23.tbl.sumstats.gz ;;
            "brain_volume_mean_caudate") stats=/data/sumstats/ENIGMA2_MeanCaudate_Combined_GenomeControlled_Jan23.tbl.sumstats.gz ;;
            "brain_volume_mean_hippocampus") stats=/data/sumstats/ENIGMA2_MeanHippocampus_Combined_GenomeControlled_Jan23.tbl.sumstats.gz ;;
            "brain_volume_mean_pallidum") stats=/data/sumstats/ENIGMA2_MeanPallidum_Combined_GenomeControlled_Jan23.tbl.sumstats.gz ;;
            "brain_volume_mean_putamen") stats=/data/sumstats/ENIGMA2_MeanPutamen_Combined_GenomeControlled_Jan23.tbl.sumstats.gz ;;
            "brain_volume_mean_thalamus") stats=/data/sumstats/ENIGMA2_MeanThalamus_Combined_GenomeControlled_Jan23.tbl.sumstats.gz ;;
            "brain_volume_total_intracranial") stats=/data/sumstats/ENIGMA2_ICV_Combined_GenomeControlled_Jan23.tbl.sumstats.gz ;;
            "cannabis_use_disorder") stats=/data/sumstats/CUD_GWAS_iPSYCH_June2019.munged.merged.txt.gz ;;
            "childhood_intelligence") stats=/data/sumstats/CHIC_Summary_Benyamin2014.sumstats.gz ;;
            "cigarettes_per_day") stats=/data/sumstats/CigarettesPerDay.txt.munged.merged.txt.gz ;;
            "college_completion") stats=/data/sumstats/SSGAC_College_Rietveld2013_publicrelease.sumstats.gz ;;
            "cotinine_levels") stats=/data/sumstats/cotinine_ware2016_workflow_ready.txt.munged.merged.txt.gz ;;
            "cross_disorder") stats=/data/sumstats/pgc.cross.full.2013-03.hg19.sumstats.gz ;;
            "depressive_symptoms") stats=/data/sumstats/DS_Full.txt.munged.merged.txt.gz ;;
            "drinks_per_week") stats=/data/sumstats/DrinksPerWeek.txt.munged.merged.txt.gz ;;
            "ftnd") stats=/data/sumstats/ftnd_wave3_eur_quach2020_workflow_ready.txt.munged.merged.txt.gz ;;
            "heaviness_smoking_index") stats=/data/sumstats/ukb_gwa_003_workflow_ready.txt.munged.merged.txt.gz ;;
            "intelligence") stats=/data/sumstats/intelligence_sniekers2017_nat_genet_sumstats_formatted.sumstats.gz ;;
            "lifetime_cannabis_use") stats=/data/sumstats/cannabis_icc_ukb_workflow_ready.txt.munged.merged.txt.gz ;;
            "major_depressive_disorder") stats=/data/sumstats/pgc_ukb_depression_gwas_workflow_ready.txt.munged.merged.txt.gz ;;
            "neo_conscientiousness") stats=/data/sumstats/GPC-1.NEO-CONSCIENTIOUSNESS.full.with_header.h19.sumstats.gz ;;
            "neo_openness") stats=/data/sumstats/GPC-1.NEO-OPENNESS.full.with_header.hg19.sumstats.gz ;;
            "neuroticism") stats=/data/sumstats/neuroticism_okbay2016_nat_genet.sumstats.gz ;;
            "opioid_addiction_144") stats=/data/sumstats/cats+coga+decode+kreek+odb+uhs+vidus+yale-penn.ea.chrall.maf_gt_0.01.rsq_gt_0.8.sumstats_formatted.sumstats.gz ;;
            "opioid_addiction_gsem") stats=/data/sumstats/genomicSEM_GWAS.oaALL.MVP1_MVP2_YP_SAGE.PGC.Song.table.sumstats.gz ;;
            "parkinsons") stats=/data/sumstats/parkinsons_disease_sanchez2009_nat_genet.sumstats.gz ;;
            "ptsd") stats=/data/sumstats/pts_eur_freeze2_overall.results.munged.merged.txt.gz ;;
            "schizophrenia") stats=/data/sumstats/daner_natgen_pgc_eur.munged.merged.txt.gz ;;
            "smoking_cessation") stats=/data/sumstats/SmokingCessation.txt.munged.merged.txt.gz ;;
            "smoking_initiation") stats=/data/sumstats/SmokingInitiation.txt.munged.merged.txt.gz ;;
            "subjective_wellbeing") stats=/data/sumstats/SWB_Full.sumstats.gz ;;
            "years_of_education") stats=/data/sumstats/EA4_additive_excl_23andMe.sumstats.gz ;;
        esac

        # computed partitioned heritability estimate
        python /opt/ldsc/ldsc.py \
            --h2 $stats \
            --overlap-annot \
            --print-coefficients \
            --w-ld-chr "/data/1000g/weights_hm3_no_hla/weights." \
            --frqfile-chr "/data/1000g/1000G_Phase3_frq/1000G.EUR.QC." \
            --ref-ld-chr "/data/annotations_ldscores/fdr$fdr/meta_fdr${fdr}genes_window${window}_chr,/data/1000g/1000G_EUR_Phase3_baseline/baseline." \
            --out "/data/results/fdr$fdr/${trait}_with_meta_analysis_deg_genes_fdr${fdr}_window${window}" | \
          tee -a /data/logs/sldsc_${date}.log
    done
done

In [None]:
cd results/

for fdr in {"0.05","0.10"}; do
    outfile=${date}_all_phenotypes_with_meta_analysis_deg_fdr${fdr}_window${window}_final_results.tsv
    touch $outfile
    head -1 fdr${fdr}/smoking_initiation_with_meta_analysis_deg_genes_fdr${fdr}_window${window}.results > $outfile
        
    for file in fdr${fdr}/*_fdr${fdr}_window${window}.results; do
        trait=$(echo $file | sed "s/_with_oa_twas_meta_analysis_deg_genes_fdr.*//") # remove suffix
        trait=$(echo $trait | sed "s/fdr$fdr\///") # remove directory prefix
        #echo $trait
        awk -v trait=$trait \
        '$1 = trait {print $0}' OFS="\t" <(tail -n +2 $file | head -1) >> $outfile
    done
done

## fdr0.05_window100000_final_results

| Category                                                                                  | Prop._SNPs       | Prop._h2         | Prop._h2_std_error | Enrichment     | Enrichment_std_error | Enrichment_p          | Coefficient        | Coefficient_std_error | Coefficient_z-score |
|-------------------------------------------------------------------------------------------|------------------|------------------|--------------------|----------------|----------------------|-----------------------|--------------------|-----------------------|---------------------|
| adhd_with_meta_analysis_deg_genes_fdr0.05_window100000.results                            | 0.00870166355234 | 0.0255842592481  | 0.00657035062018   | 2.94015725778  | 0.755068336147       | 0.010888320497495454  | 1.58162412662e-07  | 6.24872734529e-08     | 2.53111400006       |
| age_of_initiation_with_meta_analysis_deg_genes_fdr0.05_window100000.results               | 0.00870166355234 | 0.00852239296764 | 0.00573337652427   | 0.979398125011 | 0.658882808993       | 0.97504144727272768   | -1.69118995272e-09 | 4.42773376733e-09     | -0.381953848535     |
| alcohol_dependence_with_meta_analysis_deg_genes_fdr0.05_window100000.results              | 0.00870166355234 | -0.0113700059354 | 0.0129930101884    | -1.3066473861  | 1.49316393471        | 0.10549005764853481   | -3.40616927178e-08 | 2.32045860688e-08     | -1.46788624528      |
| alzheimers_disease_with_meta_analysis_deg_genes_fdr0.05_window100000.results              | 0.00870166355234 | 0.0186995406635  | 0.0152086748395    | 2.14896158085  | 1.74778934488        | 0.51104650620035352   | -1.11801656058e-09 | 1.99593257386e-08     | -0.0560147459502    |
| amyotrophic_lateral_sclerosis_with_meta_analysis_deg_genes_fdr0.05_window100000.results   | 0.00870166355234 | 0.0299201123831  | 0.023644626653     | 3.43843590402  | 2.71725360453        | 0.35574426658909342   | 8.74784811041e-09  | 2.11194911505e-08     | 0.414207333315      |
| anorexia_with_meta_analysis_deg_genes_fdr0.05_window100000.results                        | 0.00870166355234 | 0.0101780761051  | 0.00450954485894   | 1.16967014915  | 0.518239395469       | 0.74379771697718977   | 4.67766272693e-09  | 1.93243217994e-08     | 0.242060900014      |
| autism_with_meta_analysis_deg_genes_fdr0.05_window100000.results                          | 0.00870166355234 | 0.0125350176942  | 0.00686898242557   | 1.44053118336  | 0.789387268797       | 0.57691947688725809   | 1.16727349486e-08  | 2.54015554614e-08     | 0.459528353149      |
| bipolar_with_meta_analysis_deg_genes_fdr0.05_window100000.results                         | 0.00870166355234 | 0.0159776877082  | 0.00478182037281   | 1.83616473012  | 0.549529448484       | 0.12769024041357641   | 6.17345393874e-08  | 6.22356140949e-08     | 0.991948746472      |
| brain_volume_mean_accumbens_with_meta_analysis_deg_genes_fdr0.05_window100000.results     | 0.00870166355234 | 0.0267181245146  | 0.0316329927592    | 3.07046168286  | 3.63528106654        | 0.55843363391535727   | 3.36531395958e-08  | 6.5659671409e-08      | 0.512538958444      |
| brain_volume_mean_amygdala_with_meta_analysis_deg_genes_fdr0.05_window100000.results      | 0.00870166355234 | -12.4289582327   | 480.050450446      | -1428.34277124 | 55167.6639253        | 0.39104708595636217   | 7.02958403975e-08  | 7.40856474903e-08     | 0.948845596668      |
| brain_volume_mean_caudate_with_meta_analysis_deg_genes_fdr0.05_window100000.results       | 0.00870166355234 | 0.030681178022   | 0.0128432647291    | 3.52589798921  | 1.47595510351        | 0.067885192929666535  | 5.25309930191e-08  | 4.75170454492e-08     | 1.10551892532       |
| brain_volume_mean_hippocampus_with_meta_analysis_deg_genes_fdr0.05_window100000.results   | 0.00870166355234 | 0.0172497389492  | 0.0244419908054    | 1.98234956402  | 2.80888713501        | 0.72405545896062196   | 2.01356344781e-08  | 6.82737039435e-08     | 0.294925180781      |
| brain_volume_mean_pallidum_with_meta_analysis_deg_genes_fdr0.05_window100000.results      | 0.00870166355234 | 0.0618915110152  | 0.0286407132045    | 7.1126067611   | 3.29140664107        | 0.04513575183616262   | 1.37412145539e-07  | 7.34355250045e-08     | 1.87119443254       |
| brain_volume_mean_putamen_with_meta_analysis_deg_genes_fdr0.05_window100000.results       | 0.00870166355234 | 0.0331245894108  | 0.0112278867955    | 3.80669618074  | 1.29031497575        | 0.022836830909067775  | 1.37986525472e-07  | 7.07127128666e-08     | 1.95136800553       |
| brain_volume_mean_thalamus_with_meta_analysis_deg_genes_fdr0.05_window100000.results      | 0.00870166355234 | 0.0353614401703  | 0.0209399508546    | 4.06375631023  | 2.4064307622         | 0.20150793472635628   | 9.01729858465e-08  | 6.84154155757e-08     | 1.31802145887       |
| brain_volume_total_intracranial_with_meta_analysis_deg_genes_fdr0.05_window100000.results | 0.00870166355234 | 0.0224669869814  | 0.0197274175583    | 2.58191860053  | 2.2670857635         | 0.46032037386542457   | 1.04647501501e-08  | 6.69703352281e-08     | 0.156259485852      |
| cannabis_use_disorder_with_meta_analysis_deg_genes_fdr0.05_window100000.results           | 0.00870166355234 | 0.0358067828576  | 0.0334324804442    | 4.11493533876  | 3.84207918901        | 0.39372587821911198   | 6.39060267738e-08  | 8.77059722888e-08     | 0.728639397136      |
| childhood_intelligence_with_meta_analysis_deg_genes_fdr0.05_window100000.results          | 0.00870166355234 | 0.0269144089793  | 0.0173519437005    | 3.09301880238  | 1.99409499071        | 0.29002641992119332   | 5.26512368144e-08  | 5.47292073895e-08     | 0.962031780208      |
| cigarettes_per_day_with_meta_analysis_deg_genes_fdr0.05_window100000.results              | 0.00870166355234 | 0.00882226075015 | 0.00267171493721   | 1.01385909684  | 0.307034961894       | 0.96391921548052362   | -4.13177312802e-09 | 3.59131423511e-09     | -1.15049056071      |
| college_completion_with_meta_analysis_deg_genes_fdr0.05_window100000.results              | 0.00870166355234 | 0.0157432250945  | 0.00879440272481   | 1.80922015656  | 1.01065763712        | 0.42424939702119346   | 9.12932957909e-09  | 1.51344302921e-08     | 0.603215938946      |
| cotinine_levels_with_meta_analysis_deg_genes_fdr0.05_window100000.results                 | 0.00870166355234 | -0.0267032307046 | 0.356074748577     | -3.06875007796 | 40.9203075291        | 0.69927949961895353   | -8.64478505763e-08 | 1.61641777396e-07     | -0.534811309113     |
| cross_disorder_with_meta_analysis_deg_genes_fdr0.05_window100000.results                  | 0.00870166355234 | 0.0197167738156  | 0.00482340161274   | 2.26586257869  | 0.554307987631       | 0.02140110266598633   | 2.50744761433e-08  | 1.41426584899e-08     | 1.77296766101       |
| depressive_symptoms_with_meta_analysis_deg_genes_fdr0.05_window100000.results             | 0.00870166355234 | 0.00385336846433 | 0.00522366848795   | 0.442831240389 | 0.600306878855       | 0.35780241802341906   | -3.52406640977e-09 | 4.23176860374e-09     | -0.832764439591     |
| drinks_per_week_with_meta_analysis_deg_genes_fdr0.05_window100000.results                 | 0.00870166355234 | 0.0173733080875  | 0.00545523036202   | 1.9965501979   | 0.62691809781        | 0.11298475028019651   | 5.21972816647e-09  | 4.58279546495e-09     | 1.13898344502       |
| ftnd_with_meta_analysis_deg_genes_fdr0.05_window100000.results                            | 0.00870166355234 | 0.00244859955802 | 0.0100871388038    | 0.281394418814 | 1.15921958406        | 0.54022940840461409   | -1.30495111341e-08 | 1.45968630697e-08     | -0.893994214498     |
| heaviness_smoking_index_with_meta_analysis_deg_genes_fdr0.05_window100000.results         | 0.00870166355234 | 0.0313835268977  | 0.0223231049784    | 3.60661231142  | 2.56538360097        | 0.2605824506068018    | 2.29322410408e-08  | 2.72719250825e-08     | 0.840873571317      |
| insomnia_jansen_with_meta_analysis_deg_genes_fdr0.05_window100000.results                 | 0.00870166355234 | 0.0126445868593  | 0.00559884651045   | 1.45312293255  | 0.643422545215       | 0.48102074330057976   | 1.50538939024e-09  | 4.45788985373e-09     | 0.337691024146      |
| insomnia_lane_with_meta_analysis_deg_genes_fdr0.05_window100000.results                   | 0.00870166355234 | 0.0113274708255  | 0.0036983994818    | 1.30175922769  | 0.425022118995       | 0.47872315194756998   | 9.94958164423e-10  | 3.88225611801e-09     | 0.256283494488      |
| intelligence_with_meta_analysis_deg_genes_fdr0.05_window100000.results                    | 0.00870166355234 | 0.0112892274711  | 0.00702356691955   | 1.2973642802   | 0.807152204553       | 0.71302031779648145   | 3.78349530597e-09  | 2.31080260849e-08     | 0.163730787393      |
| lifetime_cannabis_use_with_meta_analysis_deg_genes_fdr0.05_window100000.results           | 0.00870166355234 | 0.00410613525034 | 0.00420221000728   | 0.471879339582 | 0.482920303917       | 0.2716715467211599    | -6.4355988417e-09  | 4.91153543718e-09     | -1.31030284196      |
| long_sleep_duration_dashti_with_meta_analysis_deg_genes_fdr0.05_window100000.results      | 0.00870166355234 | 0.01609664238    | 0.00455575901374   | 1.84983506696  | 0.523550351761       | 0.096333946836897319  | 1.89714493871e-09  | 1.76466390009e-09     | 1.07507437457       |
| major_depressive_disorder_with_meta_analysis_deg_genes_fdr0.05_window100000.results       | 0.00870166355234 | 0.0122596562669  | 0.00350094434651   | 1.40888649546  | 0.402330465371       | 0.31006190385531446   | 4.03657690862e-09  | 4.01296434185e-09     | 1.00588407092       |
| neo_conscientiousness_with_meta_analysis_deg_genes_fdr0.05_window100000.results           | 0.00870166355234 | 0.0705666311747  | 0.0378682327354    | 8.10955637968  | 4.35183830168        | 0.014310877788372605  | 1.03267772153e-07  | 4.13380737103e-08     | 2.49812734082       |
| neo_openness_with_meta_analysis_deg_genes_fdr0.05_window100000.results                    | 0.00870166355234 | 0.00241864634677 | 0.0223771975123    | 0.27795217917  | 2.57159994497        | 0.77992005390694774   | -8.42759725349e-09 | 4.2579406784e-08      | -0.197926600909     |
| neuroticism_with_meta_analysis_deg_genes_fdr0.05_window100000.results                     | 0.00870166355234 | 0.00465984994444 | 0.00286150349493   | 0.535512539231 | 0.328845568174       | 0.14653116793940507   | -6.95472797336e-09 | 4.30821564615e-09     | -1.61429430293      |
| opioid_addiction_144_with_meta_analysis_deg_genes_fdr0.05_window100000.results            | 0.00870166355234 | 0.0105155912136  | 0.011137231661     | 1.20845757255  | 1.27989683742        | 0.87124902622824241   | 5.37327683463e-11  | 2.60328988903e-09     | 0.020640332286      |
| opioid_addiction_gsem_with_meta_analysis_deg_genes_fdr0.05_window100000.results           | 0.00870166355234 | 0.0193503254118  | 0.0075047450931    | 2.22375012495  | 0.862449467043       | 0.14913595563902779   | 1.28774046578e-08  | 1.12164744292e-08     | 1.14807952705       |
| parkinsons_with_meta_analysis_deg_genes_fdr0.05_window100000.results                      | 0.00870166355234 | 0.00709567742566 | 0.0216633006012    | 0.815439183897 | 2.48955851613        | 0.9407982041464189    | -8.18387504095e-08 | 1.91987792714e-07     | -0.426270593836     |
| ptsd_with_meta_analysis_deg_genes_fdr0.05_window100000.results                            | 0.00870166355234 | 0.00318593490283 | 0.0135908916632    | 0.366129405448 | 1.56187280529        | 0.67962597620851795   | -4.21241955683e-09 | 1.00373220291e-08     | -0.419675641032     |
| schizophrenia_with_meta_analysis_deg_genes_fdr0.05_window100000.results                   | 0.00870166355234 | 0.00927002423178 | 0.00215060532504   | 1.0653163244   | 0.247148756339       | 0.79184147178278197   | -2.62662874262e-08 | 3.67305877862e-08     | -0.715106645694     |
| short_sleep_duration_dashti_with_meta_analysis_deg_genes_fdr0.05_window100000.results     | 0.00870166355234 | 0.0147399460566  | 0.00485126584814   | 1.69392277326  | 0.557510161012       | 0.21500012220386022   | 3.19179037301e-09  | 3.88403412989e-09     | 0.821771968594      |
| sleep_duration_dashti_with_meta_analysis_deg_genes_fdr0.05_window100000.results           | 0.00870166355234 | 0.0175284667724  | 0.00452470849018   | 2.01438112     | 0.519982008379       | 0.051951234785041059  | 7.77581504945e-09  | 5.42096313407e-09     | 1.43439733072       |
| sleep_duration_jansen_with_meta_analysis_deg_genes_fdr0.05_window100000.results           | 0.00870166355234 | 0.0161025017011  | 0.0042959261384    | 1.85050842338  | 0.493690213665       | 0.088935493980230262  | 6.29221011282e-09  | 5.18271670187e-09     | 1.2140756431        |
| smoking_cessation_with_meta_analysis_deg_genes_fdr0.05_window100000.results               | 0.00870166355234 | 0.0250847443565  | 0.00549737442477   | 2.88275272948  | 0.631761316868       | 0.0026814229978564964 | 8.22986600201e-09  | 3.30570220558e-09     | 2.4895969117        |
| smoking_initiation_with_meta_analysis_deg_genes_fdr0.05_window100000.results              | 0.00870166355234 | 0.0123411035138  | 0.00385444756888   | 1.41824645823  | 0.442955251683       | 0.34520790564350401   | 3.30344574513e-09  | 4.57539921232e-09     | 0.722001642225      |
| subjective_wellbeing_with_meta_analysis_deg_genes_fdr0.05_window100000.results            | 0.00870166355234 | 0.00552113099052 | 0.00413708640266   | 0.634491434576 | 0.475436263166       | 0.4458039789443945    | -2.03451727232e-09 | 1.91619673433e-09     | -1.06174759401      |
| years_of_education_with_meta_analysis_deg_genes_fdr0.05_window100000.results              | 0.00870166355234 | 0.0134958598266  | 0.00189801692418   | 1.55095169394  | 0.218121157266       | 0.011974623946558864  | 8.43490564125e-09  | 4.59925395533e-09     | 1.83397257972       |


## fdr0.10_window100000_final_results

| Category                                                                                  | Prop._SNPs      | Prop._h2           | Prop._h2_std_error | Enrichment       | Enrichment_std_error | Enrichment_p          | Coefficient        | Coefficient_std_error | Coefficient_z-score |
|-------------------------------------------------------------------------------------------|-----------------|--------------------|--------------------|------------------|----------------------|-----------------------|--------------------|-----------------------|---------------------|
| adhd_with_meta_analysis_deg_genes_fdr0.10_window100000.results                            | 0.0235019733579 | 0.0318551174922    | 0.00766241339458   | 1.35542309606    | 0.326032766607       | 0.27762915094615515   | 2.91765508929e-08  | 2.74711937659e-08     | 1.06207801312       |
| age_of_initiation_with_meta_analysis_deg_genes_fdr0.10_window100000.results               | 0.0235019733579 | 0.025052529551     | 0.00928418473288   | 1.06597557446    | 0.39503851832        | 0.86729497427276281   | -6.10123021093e-10 | 2.63015730117e-09     | -0.231972065253     |
| alcohol_dependence_with_meta_analysis_deg_genes_fdr0.10_window100000.results              | 0.0235019733579 | -0.000497586197079 | 0.0209129735571    | -0.0211721028486 | 0.889839046221       | 0.24270659086880342   | -1.4499484047e-08  | 1.47077641792e-08     | -0.985838763141     |
| alzheimers_disease_with_meta_analysis_deg_genes_fdr0.10_window100000.results              | 0.0235019733579 | 0.0404742623042    | 0.0205822188447    | 1.72216441947    | 0.875765559399       | 0.40951306648821562   | -5.5368507959e-09  | 9.77709076447e-09     | -0.566308621786     |
| amyotrophic_lateral_sclerosis_with_meta_analysis_deg_genes_fdr0.10_window100000.results   | 0.0235019733579 | 0.0295629598465    | 0.0356802333351    | 1.25789266273    | 1.51818031583        | 0.86481047528682342   | -6.6861336738e-09  | 1.25922442845e-08     | -0.530972360663     |
| anorexia_with_meta_analysis_deg_genes_fdr0.10_window100000.results                        | 0.0235019733579 | 0.0303146775314    | 0.00754723399557   | 1.28987796343    | 0.321131927122       | 0.36952132526941572   | 1.05993303891e-08  | 1.23437842885e-08     | 0.858677545014      |
| autism_with_meta_analysis_deg_genes_fdr0.10_window100000.results                          | 0.0235019733579 | 0.0396037489417    | 0.00938373852413   | 1.68512440801    | 0.399274494156       | 0.088413684373080184  | 2.12268975135e-08  | 1.30171681637e-08     | 1.63068474238       |
| bipolar_with_meta_analysis_deg_genes_fdr0.10_window100000.results                         | 0.0235019733579 | 0.03155003084      | 0.00619308507269   | 1.3424417754     | 0.263513407083       | 0.19444137571484557   | 9.00450641274e-09  | 3.01355287189e-08     | 0.298800346154      |
| brain_volume_mean_accumbens_with_meta_analysis_deg_genes_fdr0.10_window100000.results     | 0.0235019733579 | 0.037305353313     | 0.0490361527884    | 1.58732855088    | 2.08646959307        | 0.77440176167903862   | 5.64124701666e-09  | 3.8995301822e-08      | 0.144664786605      |
| brain_volume_mean_amygdala_with_meta_analysis_deg_genes_fdr0.10_window100000.results      | 0.0235019733579 | 0.107754834966     | 52.9731284671      | 4.58492711762    | 2253.98640618        | 0.99487935825594043   | 5.42886640797e-09  | 3.62610810718e-08     | 0.149716066028      |
| brain_volume_mean_caudate_with_meta_analysis_deg_genes_fdr0.10_window100000.results       | 0.0235019733579 | 0.0969001339914    | 0.0287363470951    | 4.12306373239    | 1.22272060552        | 0.0071337912452482952 | 7.91187696614e-08  | 3.84124737807e-08     | 2.05971555264       |
| brain_volume_mean_hippocampus_with_meta_analysis_deg_genes_fdr0.10_window100000.results   | 0.0235019733579 | 0.0103406959784    | 0.0366169092383    | 0.439992668739   | 1.55803551816        | 0.71578586749287587   | -1.9337864561e-08  | 3.80180144465e-08     | -0.50865003979      |
| brain_volume_mean_pallidum_with_meta_analysis_deg_genes_fdr0.10_window100000.results      | 0.0235019733579 | 0.104600197932     | 0.0399634255256    | 4.45069851535    | 1.70042850943        | 0.029592036154114224  | 7.39479770128e-08  | 3.85590468295e-08     | 1.9177853991        |
| brain_volume_mean_putamen_with_meta_analysis_deg_genes_fdr0.10_window100000.results       | 0.0235019733579 | 0.0488804678261    | 0.0158251651787    | 2.07984525732    | 0.673354740803       | 0.086752669029321189  | 4.1457730765e-08   | 3.6843406482e-08      | 1.12524152144       |
| brain_volume_mean_thalamus_with_meta_analysis_deg_genes_fdr0.10_window100000.results      | 0.0235019733579 | 0.0589704924027    | 0.0292140673601    | 2.50917195355    | 1.24304742054        | 0.22348482441572115   | 3.89606318979e-08  | 3.60782163249e-08     | 1.07989351655       |
| brain_volume_total_intracranial_with_meta_analysis_deg_genes_fdr0.10_window100000.results | 0.0235019733579 | 0.0302331307405    | 0.0288105513058    | 1.28640817859    | 1.22587796638        | 0.80735661098468159   | -2.54991866379e-08 | 3.80128976819e-08     | -0.670803548082     |
| cannabis_use_disorder_with_meta_analysis_deg_genes_fdr0.10_window100000.results           | 0.0235019733579 | 0.096072976466     | 0.0577597829014    | 4.08786849526    | 2.45765672618        | 0.14075985139800018   | 7.04545220168e-08  | 5.1493505088e-08      | 1.36822152418       |
| childhood_intelligence_with_meta_analysis_deg_genes_fdr0.10_window100000.results          | 0.0235019733579 | 0.0599643377625    | 0.0235432483297    | 2.55145969445    | 1.00175623431        | 0.12126885055377402   | 4.14792607518e-08  | 2.85241119766e-08     | 1.45418236984       |
| cigarettes_per_day_with_meta_analysis_deg_genes_fdr0.10_window100000.results              | 0.0235019733579 | 0.0289905494269    | 0.00582235917887   | 1.23353681776    | 0.247739161738       | 0.3351657751266427    | -5.52960715934e-10 | 2.90607815312e-09     | -0.190277303912     |
| college_completion_with_meta_analysis_deg_genes_fdr0.10_window100000.results              | 0.0235019733579 | 0.0298925716507    | 0.0110595938822    | 1.27191751924    | 0.470581500277       | 0.56495301696106903   | 2.54080724985e-09  | 7.25335751201e-09     | 0.350293949477      |
| cotinine_levels_with_meta_analysis_deg_genes_fdr0.10_window100000.results                 | 0.0235019733579 | -0.0593686374002   | 0.833660063733     | -2.5261128713    | 35.4719176572        | 0.4780019487839352    | -6.39592471541e-08 | 7.75374647499e-08     | -0.824881847252     |
| cross_disorder_with_meta_analysis_deg_genes_fdr0.10_window100000.results                  | 0.0235019733579 | 0.0354514890529    | 0.00834091287368   | 1.50844733389    | 0.354902660584       | 0.1515223867377706    | 7.48572202e-09     | 9.16680031746e-09     | 0.816612314086      |
| depressive_symptoms_with_meta_analysis_deg_genes_fdr0.10_window100000.results             | 0.0235019733579 | 0.0027896383329    | 0.00851415203342   | 0.118698046774   | 0.362273920738       | 0.016739901577230848  | -5.92908018752e-09 | 2.63488861253e-09     | -2.25022043031      |
| drinks_per_week_with_meta_analysis_deg_genes_fdr0.10_window100000.results                 | 0.0235019733579 | 0.0398396450619    | 0.00690985656583   | 1.69516169793    | 0.294011760656       | 0.019571434476542655  | 3.60646788007e-09  | 2.14914928328e-09     | 1.67809091165       |
| ftnd_with_meta_analysis_deg_genes_fdr0.10_window100000.results                            | 0.0235019733579 | 0.0259541107133    | 0.0169714940567    | 1.10433750894    | 0.722130597219       | 0.88410559523537158   | -1.56565426635e-09 | 9.02702823143e-09     | -0.17344071894      |
| heaviness_smoking_index_with_meta_analysis_deg_genes_fdr0.10_window100000.results         | 0.0235019733579 | 0.0404016637869    | 0.0315164776434    | 1.71907538025    | 1.34101409969        | 0.56941903102791502   | 3.78576120838e-09  | 1.52804033628e-08     | 0.247752701189      |
| insomnia_jansen_with_meta_analysis_deg_genes_fdr0.10_window100000.results                 | 0.0235019733579 | 0.0384662127229    | 0.00822291112551   | 1.63672267589    | 0.349881731219       | 0.070676849027460339  | 3.28417269495e-09  | 2.4745865651e-09      | 1.32716015728       |
| insomnia_lane_with_meta_analysis_deg_genes_fdr0.10_window100000.results                   | 0.0235019733579 | 0.0274587273551    | 0.00476314686948   | 1.16835837302    | 0.20267008208        | 0.40643426368333679   | 3.17315691577e-10  | 1.90321654951e-09     | 0.166726004804      |
| intelligence_with_meta_analysis_deg_genes_fdr0.10_window100000.results                    | 0.0235019733579 | 0.026747271713     | 0.00915518274402   | 1.13808620688    | 0.38954953291        | 0.72304113814713367   | 6.30839507957e-10  | 1.1244744926e-08      | 0.0561008286187     |
| lifetime_cannabis_use_with_meta_analysis_deg_genes_fdr0.10_window100000.results           | 0.0235019733579 | 0.0151580906649    | 0.00801635420692   | 0.64497097474    | 0.341092813138       | 0.29368230946567037   | -4.10428942975e-09 | 3.51610005746e-09     | -1.16728459449      |
| long_sleep_duration_dashti_with_meta_analysis_deg_genes_fdr0.10_window100000.results      | 0.0235019733579 | 0.0410998577354    | 0.0104065097907    | 1.74878326639    | 0.442793021345       | 0.094822666089747862  | 1.94263583651e-09  | 1.56359545423e-09     | 1.24241588913       |
| major_depressive_disorder_with_meta_analysis_deg_genes_fdr0.10_window100000.results       | 0.0235019733579 | 0.0266474285535    | 0.00472985315447   | 1.13383791853    | 0.201253447209       | 0.5054426014355724    | 1.56185655051e-09  | 2.01921316591e-09     | 0.773497606332      |
| neo_conscientiousness_with_meta_analysis_deg_genes_fdr0.10_window100000.results           | 0.0235019733579 | 0.191199463231     | 0.0923204898197    | 8.13546421485    | 3.92820162009        | 0.0065487473360090753 | 1.05378959074e-07  | 3.79625880908e-08     | 2.7758634059        |
| neo_openness_with_meta_analysis_deg_genes_fdr0.10_window100000.results                    | 0.0235019733579 | -0.0402690040514   | 0.0339392626408    | -1.71343075912   | 1.444102677          | 0.029851977188901199  | -4.11971310857e-08 | 2.13237235019e-08     | -1.93198580361      |
| neuroticism_with_meta_analysis_deg_genes_fdr0.10_window100000.results                     | 0.0235019733579 | 0.0203983314152    | 0.00546109195136   | 0.867941219429   | 0.232367379037       | 0.56485779269466119   | -1.79401922086e-09 | 3.16895958793e-09     | -0.566122467352     |
| opioid_addiction_144_with_meta_analysis_deg_genes_fdr0.10_window100000.results            | 0.0235019733579 | 0.0497879452604    | 0.0196320492697    | 2.11845807593    | 0.835336206486       | 0.18506441122330691   | 2.12478348547e-09  | 1.73209007289e-09     | 1.22671650784       |
| opioid_addiction_gsem_with_meta_analysis_deg_genes_fdr0.10_window100000.results           | 0.0235019733579 | 0.0287271577149    | 0.00991415419262   | 1.2223296009     | 0.421843478488       | 0.59366869326507166   | 1.02928599286e-09  | 5.68032184259e-09     | 0.181202055338      |
| parkinsons_with_meta_analysis_deg_genes_fdr0.10_window100000.results                      | 0.0235019733579 | 0.0484728452976    | 0.0442486414162    | 2.06250107425    | 1.8827628107         | 0.57481340565007755   | 2.07524565868e-08  | 1.49914822022e-07     | 0.13842831754       |
| ptsd_with_meta_analysis_deg_genes_fdr0.10_window100000.results                            | 0.0235019733579 | 0.0222675444771    | 0.02247054064      | 0.947475522076   | 0.956112931362       | 0.95606940455335865   | 3.24883974243e-11  | 6.2567410316e-09      | 0.00519254309236    |
| schizophrenia_with_meta_analysis_deg_genes_fdr0.10_window100000.results                   | 0.0235019733579 | 0.0320347438846    | 0.00407749818819   | 1.36306613052    | 0.173495992277       | 0.038411519832770552  | 2.76211947757e-08  | 2.63627832305e-08     | 1.04773439641       |
| short_sleep_duration_dashti_with_meta_analysis_deg_genes_fdr0.10_window100000.results     | 0.0235019733579 | 0.0388831967692    | 0.012106646039     | 1.65446518797    | 0.515133170081       | 0.2050333120228881    | 3.46706167464e-09  | 3.636766782e-09       | 0.953336268853      |
| sleep_duration_dashti_with_meta_analysis_deg_genes_fdr0.10_window100000.results           | 0.0235019733579 | 0.0452085255736    | 0.0119805059708    | 1.92360551539    | 0.509765958305       | 0.07477291210259239   | 7.87005113402e-09  | 5.49361657973e-09     | 1.43258107292       |
| sleep_duration_jansen_with_meta_analysis_deg_genes_fdr0.10_window100000.results           | 0.0235019733579 | 0.040230098511     | 0.0108342441223    | 1.71177534322    | 0.460992953967       | 0.12426233186812584   | 5.81800822085e-09  | 4.89702952422e-09     | 1.18806884706       |
| smoking_cessation_with_meta_analysis_deg_genes_fdr0.10_window100000.results               | 0.0235019733579 | 0.0429445834425    | 0.00806419324428   | 1.82727564144    | 0.343128345926       | 0.015429063153835835  | 3.21813129221e-09  | 1.90050106857e-09     | 1.69330675232       |
| smoking_initiation_with_meta_analysis_deg_genes_fdr0.10_window100000.results              | 0.0235019733579 | 0.0260994151404    | 0.00481810422193   | 1.11052015688    | 0.20500849646        | 0.58935135712504594   | 5.65573628823e-10  | 2.17780073932e-09     | 0.259699438342      |
| subjective_wellbeing_with_meta_analysis_deg_genes_fdr0.10_window100000.results            | 0.0235019733579 | 0.030702222292     | 0.0106354178114    | 1.30636784514    | 0.452532970293       | 0.49620905833319662   | 7.33866582491e-10  | 1.81175741758e-09     | 0.405057860048      |
| years_of_education_with_meta_analysis_deg_genes_fdr0.10_window100000.results              | 0.0235019733579 | 0.0294987394144    | 0.00254148101557   | 1.25516010784    | 0.108139047597       | 0.019124108434994555  | 3.19824863693e-09  | 2.23486200656e-09     | 1.43107208747       |