# MiXeR
**Author:** Jesse Marks <br>
**GitHub Issue:** NA

This document details the steps to apply the [MiXeR](https://github.com/precimed/mixer) software tool to our FTND meta-analysis results. MiXeR is a statistical tool that quantifies the polygenic overlap between two complex traits, irrespective of their genetic correlation, using only GWAS summary statistics. It should help us better understand the cross-trait genetic architecture by accounting for polygenicity. We are applying this method to address the following reviewer comment on the [FTND GWAS manuscript](https://www.biorxiv.org/content/10.1101/2020.01.15.898858v1):

> The authors could do more to describe the genetic overlap in terms of pleiotropy, e.g., pairwise GWAS (Pickrell et al) or MiXeR (Frei et al) to identify loci that are unique for a specific trait vs. loci that are shared across traits.

The MiXeR software is described in [Bivariate causal mixture model quantifies polygenic overlap between complex traits beyond genetic correlation](https://www.nature.com/articles/s41467-019-10310-0.pdf). The output of the tool is a Venn diagram. Applied to several phenotype pairs, we might expect to see a collection of diagrams such as the following, which was taken from the MiXeR manuscript:

![image.png](attachment:image.png)

<br><br>
We will apply MiXeR to our FTND data, paired with each of the following phenotypes:
* CPD (GSCAN)
* Smoking cessation (GSCAN)
* Age of smoking initiation (GSCAN)
* Smoking initiation (ever vs. never, GSCAN)
* Alcohol dependence
* Posttraumatic stress disorder
* Attention deficit hyperactivity disorder
* Depressive symptoms
* Major depressive disorder
* Bipolar disorder
* Schizophrenia

## Data Preparation
MiXeR recognizes summary statistics in LDSC format as described [here](https://github.com/bulik/ldsc/wiki/Summary-Statistics-File-Format). In brief, each trait must be represented as a single table containing the columns SNP, N, Z, A1, and A2. Thus, it is possible to use the `munge_sumstats.py` script as described [here](https://github.com/bulik/ldsc/wiki/Partitioned-Heritability#step-1-download-the-data). This is convenient because we are already familiar with LDSC. We will simply run the LDSC pipeline, which formats the data the way MiXeR needs it.
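
Before handing a file to MiXeR, it can be worth confirming that its header really contains these five columns. The following is a minimal sketch and not part of MiXeR or LDSC: `check_sumstats_header` is a hypothetical helper name, and it assumes a whitespace-delimited file (plain or gzipped) whose first line is the header.

```shell
# Hypothetical helper: report which of the required columns are missing
# from the header row of a plain-text or gzipped summary-statistics file.
check_sumstats_header() {
    local file header missing col
    file=$1
    missing=""
    # gzip -cdf decompresses gzipped input and passes plain text through unchanged
    header=$(gzip -cdf "$file" | head -n 1)
    for col in SNP N Z A1 A2; do
        # put one column name per line, then require an exact whole-line match
        if ! printf '%s\n' "$header" | tr -s ' \t' '\n' | grep -qx "$col"; then
            missing="$missing $col"
        fi
    done
    if [ -n "$missing" ]; then
        echo "missing:$missing"
        return 1
    fi
    echo "ok"
}
```

Calling `check_sumstats_header some_trait.sumstats.gz` prints `ok` when all five columns are present and otherwise lists the missing names.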

### Convert summary stats to LDSC format
**LDSC workflow guidelines**
1. Create Excel phenotype file locally then upload to EC2 instance
2. Clone https://github.com/RTIInternational/ld-regression-pipeline
3. Then edit full_ld_regression_wf_template.json to include the reference data of choice
4. Use dockerized tool to finish filling out the json file that will be input for workflow
5. Run the WDL workflow for LDSC


#### Use ld-regression pipeline
`623adf4d-b5c4-4ab1-a614-43a264c12f43`

In [None]:
cd /shared/jmarks/projects/nicotine/mixer/data_processing

procD=/shared/jmarks/projects/nicotine/mixer/data_processing/001
phen=ldsc_local_phenotype_columns.xlsx

# enter compute node and use screen tool

# clone github repo
cd $procD
git clone https://github.com/RTIInternational/ld-regression-pipeline
    
# edit file-input json
cd ld-regression-pipeline
mkdir workflow_inputs
cp json_input/full_ld_regression_wf_template.json workflow_inputs
cd workflow_inputs

## vim edit file (see README.md at https://github.com/RTIInternational/ld-regression-pipeline)

# create final workflow input (a json file)
docker run -v $procD/ld-regression-pipeline/workflow_inputs:/data/ \
    rticode/generate_ld_regression_input_json:1ddbd682cb1e44dab6d11ee571add34bd1d06e21 \
    --json-input /data/full_ld_regression_wf_template.json \
    --pheno-file /data/$phen >\
        $procD/ld-regression-pipeline/workflow_inputs/final_wf_inputs.json

## zip appropriate files 
# Change to directory immediately above ld-regression-pipeline
cd $procD/ld-regression-pipeline
cd ..
# Make zipped copy of repo somewhere
zip --exclude=*var/* --exclude=*.git/* -r \
    $procD/ld-regression-pipeline/workflow_inputs/ld-regression-pipeline.zip \
    ld-regression-pipeline

## download cromwell and the config file, if necessary
cd /shared/jmarks/bin/cromwell
#aws s3 cp s3://rti-cromwell-output/cromwell-config/cromwell_default_genomics_queue.conf .
#wget https://github.com/broadinstitute/cromwell/releases/download/44/cromwell-44.jar

## run ldsc workflow on AWS EC2 instance
java -Dconfig.file=/shared/jmarks/bin/cromwell/cromwell_default_genomics_queue.conf \
    -jar cromwell-44.jar \
    run $procD/ld-regression-pipeline/workflow/full_ld_regression_wf.wdl \
    -i $procD/ld-regression-pipeline/workflow_inputs/final_wf_inputs.json \
    -p $procD/ld-regression-pipeline/workflow_inputs/ld-regression-pipeline.zip


#java -Dconfig.file=/shared/jmarks/bin/cromwell/cromwell_high_priority_genomics_queue.conf \
#    -jar cromwell-44.jar \
#    run $procD/ld-regression-pipeline/workflow/full_ld_regression_wf.wdl \
#    -i $procD/ld-regression-pipeline/workflow_inputs/final_wf_inputs.json \
#    -p $procD/ld-regression-pipeline/workflow_inputs/ld-regression-pipeline.zip
#

### Convert using python_convert
The software package [python_convert](https://github.com/precimed/python_convert) is a collection of tools created by the makers of MiXeR. We will use these tools instead of the LDSC pipeline because the LDSC pipeline is not currently working.
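
The QC commands in this section pass `--exclude-ranges 6:26000000-34000000` to drop the MHC region. As a rough illustration of what such a filter amounts to, here is a minimal awk sketch. It is not python_convert's implementation: `drop_mhc` is a hypothetical name, the `CHR` and `BP` column names and the tab delimiter are assumptions, and python_convert may treat the range boundaries differently.

```shell
# Hypothetical helper: read a tab-delimited table on stdin and drop rows that
# fall inside chr6:26,000,000-34,000,000 (boundaries treated as inclusive here).
drop_mhc() {
    awk -F'\t' -v OFS='\t' '
        NR == 1 {
            # locate the CHR and BP columns by name (assumed column names)
            for (i = 1; i <= NF; i++) {
                if ($i == "CHR") c = i
                if ($i == "BP")  b = i
            }
            print
            next
        }
        # keep every row that is not on chromosome 6 inside the range
        !($c == 6 && $b >= 26000000 && $b <= 34000000)
    '
}
```

For example, `zcat trait.csv.gz | drop_mhc | head` would show the first rows that survive the filter.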

Note that we already prepared the reference panel.
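
Since the later `fit1`, `fit2`, and `test2` calls refer to the reference panel through templates such as `1000G.EUR.QC.@.run4.ld`, where `@` stands for the chromosome number, a quick check that all 22 per-chromosome files exist can save a failed run. This is a minimal sketch; `missing_chromosomes` is a hypothetical helper name.

```shell
# Hypothetical helper: given a path template in which "@" stands for the
# chromosome number, print the chromosomes (1-22) whose file does not exist.
missing_chromosomes() {
    local template chr path
    template=$1
    for chr in $(seq 1 22); do
        # substitute the chromosome number for the "@" placeholder
        path=$(printf '%s' "$template" | sed "s/@/$chr/g")
        [ -e "$path" ] || echo "$chr"
    done
}
```

For example, `missing_chromosomes /shared/data/1000G_EUR_Phase3_plink/1000G.EUR.QC.@.run4.ld` prints nothing when the LD files for every chromosome are in place.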

In [None]:
### calculate LD info in the genotype reference panel
#for chr in {1..22}; do
#	python3 ~/bin/mixer/precimed/mixer.py ld \
#   --lib ~/bin/mixer/src/build/lib/libbgmg.so \
#   --bfile 1000G_EUR_Phase3_plink/1000G.EUR.QC.$chr \
#   --out 1000G_EUR_Phase3_plink/1000G.EUR.QC.$chr.run4.ld \
#   --r2min 0.05 --ldscore-r2min 0.05 --ld-window-kb 30000
#done

#cut -f1 w_hm3.snplist | tail -n +2 > w_hm3.justrs


In [None]:
cd /shared/jmarks/projects/nicotine/mixer/data_processing/002

aws s3 sync s3://rti-nd/gwas_meta/categorical_ftnd/results/1df/0001/eur/final_stats/ .

### FTND ### 
## combine into one file
zcat ftnd_wave3_meta_analysis_chr1_eur.txt.gz > ftnd_wave3.txt
for chr in {2..22}; do
    zcat ftnd_wave3_meta_analysis_chr${chr}_eur.txt.gz | tail -n +2  >> ftnd_wave3.txt
done

## QC
python3 ~/bin/python_convert/sumstats.py csv --sumstats ftnd_wave3.txt --out ftnd_wave3_eur.csv --force --auto --head 5 --n-val 46213

python3 ~/bin/python_convert/sumstats.py zscore --sumstats ftnd_wave3_eur.csv --out ftnd_wave3_eur_z.csv

python3 ~/bin/python_convert/sumstats.py zscore --sumstats ftnd_wave3_eur.csv | \
python3 ~/bin/python_convert/sumstats.py qc --exclude-ranges 6:26000000-34000000 --max-or 1e37 | \
python3 ~/bin/python_convert/sumstats.py neff --drop --factor 4 --out ftnd_wave3_eur_qc_noMHC.csv --force 
gzip ftnd_wave3_eur_qc_noMHC.csv


### GSCAN Age of Initiation ###
cd /shared/jmarks/projects/nicotine/mixer/data_processing/002/gscan/age_initiation
aws s3 cp s3://rti-nd/ldsc_genetic_correlation/data/gscan/raw/AgeOfInitiation.txt.gz .
    
inf=AgeOfInitiation.txt.gz    
outf=age_of_initiation_liu2019

python3 ~/bin/python_convert/sumstats.py csv --sumstats $inf  \
    --force --auto --head 5 --a1 ALT --a2 REF --chr CHROM --out  ${outf}.csv 

python3 ~/bin/python_convert/sumstats.py zscore --sumstats ${outf}.csv  --out  ${outf}_z.csv

python3 ~/bin/python_convert/sumstats.py qc --sumstats ${outf}_z.csv \
    --exclude-ranges 6:26000000-34000000 --max-or 1e37 | \
python3 ~/bin/python_convert/sumstats.py neff --drop --factor 4 --out ${outf}_mixer_ready.csv --force 
    
gzip ${outf}_mixer_ready.csv

    

### GSCAN CPD ###
cd  /shared/jmarks/projects/nicotine/mixer/data_processing/002/gscan/cpd
aws s3 cp s3://rti-nd/ldsc_genetic_correlation/data/gscan/raw/CigarettesPerDay.txt.gz .
    
inf=CigarettesPerDay.txt.gz
outf=cigarettes_per_day_liu2019
    
python3 ~/bin/python_convert/sumstats.py csv --sumstats $inf  \
    --force --auto --head 5 --a1 ALT --a2 REF --chr CHROM  --out  ${outf}.csv 

python3 ~/bin/python_convert/sumstats.py zscore --sumstats ${outf}.csv  --out  ${outf}_z.csv

python3 ~/bin/python_convert/sumstats.py qc --sumstats ${outf}_z.csv \
    --exclude-ranges 6:26000000-34000000 --max-or 1e37 | \
python3 ~/bin/python_convert/sumstats.py neff --drop --factor 4 --out ${outf}_mixer_ready.csv --force 
    
gzip ${outf}_mixer_ready.csv &


### GSCAN Smoking Cessation ###
cd /shared/jmarks/projects/nicotine/mixer/data_processing/002/gscan/smoking_cessation
aws s3 cp s3://rti-nd/ldsc_genetic_correlation/data/gscan/raw/SmokingCessation.txt.gz .
    
inf=SmokingCessation.txt.gz   
outf=smoking_cessation_liu2019

python3 ~/bin/python_convert/sumstats.py csv --sumstats $inf \
    --a1 ALT --a2 REF  --chr CHROM --out ${outf}.csv --force --auto --head 5 

python3 ~/bin/python_convert/sumstats.py zscore --sumstats ${outf}.csv  --out  ${outf}_z.csv 

# for ldsc format (run in the foreground so the file is complete before head reads it)
awk '{print $1,$2,$3,$5,$6,$10,$4,$7}' ${outf}_z.csv > \
    ${outf}_ldsc_ready.txt
head ${outf}_ldsc_ready.txt

python3 ~/bin/python_convert/sumstats.py qc --sumstats ${outf}_z.csv \
    --exclude-ranges 6:26000000-34000000 --max-or 1e37 | \
python3 ~/bin/python_convert/sumstats.py neff --drop --factor 4 --out ${outf}_mixer_ready.csv --force 
    
gzip ${outf}_mixer_ready.csv &


### GSCAN Smoking Initiation ###
cd /shared/jmarks/projects/nicotine/mixer/data_processing/002/gscan/smoking_initiation
aws s3 cp s3://rti-nd/ldsc_genetic_correlation/data/gscan/raw/SmokingInitiation.txt.gz .
    
inf=SmokingInitiation.txt.gz    
outf=smoking_initiation_liu2019

python3 ~/bin/python_convert/sumstats.py csv --sumstats $inf \
    --a1 ALT --a2 REF  --chr CHROM --out ${outf}.csv --force --auto --head 5 

python3 ~/bin/python_convert/sumstats.py zscore --sumstats ${outf}.csv  --out  ${outf}_z.csv

# for ldsc format (run in the foreground so the file is complete before head reads it)
awk '{print $1,$2,$3,$5,$6,$10,$4,$7}' ${outf}_z.csv > \
    ${outf}_ldsc_ready.txt
head ${outf}_ldsc_ready.txt

python3 ~/bin/python_convert/sumstats.py qc --sumstats ${outf}_z.csv \
    --exclude-ranges 6:26000000-34000000 --max-or 1e37 | \
python3 ~/bin/python_convert/sumstats.py neff --drop --factor 4 --out ${outf}_mixer_ready.csv --force 
    
gzip ${outf}_mixer_ready.csv &


### GSCAN Drinks Per Week ###
cd /shared/jmarks/projects/nicotine/mixer/data_processing/002/gscan/
aws s3 cp s3://rti-nd/ldsc_genetic_correlation/data/gscan/DrinksPerWeek.txt.gz .
    
inf=DrinksPerWeek.txt.gz    
outf=drinks_per_week_liu2019

python3 ~/bin/python_convert/sumstats.py csv --sumstats $inf \
    --a1 ALT --a2 REF  --chr CHROM --out ${outf}.csv --force --auto --head 5 

python3 ~/bin/python_convert/sumstats.py zscore --sumstats ${outf}.csv  --out  ${outf}_z.csv

# for ldsc format (run in the foreground so the file is complete before head reads it)
awk '{print $1,$2,$3,$5,$6,$10,$4,$7}' ${outf}_z.csv > \
    ${outf}_ldsc_ready.txt
head ${outf}_ldsc_ready.txt

python3 ~/bin/python_convert/sumstats.py qc --sumstats ${outf}_z.csv \
    --exclude-ranges 6:26000000-34000000 --max-or 1e37 | \
python3 ~/bin/python_convert/sumstats.py neff --drop --factor 4 --out ${outf}_mixer_ready.csv --force 
    
gzip ${outf}_mixer_ready.csv &


### ADHD ###
cd /shared/jmarks/projects/nicotine/mixer/data_processing/002/adhd
aws s3 cp s3://rti-nd/ldsc_genetic_correlation/data/adhd_demontis2019/0_raw/daner_meta_filtered_NA_iPSYCH23_PGC11_sigPCs_woSEX_2ell6sd_EUR_Neff_70.meta.gz .
    
inf=daner_meta_filtered_NA_iPSYCH23_PGC11_sigPCs_woSEX_2ell6sd_EUR_Neff_70.meta.gz   # full input file name
outf=adhd    # prefix of outfile

python3 ~/bin/python_convert/sumstats.py csv --sumstats $inf --force --auto --head 5 \
     --ncase Nca --ncontrol Nco  --out ${outf}.csv
## remove singletons (script not shown); the steps below read ${outf}_remove_singletons.csv

python3 ~/bin/python_convert/sumstats.py zscore --sumstats  ${outf}_remove_singletons.csv --out ${outf}_remove_singletons_z.csv

python3 ~/bin/python_convert/sumstats.py zscore --sumstats  ${outf}_remove_singletons.csv | \
python3 ~/bin/python_convert/sumstats.py qc --exclude-ranges 6:26000000-34000000 --max-or 1e37 | \
python3 ~/bin/python_convert/sumstats.py neff --drop --factor 4 --out ${outf}_qc_noMHC.csv --force 
    
gzip ${outf}_qc_noMHC.csv



### Alcohol Dependence ###
cd /shared/jmarks/projects/nicotine/mixer/data_processing/002/alc_dep
aws s3 cp s3://rti-nd/ldsc_genetic_correlation/data/alcdep_walters2018/pgc_alcdep_eur_n46568_ldsc_ready.txt.gz .
    
inf=pgc_alcdep.eur_discovery.aug2018_release.txt.gz    
outf=pgc_alcdep

python3 ~/bin/python_convert/sumstats.py csv --sumstats $inf --out ${outf}.csv \
    --force --auto --head 5  --ncase-val 11569 --ncontrol-val 34999

python3 ~/bin/python_convert/sumstats.py zscore --sumstats  ${outf}.csv | \
python3 ~/bin/python_convert/sumstats.py qc --exclude-ranges 6:26000000-34000000 --max-or 1e37 | \
python3 ~/bin/python_convert/sumstats.py neff --drop --factor 4 --out ${outf}_qc_noMHC.csv --force 
    
gzip ${outf}_qc_noMHC.csv



### Bipolar ###
cd /shared/jmarks/projects/nicotine/mixer/data_processing/002/bipolar
aws s3 cp s3://rti-nd/ldsc_genetic_correlation/data/bipolar_stahl2019/raw/daner_PGC_BIP32b_mds7a_0416a.gz .
    
inf=daner_PGC_BIP32b_mds7a_0416a.gz
outf=bipolar_stahl2019

python3 ~/bin/python_convert/sumstats.py csv --sumstats $inf --out ${outf}.csv --force --auto --head 5  \
    --ncase Nca --ncontrol Nco 
## removed singletons with python script

python3 ~/bin/python_convert/sumstats.py zscore --sumstats  ${outf}_remove_singletons.csv | \
python3 ~/bin/python_convert/sumstats.py qc --exclude-ranges 6:26000000-34000000 --max-or 1e37 | \
python3 ~/bin/python_convert/sumstats.py neff --drop --factor 4 --out ${outf}_mixer_ready.csv --force
    
gzip ${outf}_mixer_ready.csv



### Depressive Symptoms ###
cd /shared/jmarks/projects/nicotine/mixer/data_processing/002/depressive_symptoms
aws s3 cp s3://rti-nd/ldsc_genetic_correlation/data/depressive_symptoms_okbay2016/raw/DS_full.txt.gz .
    
inf=DS_full.txt.gz
outf=depressive_symptoms_okbay2016

python3 ~/bin/python_convert/sumstats.py csv --sumstats $inf  \
    --force --auto --head 5 --n-val 161460  --out  ${outf}.csv 

python3 ~/bin/python_convert/sumstats.py zscore --sumstats ${outf}.csv  --out  ${outf}_z.csv

python3 ~/bin/python_convert/sumstats.py qc --sumstats ${outf}_z.csv \
    --exclude-ranges 6:26000000-34000000 --max-or 1e37 | \
python3 ~/bin/python_convert/sumstats.py neff --drop --factor 4 --out ${outf}_mixer_ready.csv --force 
    
gzip ${outf}_mixer_ready.csv


### PTSD ###
cd /shared/jmarks/projects/nicotine/mixer/data_processing/002/ptsd
aws s3 cp s3://rti-nd/ldsc_genetic_correlation/data/ptsd_nievergelt2019/raw/pts_eur_freeze2_overall.results.gz .
    
inf=pts_eur_freeze2_overall.results.gz
outf=ptsd_nievergelt2019

python3 ~/bin/python_convert/sumstats.py csv --sumstats $inf --out ${outf}.csv --force --auto --head 5 \
    --ncase Nca --ncontrol Nco

python3 ~/bin/python_convert/sumstats.py zscore --sumstats  ${outf}.csv | \
python3 ~/bin/python_convert/sumstats.py qc --exclude-ranges 6:26000000-34000000 --max-or 1e37 | \
python3 ~/bin/python_convert/sumstats.py neff --drop --factor 4 --out ${outf}_qc_noMHC.csv --force 

mv ${outf}_qc_noMHC.csv ${outf}_mixer_ready.csv
    
gzip ${outf}_mixer_ready.csv



### Schizophrenia ###
cd /shared/jmarks/projects/nicotine/mixer/data_processing/002/schizophrenia
aws s3 cp s3://rti-nd/ldsc_genetic_correlation/data/schizophrenia_ripke2014/raw/daner_natgen_pgc_eur.gz .
    
inf=daner_natgen_pgc_eur.gz
outf=scz_ripke2014

python3 ~/bin/python_convert/sumstats.py csv --sumstats $inf --out ${outf}.csv \
    --force --auto --head 5  --ncase Nca --ncontrol Nco 

python3 ~/bin/python_convert/sumstats.py zscore --sumstats  ${outf}.csv --out ${outf}_z.csv



python3 ~/bin/python_convert/sumstats.py zscore --sumstats  ${outf}.csv | \
python3 ~/bin/python_convert/sumstats.py qc --exclude-ranges 6:26000000-34000000 --max-or 1e37 | \
python3 ~/bin/python_convert/sumstats.py neff --drop --factor 4 --out ${outf}_mixer_ready.csv --force 
    
gzip ${outf}_mixer_ready.csv


### Major Depressive Disorder ###
cd /shared/jmarks/projects/nicotine/mixer/data_processing/002/mdd
aws s3 cp s3://rti-nd/ldsc_genetic_correlation/data/depressive_symptoms_okbay2016/raw/DS_full.txt.gz .
    
inf=mdd_howard2019_n807553_ldsc_ready.txt.gz
outf=mdd_howard2019

python3 ~/bin/python_convert/sumstats.py csv --sumstats $inf --logodds LOG_ODD \
    --force --auto --head 5 --ncase-val 170756 --ncontrol-val 329443  --out  ${outf}.csv 

python3 ~/bin/python_convert/sumstats.py zscore --sumstats ${outf}.csv  --out  ${outf}_z.csv



python3 ~/bin/python_convert/sumstats.py qc --sumstats ${outf}_z.csv \
    --exclude-ranges 6:26000000-34000000 --max-or 1e37 | \
python3 ~/bin/python_convert/sumstats.py neff --drop --factor 4 --out ${outf}_mixer_ready.csv --force 
    
gzip ${outf}_mixer_ready.csv

aws s3 cp ${outf}_mixer_ready.csv.gz s3://rti-nd/mixer/data/schizophrenia_ripke2014/

## Fit univariate model
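
The `fit1` calls in this section differ only in their input and output paths. A minimal sketch of how the shared arguments could be factored out is given here. `fit1_cmd` is a hypothetical helper, not part of MiXeR. It only prints the command, so it can be reviewed first. It assumes paths without spaces.

```shell
# Shared reference arguments, copied from the fit1 calls in this section.
ref=/shared/data/1000G_EUR_Phase3_plink/1000G.EUR.QC
mixer_py=~/bin/mixer/precimed/mixer.py
libbgmg=~/bin/mixer/src/build/lib/libbgmg.so

# Hypothetical helper: print (not run) the fit1 command for one trait.
#   $1 = MiXeR-ready summary statistics, $2 = output prefix
fit1_cmd() {
    local indata outdata
    indata=$1
    outdata=$2
    echo python3 "$mixer_py" fit1 \
        --trait1-file "$indata" \
        --out "$outdata" \
        --extract /shared/data/w_hm3.justrs \
        --bim-file "$ref.@.bim" \
        --ld-file "$ref.@.run4.ld" \
        --lib "$libbgmg"
}

# Dry run for one trait; pipe the output to bash to execute it.
fit1_cmd \
    /shared/jmarks/projects/nicotine/mixer/data_processing/002/gscan/cpd/cigarettes_per_day_liu2019_mixer_ready.csv.gz \
    /shared/jmarks/projects/nicotine/mixer/results/001/gscan_cpd.fit
```

Printing rather than executing keeps the sketch safe to try on a machine where MiXeR is not installed.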

In [None]:
### FTND ###
indata=/shared/jmarks/projects/nicotine/mixer/data_processing/002/ftnd/ftnd_wave3_eur_quach2020_mixer_ready.csv.gz
outdata=/shared/jmarks/projects/nicotine/mixer/results/001/ftnd_wave3_eur.fit

python3 ~/bin/mixer/precimed/mixer.py fit1 \
      --trait1-file $indata \
      --out $outdata \
      --extract /shared/data/w_hm3.justrs \
      --bim-file /shared/data/1000G_EUR_Phase3_plink/1000G.EUR.QC.@.bim \
      --ld-file /shared/data/1000G_EUR_Phase3_plink/1000G.EUR.QC.@.run4.ld \
      --lib  ~/bin/mixer/src/build/lib/libbgmg.so


### Bipolar ###
## on EC2 ##
indata=/shared/jmarks/projects/nicotine/mixer/data_processing/002/bipolar/bipolar_stahl2019_mixer_ready.csv.gz
outdata=/shared/jmarks/projects/nicotine/mixer/results/001/bipolar_stahl2019.fit

python3 ~/bin/mixer/precimed/mixer.py fit1 \
      --trait1-file $indata \
      --out $outdata \
      --extract /shared/data/w_hm3.justrs \
      --bim-file /shared/data/1000G_EUR_Phase3_plink/1000G.EUR.QC.@.bim \
      --ld-file /shared/data/1000G_EUR_Phase3_plink/1000G.EUR.QC.@.run4.ld \
      --lib  ~/bin/mixer/src/build/lib/libbgmg.so


### Depressive Symptoms ###
## on EC2 ##
indata=/shared/jmarks/projects/nicotine/mixer/data_processing/002/depressive_symptoms/depressive_symptoms_okbay2016_mixer_ready.csv.gz
outdata=/shared/jmarks/projects/nicotine/mixer/results/001/dep_symp_okbay2016.fit

python3 ~/bin/mixer/precimed/mixer.py fit1 \
      --trait1-file $indata \
      --out $outdata \
      --extract /shared/data/w_hm3.justrs \
      --bim-file /shared/data/1000G_EUR_Phase3_plink/1000G.EUR.QC.@.bim \
      --ld-file /shared/data/1000G_EUR_Phase3_plink/1000G.EUR.QC.@.run4.ld \
      --lib  ~/bin/mixer/src/build/lib/libbgmg.so


### Major Depressive Disorder ###
## on EC2 ##
indata=/shared/jmarks/projects/nicotine/mixer/data_processing/002/mdd/mdd_howard2019_mixer_ready.csv.gz
outdata=/shared/jmarks/projects/nicotine/mixer/results/001/mdd_howard2019.fit

python3 ~/bin/mixer/precimed/mixer.py fit1 \
      --trait1-file $indata \
      --out $outdata \
      --extract /shared/data/w_hm3.justrs \
      --bim-file /shared/data/1000G_EUR_Phase3_plink/1000G.EUR.QC.@.bim \
      --ld-file /shared/data/1000G_EUR_Phase3_plink/1000G.EUR.QC.@.run4.ld \
      --lib  ~/bin/mixer/src/build/lib/libbgmg.so


### PTSD ###
## on EC2 ##
indata=/shared/jmarks/projects/nicotine/mixer/data_processing/002/ptsd/ptsd_nievergelt2019_mixer_ready.csv.gz
outdata=/shared/jmarks/projects/nicotine/mixer/results/001/ptsd_nievergelt2019.fit

python3 ~/bin/mixer/precimed/mixer.py fit1 \
      --trait1-file $indata \
      --out $outdata \
      --extract /shared/data/w_hm3.justrs \
      --bim-file /shared/data/1000G_EUR_Phase3_plink/1000G.EUR.QC.@.bim \
      --ld-file /shared/data/1000G_EUR_Phase3_plink/1000G.EUR.QC.@.run4.ld \
      --lib  ~/bin/mixer/src/build/lib/libbgmg.so


### SCZ ###
## on EC2 ##
indata=/shared/jmarks/projects/nicotine/mixer/data_processing/002/schizophrenia/scz_ripke2014_mixer_ready.csv.gz
outdata=/shared/jmarks/projects/nicotine/mixer/results/001/scz_ripke2014.fit

python3 ~/bin/mixer/precimed/mixer.py fit1 \
      --trait1-file $indata \
      --out $outdata \
      --extract /shared/data/w_hm3.justrs \
      --bim-file /shared/data/1000G_EUR_Phase3_plink/1000G.EUR.QC.@.bim \
      --ld-file /shared/data/1000G_EUR_Phase3_plink/1000G.EUR.QC.@.run4.ld \
      --lib  ~/bin/mixer/src/build/lib/libbgmg.so


### ADHD ###
## on EC2 ##
indata=/shared/jmarks/projects/nicotine/mixer/data_processing/002/adhd/adhd_demontis2018_mixer_ready.csv.gz
outdata=/shared/jmarks/projects/nicotine/mixer/results/001/adhd_demontis2018.fit

python3 ~/bin/mixer/precimed/mixer.py fit1 \
      --trait1-file $indata \
      --out $outdata \
      --extract /shared/data/w_hm3.justrs \
      --bim-file /shared/data/1000G_EUR_Phase3_plink/1000G.EUR.QC.@.bim \
      --ld-file /shared/data/1000G_EUR_Phase3_plink/1000G.EUR.QC.@.run4.ld \
      --lib  ~/bin/mixer/src/build/lib/libbgmg.so


### Alc Dep ###
## on EC2 ##
indata=/shared/jmarks/projects/nicotine/mixer/data_processing/002/alc_dep/alcohol_dependence_walters2018_mixer_ready.csv.gz
outdata=/shared/jmarks/projects/nicotine/mixer/results/001/alc_dep_walters2018.fit

python3 ~/bin/mixer/precimed/mixer.py fit1 \
      --trait1-file $indata \
      --out $outdata \
      --extract /shared/data/w_hm3.justrs \
      --bim-file /shared/data/1000G_EUR_Phase3_plink/1000G.EUR.QC.@.bim \
      --ld-file /shared/data/1000G_EUR_Phase3_plink/1000G.EUR.QC.@.run4.ld \
      --lib  ~/bin/mixer/src/build/lib/libbgmg.so


### GSCAN CPD ###
indata=/shared/jmarks/projects/nicotine/mixer/data_processing/002/gscan/cpd/cigarettes_per_day_liu2019_mixer_ready.csv.gz
outdata=/shared/jmarks/projects/nicotine/mixer/results/001/gscan_cpd.fit

python3 ~/bin/mixer/precimed/mixer.py fit1 \
      --trait1-file $indata \
      --out $outdata \
      --extract /shared/data/w_hm3.justrs \
      --bim-file /shared/data/1000G_EUR_Phase3_plink/1000G.EUR.QC.@.bim \
      --ld-file /shared/data/1000G_EUR_Phase3_plink/1000G.EUR.QC.@.run4.ld \
      --lib  ~/bin/mixer/src/build/lib/libbgmg.so


### GSCAN AI ###
indata=/shared/jmarks/projects/nicotine/mixer/data_processing/002/gscan/age_initiation/age_of_initiation_liu2019_mixer_ready.csv.gz
outdata=/shared/jmarks/projects/nicotine/mixer/results/001/gscan_ai.fit

python3 ~/bin/mixer/precimed/mixer.py fit1 \
      --trait1-file $indata \
      --out $outdata \
      --extract /shared/data/w_hm3.justrs \
      --bim-file /shared/data/1000G_EUR_Phase3_plink/1000G.EUR.QC.@.bim \
      --ld-file /shared/data/1000G_EUR_Phase3_plink/1000G.EUR.QC.@.run4.ld \
      --lib  ~/bin/mixer/src/build/lib/libbgmg.so

### si gscan ###
indata=/shared/jmarks/projects/nicotine/mixer/data_processing/002/gscan/smoking_initiation/smoking_initiation_liu2019_mixer_ready.csv.gz
outdata=/shared/jmarks/projects/nicotine/mixer/results/001/gscan_si.fit

python3 ~/bin/mixer/precimed/mixer.py fit1 \
      --trait1-file $indata \
      --out $outdata \
      --extract /shared/data/w_hm3.justrs \
      --bim-file /shared/data/1000G_EUR_Phase3_plink/1000G.EUR.QC.@.bim \
      --ld-file /shared/data/1000G_EUR_Phase3_plink/1000G.EUR.QC.@.run4.ld \
      --lib  ~/bin/mixer/src/build/lib/libbgmg.so

### sc gscan ###
indata=/shared/jmarks/projects/nicotine/mixer/data_processing/002/gscan/smoking_cessation/smoking_cessation_liu2019_mixer_ready.csv.gz
outdata=/shared/jmarks/projects/nicotine/mixer/results/001/gscan_sc.fit

python3 ~/bin/mixer/precimed/mixer.py fit1 \
      --trait1-file $indata \
      --out $outdata \
      --extract /shared/data/w_hm3.justrs \
      --bim-file /shared/data/1000G_EUR_Phase3_plink/1000G.EUR.QC.@.bim \
      --ld-file /shared/data/1000G_EUR_Phase3_plink/1000G.EUR.QC.@.run4.ld \
      --lib  ~/bin/mixer/src/build/lib/libbgmg.so




## Fit bivariate model
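
Every `fit2` call in this section pairs FTND with one other trait and varies only in three values: the output label, the second trait's file, and the prefix of its univariate fit. A table-driven loop is one way to express that. The sketch below is hypothetical (`fit2_cmd` is not part of MiXeR), prints the commands instead of running them, shows only two of the pairs, and assumes paths without spaces.

```shell
rdir=/shared/jmarks/projects/nicotine/mixer/results/001
outd=/shared/jmarks/projects/nicotine/mixer/results/002
ddir=/shared/jmarks/projects/nicotine/mixer/data_processing/002
ref=/shared/data/1000G_EUR_Phase3_plink/1000G.EUR.QC

# Hypothetical helper: print the fit2 command for FTND paired with one trait.
#   $1 = output label, $2 = trait2 file relative to $ddir,
#   $3 = prefix of the trait2 univariate fit in $rdir
fit2_cmd() {
    echo python3 ~/bin/mixer/precimed/mixer.py fit2 \
        --trait1-file "$ddir/ftnd/ftnd_wave3_eur_quach2020_mixer_ready.csv.gz" \
        --trait2-file "$ddir/$2" \
        --trait1-params-file "$rdir/ftnd_wave3_eur.fit.json" \
        --trait2-params-file "$rdir/$3.fit.json" \
        --out "$outd/ftnd_$1.fit" \
        --extract /shared/data/w_hm3.justrs \
        --bim-file "$ref.@.bim" \
        --ld-file "$ref.@.run4.ld" \
        --lib ~/bin/mixer/src/build/lib/libbgmg.so
}

# One row per pair: label, trait2 file, trait2 fit prefix.
while read -r label file prefix; do
    fit2_cmd "$label" "$file" "$prefix"
done <<'EOF'
scz schizophrenia/scz_ripke2014_mixer_ready.csv.gz scz_ripke2014
alcdep alc_dep/alcohol_dependence_walters2018_mixer_ready.csv.gz alc_dep_walters2018
EOF
```

Adding a trait then means adding one row to the table rather than copying a ten-line command.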

In [None]:
rdir=/shared/jmarks/projects/nicotine/mixer/results/001
outd=/shared/jmarks/projects/nicotine/mixer/results/002
ddir=/shared/jmarks/projects/nicotine/mixer/data_processing/002


### FTND and SCZ ###
python3 ~/bin/mixer/precimed/mixer.py fit2 \
      --trait1-file $ddir/ftnd/ftnd_wave3_eur_quach2020_mixer_ready.csv.gz \
      --trait2-file  $ddir/schizophrenia/scz_ripke2014_mixer_ready.csv.gz \
      --trait1-params-file $rdir/ftnd_wave3_eur.fit.json \
      --trait2-params-file $rdir/scz_ripke2014.fit.json \
      --out $outd/ftnd_scz.fit \
      --extract /shared/data/w_hm3.justrs \
      --bim-file /shared/data/1000G_EUR_Phase3_plink/1000G.EUR.QC.@.bim \
      --ld-file /shared/data/1000G_EUR_Phase3_plink/1000G.EUR.QC.@.run4.ld \
      --lib  ~/bin/mixer/src/build/lib/libbgmg.so

### FTND and Alc Dep  ###
python3 ~/bin/mixer/precimed/mixer.py fit2 \
      --trait1-file $ddir/ftnd/ftnd_wave3_eur_quach2020_mixer_ready.csv.gz \
      --trait2-file  $ddir/alc_dep/alcohol_dependence_walters2018_mixer_ready.csv.gz \
      --trait1-params-file $rdir/ftnd_wave3_eur.fit.json \
      --trait2-params-file $rdir/alc_dep_walters2018.fit.json \
      --out $outd/ftnd_alcdep.fit \
      --extract /shared/data/w_hm3.justrs \
      --bim-file /shared/data/1000G_EUR_Phase3_plink/1000G.EUR.QC.@.bim \
      --ld-file /shared/data/1000G_EUR_Phase3_plink/1000G.EUR.QC.@.run4.ld \
      --lib  ~/bin/mixer/src/build/lib/libbgmg.so

### FTND and bipolar ###
python3 ~/bin/mixer/precimed/mixer.py fit2 \
      --trait1-file $ddir/ftnd/ftnd_wave3_eur_quach2020_mixer_ready.csv.gz \
      --trait2-file  $ddir/bipolar/bipolar_stahl2019_mixer_ready.csv.gz \
      --trait1-params-file $rdir/ftnd_wave3_eur.fit.json \
      --trait2-params-file $rdir/bipolar_stahl2019.fit.json  \
      --out $outd/ftnd_bipolar.fit \
      --extract /shared/data/w_hm3.justrs \
      --bim-file /shared/data/1000G_EUR_Phase3_plink/1000G.EUR.QC.@.bim \
      --ld-file /shared/data/1000G_EUR_Phase3_plink/1000G.EUR.QC.@.run4.ld \
      --lib  ~/bin/mixer/src/build/lib/libbgmg.so

### FTND and depressive symptoms  ###
python3 ~/bin/mixer/precimed/mixer.py fit2 \
      --trait1-file $ddir/ftnd/ftnd_wave3_eur_quach2020_mixer_ready.csv.gz \
      --trait2-file  $ddir/depressive_symptoms/depressive_symptoms_okbay2016_mixer_ready.csv.gz \
      --trait1-params-file $rdir/ftnd_wave3_eur.fit.json \
      --trait2-params-file $rdir/dep_symp_okbay2016.fit.json \
      --out $outd/ftnd_depsym.fit \
      --extract /shared/data/w_hm3.justrs \
      --bim-file /shared/data/1000G_EUR_Phase3_plink/1000G.EUR.QC.@.bim \
      --ld-file /shared/data/1000G_EUR_Phase3_plink/1000G.EUR.QC.@.run4.ld \
      --lib  ~/bin/mixer/src/build/lib/libbgmg.so

### FTND and mdd ###
python3 ~/bin/mixer/precimed/mixer.py fit2 \
      --trait1-file $ddir/ftnd/ftnd_wave3_eur_quach2020_mixer_ready.csv.gz \
      --trait2-file  $ddir/mdd/mdd_howard2019_mixer_ready.csv.gz \
      --trait1-params-file $rdir/ftnd_wave3_eur.fit.json \
      --trait2-params-file $rdir/mdd_howard2019.fit.json \
      --out $outd/ftnd_mdd.fit \
      --extract /shared/data/w_hm3.justrs \
      --bim-file /shared/data/1000G_EUR_Phase3_plink/1000G.EUR.QC.@.bim \
      --ld-file /shared/data/1000G_EUR_Phase3_plink/1000G.EUR.QC.@.run4.ld \
      --lib  ~/bin/mixer/src/build/lib/libbgmg.so


### FTND and ptsd ###
python3 ~/bin/mixer/precimed/mixer.py fit2 \
      --trait1-file $ddir/ftnd/ftnd_wave3_eur_quach2020_mixer_ready.csv.gz \
      --trait2-file  $ddir/ptsd/ptsd_nievergelt2019_mixer_ready.csv.gz \
      --trait1-params-file $rdir/ftnd_wave3_eur.fit.json \
      --trait2-params-file $rdir/ptsd_nievergelt2019.fit.json \
      --out $outd/ftnd_ptsd.fit \
      --extract /shared/data/w_hm3.justrs \
      --bim-file /shared/data/1000G_EUR_Phase3_plink/1000G.EUR.QC.@.bim \
      --ld-file /shared/data/1000G_EUR_Phase3_plink/1000G.EUR.QC.@.run4.ld \
      --lib  ~/bin/mixer/src/build/lib/libbgmg.so


### FTND and ADHD ###
python3 ~/bin/mixer/precimed/mixer.py fit2 \
      --trait1-file $ddir/ftnd/ftnd_wave3_eur_quach2020_mixer_ready.csv.gz \
      --trait2-file  $ddir/adhd/adhd_demontis2018_mixer_ready.csv.gz \
      --trait1-params-file $rdir/ftnd_wave3_eur.fit.json \
      --trait2-params-file $rdir/adhd_demontis2018.fit.json \
      --out $outd/ftnd_adhd.fit \
      --extract /shared/data/w_hm3.justrs \
      --bim-file /shared/data/1000G_EUR_Phase3_plink/1000G.EUR.QC.@.bim \
      --ld-file /shared/data/1000G_EUR_Phase3_plink/1000G.EUR.QC.@.run4.ld \
      --lib  ~/bin/mixer/src/build/lib/libbgmg.so


### FTND and cpd gscan ###
python3 ~/bin/mixer/precimed/mixer.py fit2 \
      --trait1-file $ddir/ftnd/ftnd_wave3_eur_quach2020_mixer_ready.csv.gz \
      --trait2-file  $ddir/gscan/cpd/cigarettes_per_day_liu2019_mixer_ready.csv.gz \
      --trait1-params-file $rdir/ftnd_wave3_eur.fit.json \
      --trait2-params-file $rdir/gscan_cpd.fit.json \
      --out $outd/ftnd_gscan_cpd.fit \
      --extract /shared/data/w_hm3.justrs \
      --bim-file /shared/data/1000G_EUR_Phase3_plink/1000G.EUR.QC.@.bim \
      --ld-file /shared/data/1000G_EUR_Phase3_plink/1000G.EUR.QC.@.run4.ld \
      --lib  ~/bin/mixer/src/build/lib/libbgmg.so

### FTND and si gscan ###
python3 ~/bin/mixer/precimed/mixer.py fit2 \
      --trait1-file $ddir/ftnd/ftnd_wave3_eur_quach2020_mixer_ready.csv.gz \
      --trait2-file  $ddir/gscan/smoking_initiation/smoking_initiation_liu2019_mixer_ready.csv.gz \
      --trait1-params-file $rdir/ftnd_wave3_eur.fit.json \
      --trait2-params-file $rdir/gscan_si.fit.json \
      --out $outd/ftnd_gscan_si.fit \
      --extract /shared/data/w_hm3.justrs \
      --bim-file /shared/data/1000G_EUR_Phase3_plink/1000G.EUR.QC.@.bim \
      --ld-file /shared/data/1000G_EUR_Phase3_plink/1000G.EUR.QC.@.run4.ld \
      --lib  ~/bin/mixer/src/build/lib/libbgmg.so

### FTND and sc gscan ###
python3 ~/bin/mixer/precimed/mixer.py fit2 \
      --trait1-file $ddir/ftnd/ftnd_wave3_eur_quach2020_mixer_ready.csv.gz \
      --trait2-file  $ddir/gscan/smoking_cessation/smoking_cessation_liu2019_mixer_ready.csv.gz \
      --trait1-params-file $rdir/ftnd_wave3_eur.fit.json \
      --trait2-params-file $rdir/gscan_sc.fit.json \
      --out $outd/ftnd_gscan_sc.fit \
      --extract /shared/data/w_hm3.justrs \
      --bim-file /shared/data/1000G_EUR_Phase3_plink/1000G.EUR.QC.@.bim \
      --ld-file /shared/data/1000G_EUR_Phase3_plink/1000G.EUR.QC.@.run4.ld \
      --lib  ~/bin/mixer/src/build/lib/libbgmg.so

### FTND and ai gscan ###
python3 ~/bin/mixer/precimed/mixer.py fit2 \
      --trait1-file $ddir/ftnd/ftnd_wave3_eur_quach2020_mixer_ready.csv.gz \
      --trait2-file  $ddir/gscan/age_initiation/age_of_initiation_liu2019_mixer_ready.csv.gz \
      --trait1-params-file $rdir/ftnd_wave3_eur.fit.json \
      --trait2-params-file $rdir/gscan_ai.fit.json \
      --out $outd/ftnd_gscan_ai.fit \
      --extract /shared/data/w_hm3.justrs \
      --bim-file /shared/data/1000G_EUR_Phase3_plink/1000G.EUR.QC.@.bim \
      --ld-file /shared/data/1000G_EUR_Phase3_plink/1000G.EUR.QC.@.run4.ld \
      --lib  ~/bin/mixer/src/build/lib/libbgmg.so

## Apply Model to All SNPs

In [None]:
rdir=/shared/jmarks/projects/nicotine/mixer/results/001
ddir=/shared/jmarks/projects/nicotine/mixer/data_processing/002


### FTND vs Depressive Symptoms ### (done)
python3 ~/bin/mixer/precimed/mixer.py test2 \
      --trait1-file $ddir/ftnd/nicotine_dependence_quach2020_mixer_ready.gz \
      --trait2-file  $ddir/depressive_symptoms/depressive_symptoms_okbay2016_mixer_ready.csv.gz \
      --load-params-file $rdir/ftnd_dep_symp.fit.json \
      --out $rdir/ftnd_vs_dep_symp.test \
      --bim-file /shared/data/1000G_EUR_Phase3_plink/1000G.EUR.QC.@.bim \
      --ld-file /shared/data/1000G_EUR_Phase3_plink/1000G.EUR.QC.@.run4.ld \
      --lib  ~/bin/mixer/src/build/lib/libbgmg.so 


### FTND vs SCZ ###
python3 ~/bin/mixer/precimed/mixer.py test2 \
      --trait1-file $ddir/ftnd/nicotine_dependence_quach2020_mixer_ready.gz \
      --trait2-file  $ddir/schizophrenia/scz_ripke2014_mixer_ready.csv.gz \
      --load-params-file $rdir/ftnd_scz.fit.json \
      --out $rdir/ftnd_vs_scz.test \
      --bim-file /shared/data/1000G_EUR_Phase3_plink/1000G.EUR.QC.@.bim \
      --ld-file /shared/data/1000G_EUR_Phase3_plink/1000G.EUR.QC.@.run4.ld \
      --lib  ~/bin/mixer/src/build/lib/libbgmg.so 


### FTND and Alc Dep  ###
python3 ~/bin/mixer/precimed/mixer.py test2 \
      --trait1-file $ddir/ftnd/nicotine_dependence_quach2020_mixer_ready.gz \
      --trait2-file  $ddir/alc_dep/alcohol_dependence_walters2018_mixer_ready.csv.gz \
      --load-params-file $rdir/ftnd_alcdep.fit.json \
      --threads 4 \
      --out $rdir/ftnd_vs_alcdep.test \
      --bim-file /shared/data/1000G_EUR_Phase3_plink/1000G.EUR.QC.@.bim \
      --ld-file /shared/data/1000G_EUR_Phase3_plink/1000G.EUR.QC.@.run4.ld \
      --lib  ~/bin/mixer/src/build/lib/libbgmg.so 



### FTND and bipolar ###
python3 ~/bin/mixer/precimed/mixer.py test2 \
      --trait1-file $ddir/ftnd/nicotine_dependence_quach2020_mixer_ready.gz \
      --trait2-file  $ddir/bipolar/bipolar_stahl2019_mixer_ready.csv.gz \
      --load-params-file $rdir/ftnd_bipolar.fit.json \
      --out $rdir/ftnd_vs_bipolar.test \
      --bim-file /shared/data/1000G_EUR_Phase3_plink/1000G.EUR.QC.@.bim \
      --ld-file /shared/data/1000G_EUR_Phase3_plink/1000G.EUR.QC.@.run4.ld \
      --lib  ~/bin/mixer/src/build/lib/libbgmg.so 

### FTND and mdd ###
python3 ~/bin/mixer/precimed/mixer.py test2 \
      --trait1-file $ddir/ftnd/nicotine_dependence_quach2020_mixer_ready.gz \
      --trait2-file  $ddir/mdd/mdd_howard2019_mixer_ready.csv.gz \
      --load-params-file $rdir/ftnd_mdd.fit.json \
      --out $rdir/ftnd_vs_mdd.test \
      --bim-file /shared/data/1000G_EUR_Phase3_plink/1000G.EUR.QC.@.bim \
      --ld-file /shared/data/1000G_EUR_Phase3_plink/1000G.EUR.QC.@.run4.ld \
      --lib  ~/bin/mixer/src/build/lib/libbgmg.so 

### FTND and ptsd ###
python3 ~/bin/mixer/precimed/mixer.py test2 \
      --trait1-file $ddir/ftnd/nicotine_dependence_quach2020_mixer_ready.gz \
      --trait2-file  $ddir/ptsd/ptsd_nievergelt2019_mixer_ready.csv.gz \
      --load-params-file $rdir/ftnd_ptsd.fit.json \
      --out $rdir/ftnd_vs_ptsd.test \
      --bim-file /shared/data/1000G_EUR_Phase3_plink/1000G.EUR.QC.@.bim \
      --ld-file /shared/data/1000G_EUR_Phase3_plink/1000G.EUR.QC.@.run4.ld \
      --lib  ~/bin/mixer/src/build/lib/libbgmg.so 

### FTND and ADHD ###
python3 ~/bin/mixer/precimed/mixer.py test2 \
      --trait1-file $ddir/ftnd/nicotine_dependence_quach2020_mixer_ready.gz \
      --trait2-file  $ddir/adhd/adhd_demontis2018_mixer_ready.csv.gz \
      --load-params-file $rdir/ftnd_adhd.fit.json \
      --out $rdir/ftnd_vs_adhd.test \
      --bim-file /shared/data/1000G_EUR_Phase3_plink/1000G.EUR.QC.@.bim \
      --ld-file /shared/data/1000G_EUR_Phase3_plink/1000G.EUR.QC.@.run4.ld \
      --lib  ~/bin/mixer/src/build/lib/libbgmg.so 



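### FTND and cpd gscan ###
### FTND and si gscan ###
### FTND and sc gscan ###
### FTND and ai gscan ###
# Template for the four GSCAN traits: the trait2 file, params file, and output
# prefix below are still blank and must be filled in before running.
python3 ~/bin/mixer/precimed/mixer.py test2 \
      --trait1-file $ddir/ftnd/nicotine_dependence_quach2020_mixer_ready.gz \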
      --trait2-file  $ddir/ \
      --load-params-file $rdir/ \
      --out $rdir/.test \
      --bim-file /shared/data/1000G_EUR_Phase3_plink/1000G.EUR.QC.@.bim \
      --ld-file /shared/data/1000G_EUR_Phase3_plink/1000G.EUR.QC.@.run4.ld \
      --lib  ~/bin/mixer/src/build/lib/libbgmg.so 
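
The `test2` calls above differ only in the second trait's file and the short label used in the file names, so they could be generated from a small table instead of being copied by hand. The following is a minimal dry-run sketch under that assumption: `mixer_test2_cmd` is a hypothetical helper (not part of MiXeR), it only prints the commands, and it uses only the flags and paths that already appear in the cell above.

```shell
#!/usr/bin/env bash
# Dry-run sketch: print (do not run) one `mixer.py test2` command per trait pair.
# mixer_test2_cmd and the pair list are illustrative; paths are copied from the cell above.

rdir=/shared/jmarks/projects/nicotine/mixer/results/001
ddir=/shared/jmarks/projects/nicotine/mixer/data_processing/002
ref=/shared/data/1000G_EUR_Phase3_plink/1000G.EUR.QC.@

# Usage: mixer_test2_cmd TAG TRAIT2_FILE
#   TAG          pair label used in ftnd_TAG.fit.json and ftnd_vs_TAG.test
#   TRAIT2_FILE  second trait's summary statistics, relative to $ddir
mixer_test2_cmd() {
    local tag="$1" trait2="$2"
    echo "python3 ~/bin/mixer/precimed/mixer.py test2" \
         "--trait1-file $ddir/ftnd/nicotine_dependence_quach2020_mixer_ready.gz" \
         "--trait2-file $ddir/$trait2" \
         "--load-params-file $rdir/ftnd_${tag}.fit.json" \
         "--out $rdir/ftnd_vs_${tag}.test" \
         "--bim-file ${ref}.bim" \
         "--ld-file ${ref}.run4.ld" \
         "--lib ~/bin/mixer/src/build/lib/libbgmg.so"
}

# One "TAG TRAIT2_FILE" pair per line; extend with the remaining traits as needed.
while read -r tag trait2; do
    mixer_test2_cmd "$tag" "$trait2"
done <<'EOF'
dep_symp depressive_symptoms/depressive_symptoms_okbay2016_mixer_ready.csv.gz
scz schizophrenia/scz_ripke2014_mixer_ready.csv.gz
mdd mdd/mdd_howard2019_mixer_ready.csv.gz
EOF
```

Printing the commands rather than running them makes it easy to review each line first; piping the output to `bash` would then execute them one after another.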

# Visualization

## Bivariate (cross-trait)

In [None]:
rdir=/shared/jmarks/projects/nicotine/mixer/results/001
ddir=/shared/jmarks/projects/nicotine/mixer/data_processing/002


### ADHD vs FTND ###
python3 ~/bin/mixer/precimed/mixer_figures.py two \
    --trait1 ND \
    --trait2 "ADHD" \
    --json $rdir/ftnd_vs_adhd.test.json \
    --out $rdir/figures/ftnd_vs_adhd.test 

### Alc Dep vs FTND ###
python3 ~/bin/mixer/precimed/mixer_figures.py two \
    --trait1 ND \
    --trait2 "Alcohol Dependence" \
    --json $rdir/ftnd_vs_alcdep.test.json \
    --out $rdir/figures/ftnd_vs_alcdep.test 

### Bipolar vs FTND ###
python3 ~/bin/mixer/precimed/mixer_figures.py two \
    --trait1 ND \
    --trait2 "Bipolar" \
    --json $rdir/ftnd_vs_bipolar.test.json \
    --out $rdir/figures/ftnd_vs_bipolar.test 

### DepSym vs FTND ###
python3 ~/bin/mixer/precimed/mixer_figures.py two \
    --trait1 ND \
    --trait2 "Depressive Symptoms" \
    --json $rdir/ftnd_vs_dep_symp.test.json \
    --out $rdir/figures/ftnd_vs_dep_symp.test 

### MDD vs FTND ###
python3 ~/bin/mixer/precimed/mixer_figures.py two \
    --trait1 ND \
    --trait2 "MDD" \
    --json $rdir/ftnd_vs_mdd.test.json \
    --out $rdir/figures/ftnd_vs_mdd.test 

### PTSD vs FTND ###
python3 ~/bin/mixer/precimed/mixer_figures.py two \
    --trait1 ND \
    --trait2 "PTSD" \
    --json $rdir/ftnd_vs_ptsd.test.json \
    --out $rdir/figures/ftnd_vs_ptsd.test 

### SCZ vs FTND ###
python3 ~/bin/mixer/precimed/mixer_figures.py two \
    --trait1 ND \
    --trait2 "SCZ" \
    --json $rdir/ftnd_vs_scz.test.json \
    --out $rdir/figures/ftnd_vs_scz.test

### cpd gscan FTND ###
### si gscan  FTND ###
### sc gscan  FTND ###
### ai gscan  FTND ###
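
Each figure command reads the `.test.json` file written by the matching `test2` run, so it can help to check which of those files exist before plotting. The sketch below assumes the `ftnd_vs_TAG.test.json` naming used in the `--json` arguments above; `list_missing_json` is a hypothetical helper, not part of MiXeR.

```shell
#!/usr/bin/env bash
# Sketch: report which test2 result files are still missing before plotting.
# list_missing_json is a hypothetical helper; the ftnd_vs_TAG.test.json naming
# follows the --json arguments in the cell above.

# Usage: list_missing_json DIR TAG...
# Prints every TAG for which DIR/ftnd_vs_TAG.test.json does not exist.
list_missing_json() {
    local dir="$1" tag
    shift
    for tag in "$@"; do
        [ -f "$dir/ftnd_vs_${tag}.test.json" ] || echo "$tag"
    done
}

rdir=/shared/jmarks/projects/nicotine/mixer/results/001
list_missing_json "$rdir" adhd alcdep bipolar dep_symp mdd ptsd scz
```

An empty result means every listed pair has a result file ready for `mixer_figures.py two`. If `$rdir/figures` does not exist yet, creating it first with `mkdir -p` avoids relying on the plotting script to do so.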

# Sandbox
Create LDSC and MiXeR files for our other in-house data.

In [None]:
### AUDIT T ###
cd /shared/jmarks/projects/nicotine/ldsc/data_processing
aws s3 sync s3://rti-nd/ldsc_genetic_correlation/data/audit_ukb/ .

inf=audit_t_ldsc_cromwell_ready.txt.gz
outf=audit_t_sanchezroige2018

# 1. Map the input columns onto the standard python_convert column names
python3 ~/bin/python_convert/sumstats.py csv --sumstats $inf  \
    --force --auto --head 5 --out  ${outf}.csv \
    --snp rsid --a1 a_1 --a2 a_0 --chr chr --bp pos --beta beta_T --pval p_T

# 2. Add z-scores
python3 ~/bin/python_convert/sumstats.py zscore --sumstats ${outf}.csv  --out  ${outf}_z.csv

# 3. QC (drop the chr6:26-34 Mb MHC region), then compute effective sample size
python3 ~/bin/python_convert/sumstats.py qc --sumstats ${outf}_z.csv \
    --exclude-ranges 6:26000000-34000000 --max-or 1e37 | \
python3 ~/bin/python_convert/sumstats.py neff --drop --factor 4 --out ${outf}_mixer_ready.csv --force

gzip ${outf}_mixer_ready.csv


### AUDIT P ###
inf=audit_p_ldsc_cromwell_ready.txt.gz    
outf=audit_p_sanchezroige2018

python3 ~/bin/python_convert/sumstats.py csv --sumstats $inf  \
    --force --auto --head 5 --out  ${outf}.csv \
    --snp rsid --a1 a_1 --a2 a_0 --chr chr --bp pos --beta beta_P --pval p_P  

python3 ~/bin/python_convert/sumstats.py zscore --sumstats ${outf}.csv  --out  ${outf}_z.csv

python3 ~/bin/python_convert/sumstats.py qc --sumstats ${outf}_z.csv \
    --exclude-ranges 6:26000000-34000000 --max-or 1e37 | \
python3 ~/bin/python_convert/sumstats.py neff --drop --factor 4 --out ${outf}_mixer_ready.csv --force 
    
gzip ${outf}_mixer_ready.csv



### AUDIT C ###
inf=audit_c_ldsc_cromwell_ready.txt.gz    
outf=audit_c_sanchezroige2018

python3 ~/bin/python_convert/sumstats.py csv --sumstats $inf  \
    --force --auto --head 5 --out  ${outf}.csv \
    --snp rsid --a1 a_1 --a2 a_0 --chr chr --bp pos --beta beta_C --pval p_C  

python3 ~/bin/python_convert/sumstats.py zscore --sumstats ${outf}.csv  --out  ${outf}_z.csv

python3 ~/bin/python_convert/sumstats.py qc --sumstats ${outf}_z.csv \
    --exclude-ranges 6:26000000-34000000 --max-or 1e37 | \
python3 ~/bin/python_convert/sumstats.py neff --drop --factor 4 --out ${outf}_mixer_ready.csv --force 
    
gzip ${outf}_mixer_ready.csv



### Anorexia ###
inf=pgc_anorexia.tsv    
outf=anorexia_watson2019

python3 ~/bin/python_convert/sumstats.py csv --sumstats $inf  \
    --force --auto --head 5 --out  ${outf}.csv \
    --snp ID --a1 ALT --a2 REF --chr CHROM --bp POS --beta BETA --pval PVAL --ncase NCAS --ncontrol NCON 

python3 ~/bin/python_convert/sumstats.py zscore --sumstats ${outf}.csv  --out  ${outf}_z.csv

python3 ~/bin/python_convert/sumstats.py qc --sumstats ${outf}_z.csv \
    --exclude-ranges 6:26000000-34000000 --max-or 1e37 | \
python3 ~/bin/python_convert/sumstats.py neff --drop --factor 4 --out ${outf}_mixer_ready.csv --force 
    
gzip ${outf}_mixer_ready.csv
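
The three AUDIT blocks above are identical except for the subscale suffix (T, P, or C), so the conversion steps could be written once and parameterized. This is a dry-run sketch under that assumption: `audit_convert_cmds` is a hypothetical helper that only prints the `python_convert` commands, using the same subcommands, flags, and file names as the cells above.

```shell
#!/usr/bin/env bash
# Dry-run sketch: print the python_convert steps for one AUDIT subscale.
# audit_convert_cmds is illustrative; file names and flags are copied from the cells above.

# Usage: audit_convert_cmds SUFFIX   (SUFFIX is T, P, or C)
audit_convert_cmds() {
    local s="$1" lc inf outf
    lc=$(printf '%s' "$s" | tr '[:upper:]' '[:lower:]')   # file names use lowercase
    inf="audit_${lc}_ldsc_cromwell_ready.txt.gz"
    outf="audit_${lc}_sanchezroige2018"
    cat <<EOF
python3 ~/bin/python_convert/sumstats.py csv --sumstats $inf --force --auto --head 5 --out ${outf}.csv --snp rsid --a1 a_1 --a2 a_0 --chr chr --bp pos --beta beta_$s --pval p_$s
python3 ~/bin/python_convert/sumstats.py zscore --sumstats ${outf}.csv --out ${outf}_z.csv
python3 ~/bin/python_convert/sumstats.py qc --sumstats ${outf}_z.csv --exclude-ranges 6:26000000-34000000 --max-or 1e37 | python3 ~/bin/python_convert/sumstats.py neff --drop --factor 4 --out ${outf}_mixer_ready.csv --force
gzip ${outf}_mixer_ready.csv
EOF
}

# Print the four steps for each subscale; pipe to `bash` to actually run them.
for s in T P C; do
    audit_convert_cmds "$s"
done
```

The anorexia block is left out of the loop on purpose, since its column names and case/control arguments differ from the AUDIT files.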