# SUMMARY STATISTICS
Compare different AAM phenotypes with different **psycho-social** variables such as personality traits, socio-demographics, life history.

In [1]:
%load_ext autoreload
%autoreload 2

from imagen_post_hoc_helper import *

Fetchers from the nilearn.datasets module will be updated in version 0.9 to return python strings instead of bytes and Pandas dataframes instead of Numpy arrays.


In [2]:
INSTRUMENTS_DIR = '/ritter/share/data/IMAGEN/posthoc/'
LABELS = ["Binge","Binge_growth",
          "AUDIT","AUDIT_quick","AUDIT_growth",
          "Combined_seo","Combined_ours",
          "Onset","Frequency","Amount"]

In [3]:
# df_all_labels.to_csv(INSTRUMENTS_DIR+'/all_labels.csv') # chmod to me @jihoon #todo


Guide: (right) different data types, and (left) which univariate statistical method to use for each type <br>
<p float="left">
  <img src="figures/datatypes.png" width="650" />
  <img src="figures/guide_correlations_to_use.png" width="500" /> 
</p>

Common variable data types:
1. Numerical Variables
    1. Integer Variables.
    2. Floating Point Variables.
2. Categorical Variables.
    1. Boolean Variables (dichotomous).
    2. Ordinal Variables.
    3. Nominal Variables.

<div class="alert alert-block alert-warning">
To do / bugfix:

1. Debug: The number of subjects seem to not match the `n` used in the experiments. For instance for `Binge` in the FU3 experiment `n=630` but here it is `n=699`. 
2. Automatically determine the type of input & output data (continous/ordinal/categorical) and choose the appropriate correlation function using the diagram above. 
3. Perform multiple comparison correction for p-values
4. To claim significant differences between 2 combinations, additionally perform significance test for the *difference between the 2 groups*
</div>

## Psychological variables

### NEO

In [4]:
show_corr_with_instrument(instrument='NEO', session='FU3', corr_type='pearsonr',
                          use_only_lbls = ["Onset","Frequency","Amount","Binge","AUDIT_quick","AUDIT"], # , 'Onset'
                          val_type='raw')

Unnamed: 0,Onset,Frequency,Amount,Binge,AUDIT_quick,AUDIT
Openness,-0.12,0.14,-0.07,0.11,0.06,0.08
Conscientiousness,0.1,-0.1,-0.11,-0.13,-0.16,-0.22
Extroversion,-0.09,0.15,0.1,0.16,0.2,0.14
Agreeableness,0.02,0.07,-0.15,0.01,-0.03,-0.09
Neuroticism,-0.02,-0.04,0.02,0.0,-0.06,0.06

Unnamed: 0,Onset,Frequency,Amount,Binge,AUDIT_quick,AUDIT
Openness,0.0001,0.0,0.0239,0.0001,0.0488,0.0039
Conscientiousness,0.0017,0.0006,0.0002,0.0,0.0,0.0
Extroversion,0.0024,0.0,0.001,0.0,0.0,0.0
Agreeableness,0.5982,0.0181,0.0,0.8502,0.276,0.0018
Neuroticism,0.5381,0.1401,0.4713,0.9174,0.054,0.0363


In [5]:
show_corr_with_instrument(instrument = 'NEO', session = 'FU3',
          use_only_lbls = ["Onset","Frequency","Amount","Binge","AUDIT_quick","AUDIT"], # , 'Onset'
          val_type='categorize')

Unnamed: 0,Onset,Frequency,Amount,Binge,AUDIT_quick,AUDIT
Openness,0.11,0.15,-0.06,0.12,0.06,0.1
Conscientiousness,-0.07,-0.11,-0.1,-0.13,-0.16,-0.2
Extroversion,0.07,0.13,0.09,0.17,0.18,0.16
Agreeableness,-0.01,0.03,-0.13,0.01,-0.03,-0.05
Neuroticism,0.0,-0.03,0.03,-0.01,-0.04,0.04

Unnamed: 0,Onset,Frequency,Amount,Binge,AUDIT_quick,AUDIT
Openness,0.0001,0.0,0.0415,0.0,0.0505,0.0004
Conscientiousness,0.0134,0.0001,0.0004,0.0,0.0,0.0
Extroversion,0.024,0.0,0.0022,0.0,0.0,0.0
Agreeableness,0.6457,0.2771,0.0,0.7517,0.3573,0.0779
Neuroticism,0.8851,0.2386,0.3407,0.6088,0.2141,0.1858


In [6]:
for corr_type in ['pearsonr','kendalltau', 'chi2']: 
    show_corr_with_instrument(instrument='NEO', session='FU3', corr_type=corr_type,
              use_only_lbls = ["Onset","Frequency","Amount","Binge","AUDIT_quick","AUDIT"], # , 'Onset'
              val_type='binarize')

Unnamed: 0,Onset,Frequency,Amount,Binge,AUDIT_quick,AUDIT
Openness,0.13,0.17,-0.08,0.14,0.06,0.12
Conscientiousness,-0.09,-0.13,-0.12,-0.15,-0.2,-0.24
Extroversion,0.08,0.15,0.12,0.2,0.23,0.18
Agreeableness,-0.02,0.03,-0.16,0.01,-0.04,-0.06
Neuroticism,0.01,-0.04,0.03,-0.01,-0.05,0.05

Unnamed: 0,Onset,Frequency,Amount,Binge,AUDIT_quick,AUDIT
Openness,0.0002,0.0,0.0244,0.0,0.0986,0.0006
Conscientiousness,0.0085,0.0001,0.0013,0.0,0.0,0.0
Extroversion,0.0131,0.0,0.0019,0.0,0.0,0.0
Agreeableness,0.6191,0.3018,0.0,0.6856,0.2818,0.0601
Neuroticism,0.8619,0.2278,0.4609,0.6945,0.1878,0.1584


Unnamed: 0,Onset,Frequency,Amount,Binge,AUDIT_quick,AUDIT
Openness,0.11,0.14,-0.07,0.12,0.05,0.1
Conscientiousness,-0.07,-0.11,-0.09,-0.12,-0.15,-0.19
Extroversion,0.08,0.13,0.09,0.17,0.19,0.15
Agreeableness,-0.01,0.03,-0.13,0.0,-0.04,-0.06
Neuroticism,0.01,-0.04,0.02,-0.02,-0.04,0.03

Unnamed: 0,Onset,Frequency,Amount,Binge,AUDIT_quick,AUDIT
Openness,0.0002,0.0,0.0202,0.0001,0.1073,0.0003
Conscientiousness,0.0105,0.0001,0.0055,0.0,0.0,0.0
Extroversion,0.0067,0.0,0.0026,0.0,0.0,0.0
Agreeableness,0.7784,0.3082,0.0,0.8993,0.2219,0.0486
Neuroticism,0.7797,0.2011,0.4533,0.4898,0.1829,0.2264


Unnamed: 0,Onset,Frequency,Amount,Binge,AUDIT_quick,AUDIT
Openness,63.07,52.83,31.98,54.19,39.45,53.97
Conscientiousness,36.12,51.58,46.85,53.8,61.96,92.34
Extroversion,27.15,61.28,46.98,60.81,60.39,58.54
Agreeableness,39.85,31.19,37.28,28.85,22.64,28.85
Neuroticism,37.59,47.33,39.63,31.89,37.26,50.59

Unnamed: 0,Onset,Frequency,Amount,Binge,AUDIT_quick,AUDIT
Openness,0.0035,0.0348,0.6147,0.0338,0.2775,0.0212
Conscientiousness,0.5569,0.0562,0.1286,0.0286,0.0062,0.0
Extroversion,0.7917,0.0054,0.0684,0.0044,0.0018,0.0102
Agreeableness,0.1915,0.6058,0.2024,0.6737,0.8894,0.6742
Neuroticism,0.8349,0.4182,0.7348,0.9296,0.8173,0.3336


### SURPS

In [7]:
show_corr_with_instrument(instrument='SURPS', session='FU3',
          use_only_lbls = ["Onset","Frequency","Amount","Binge","AUDIT_quick","AUDIT"], # , 'Onset'
          val_type='raw')

Unnamed: 0,Onset,Frequency,Amount,Binge,AUDIT_quick,AUDIT
Anxiety Sensitivity,-0.02,-0.03,0.01,0.04,-0.03,0.04
Hopelessness,-0.01,0.03,-0.06,0.01,-0.01,-0.08
Impulsivity,-0.13,0.04,0.15,0.08,0.15,0.24
Sensation seeking,-0.19,0.21,0.15,0.22,0.27,0.26

Unnamed: 0,Onset,Frequency,Amount,Binge,AUDIT_quick,AUDIT
Anxiety Sensitivity,0.4139,0.2915,0.8261,0.2029,0.2553,0.1906
Hopelessness,0.7527,0.3524,0.0411,0.7006,0.7766,0.0073
Impulsivity,0.0,0.1475,0.0,0.0096,0.0,0.0
Sensation seeking,0.0,0.0,0.0,0.0,0.0,0.0


In [8]:
show_corr_with_instrument(instrument='SURPS', session='FU3',
          use_only_lbls = ["Onset","Frequency","Amount","Binge","AUDIT_quick","AUDIT"], # , 'Onset'
          val_type='categorize')

Unnamed: 0,Onset,Frequency,Amount,Binge,AUDIT_quick,AUDIT
Anxiety Sensitivity,-0.0,-0.05,0.01,0.01,-0.03,0.02
Hopelessness,0.03,0.01,-0.06,0.03,-0.02,-0.05
Impulsivity,0.1,0.06,0.13,0.08,0.15,0.19
Sensation seeking,0.13,0.24,0.13,0.24,0.24,0.28

Unnamed: 0,Onset,Frequency,Amount,Binge,AUDIT_quick,AUDIT
Anxiety Sensitivity,0.9446,0.1168,0.7527,0.6503,0.2339,0.4472
Hopelessness,0.2941,0.6278,0.0298,0.3085,0.5695,0.0995
Impulsivity,0.0011,0.0444,0.0,0.0075,0.0,0.0
Sensation seeking,0.0,0.0,0.0,0.0,0.0,0.0


In [9]:
for corr_type in ['pearsonr','kendalltau','chi2']:
    show_corr_with_instrument(instrument='SURPS', session='FU3', corr_type=corr_type,
              use_only_lbls = ["Onset","Frequency","Amount","Binge","AUDIT_quick","AUDIT"], # , 'Onset'
              val_type='binarize')

Unnamed: 0,Onset,Frequency,Amount,Binge,AUDIT_quick,AUDIT
Anxiety Sensitivity,0.0,-0.05,0.01,0.03,-0.05,0.02
Hopelessness,0.04,0.02,-0.07,0.04,-0.02,-0.06
Impulsivity,0.12,0.07,0.17,0.09,0.18,0.23
Sensation seeking,0.16,0.27,0.17,0.29,0.3,0.31

Unnamed: 0,Onset,Frequency,Amount,Binge,AUDIT_quick,AUDIT
Anxiety Sensitivity,0.9164,0.1153,0.7974,0.4794,0.1741,0.476
Hopelessness,0.2603,0.6483,0.0494,0.2852,0.5648,0.1037
Impulsivity,0.0005,0.0387,0.0,0.013,0.0,0.0
Sensation seeking,0.0,0.0,0.0,0.0,0.0,0.0


Unnamed: 0,Onset,Frequency,Amount,Binge,AUDIT_quick,AUDIT
Anxiety Sensitivity,0.01,-0.04,-0.01,0.01,-0.05,0.01
Hopelessness,0.05,0.01,-0.08,0.03,-0.03,-0.05
Impulsivity,0.11,0.06,0.13,0.07,0.16,0.19
Sensation seeking,0.13,0.23,0.15,0.24,0.25,0.26

Unnamed: 0,Onset,Frequency,Amount,Binge,AUDIT_quick,AUDIT
Anxiety Sensitivity,0.8604,0.1772,0.7303,0.6909,0.0994,0.7037
Hopelessness,0.0782,0.6719,0.0166,0.3161,0.4039,0.0747
Impulsivity,0.0002,0.0389,0.0001,0.0158,0.0,0.0
Sensation seeking,0.0,0.0,0.0,0.0,0.0,0.0


Unnamed: 0,Onset,Frequency,Amount,Binge,AUDIT_quick,AUDIT
Anxiety Sensitivity,13.61,18.2,27.73,20.01,26.61,23.26
Hopelessness,26.76,14.0,33.51,19.62,27.03,31.99
Impulsivity,34.04,11.98,42.17,17.62,40.55,58.23
Sensation seeking,36.38,85.47,35.08,77.27,84.87,109.06

Unnamed: 0,Onset,Frequency,Amount,Binge,AUDIT_quick,AUDIT
Anxiety Sensitivity,0.5553,0.2525,0.0233,0.1299,0.0321,0.0788
Hopelessness,0.0442,0.5987,0.004,0.1872,0.0411,0.01
Impulsivity,0.002,0.608,0.0001,0.2247,0.0001,0.0
Sensation seeking,0.0063,0.0,0.0061,0.0,0.0,0.0


<div class="alert alert-block alert-success">
Insights:

1. The starkest differences between the label groups :
    
    1. `Impulsivity` is lowest in `binge`(9) compared to `AUDIT_quick`(18) or `Amount` (17) or `AUDIT` (23) 
    2. `Amount` seems to be different from all others with higher `Agreeableness` (-16->1-6), lower `Sens. seeking` (17->27-31)
    
2. The process of binarizing makes the correlation with certain psychological characteristics more pronounced. Example:
    
    1. Sens. Seeking for BINGE (20->29), AUDIT (26->31), FREQ (21->27)
    2. Extroversion for AUDIT (14->18)
    3. Conscientiousness for BINGE (16->20), AUDIT_quick (16->20)
    4. Openness for FREQ (14->17)
</div>

### TCI

In [10]:
for corr_type in ['pearsonr','kendalltau','chi2']:
    show_corr_with_instrument(instrument='TCI', session='FU3', corr_type=corr_type,
              use_only_lbls = ["Onset","Frequency","Amount","Binge","AUDIT_quick","AUDIT"], # , 'Onset'
              val_type='binarize')

Unnamed: 0,Onset,Frequency,Amount,Binge,AUDIT_quick,AUDIT
Exploratory excitability vs. Stoic rigidity,0.17,0.17,0.05,0.2,0.19,0.18
Impulsiveness vs. Reflection,0.13,0.16,0.06,0.18,0.2,0.17
Extravagance vs. Reserve,0.17,0.09,0.13,0.15,0.17,0.15
Disorderliness vs. Regimentation,0.13,0.24,0.18,0.23,0.28,0.26
Total Novelty Seeking score,0.24,0.25,0.16,0.3,0.33,0.3

Unnamed: 0,Onset,Frequency,Amount,Binge,AUDIT_quick,AUDIT
Exploratory excitability vs. Stoic rigidity,0.0,0.0,0.1734,0.0,0.0,0.0
Impulsiveness vs. Reflection,0.0001,0.0,0.0906,0.0,0.0,0.0
Extravagance vs. Reserve,0.0,0.0056,0.0007,0.0,0.0,0.0
Disorderliness vs. Regimentation,0.0001,0.0,0.0,0.0,0.0,0.0
Total Novelty Seeking score,0.0,0.0,0.0,0.0,0.0,0.0


Unnamed: 0,Onset,Frequency,Amount,Binge,AUDIT_quick,AUDIT
Exploratory excitability vs. Stoic rigidity,0.14,0.14,0.04,0.17,0.15,0.15
Impulsiveness vs. Reflection,0.11,0.11,0.05,0.14,0.16,0.13
Extravagance vs. Reserve,0.14,0.07,0.1,0.11,0.14,0.11
Disorderliness vs. Regimentation,0.11,0.2,0.15,0.19,0.23,0.21
Total Novelty Seeking score,0.2,0.2,0.13,0.24,0.26,0.23

Unnamed: 0,Onset,Frequency,Amount,Binge,AUDIT_quick,AUDIT
Exploratory excitability vs. Stoic rigidity,0.0,0.0,0.2135,0.0,0.0,0.0
Impulsiveness vs. Reflection,0.0002,0.0001,0.1308,0.0,0.0,0.0
Extravagance vs. Reserve,0.0,0.0126,0.0016,0.0002,0.0,0.0001
Disorderliness vs. Regimentation,0.0003,0.0,0.0,0.0,0.0,0.0
Total Novelty Seeking score,0.0,0.0,0.0,0.0,0.0,0.0


Unnamed: 0,Onset,Frequency,Amount,Binge,AUDIT_quick,AUDIT
Exploratory excitability vs. Stoic rigidity,41.09,44.1,28.42,58.95,55.53,49.04
Impulsiveness vs. Reflection,44.25,47.37,29.89,57.06,58.55,55.15
Extravagance vs. Reserve,52.82,31.79,39.45,41.96,43.75,40.31
Disorderliness vs. Regimentation,30.49,70.38,41.39,60.3,73.85,74.03
Total Novelty Seeking score,108.28,101.63,55.26,112.94,122.59,125.24

Unnamed: 0,Onset,Frequency,Amount,Binge,AUDIT_quick,AUDIT
Exploratory excitability vs. Stoic rigidity,0.008,0.0051,0.1622,0.0001,0.0002,0.0008
Impulsiveness vs. Reflection,0.0102,0.002,0.2286,0.0003,0.0002,0.0005
Extravagance vs. Reserve,0.0004,0.1324,0.0245,0.0063,0.0082,0.0271
Disorderliness vs. Regimentation,0.0825,0.0,0.0107,0.0,0.0,0.0
Total Novelty Seeking score,0.0001,0.0005,0.5778,0.0,0.0,0.0


### PANAS

In [11]:
for corr_type in ['pearsonr','kendalltau','chi2']:
    show_corr_with_instrument(instrument='PANAS', session='FU3', corr_type=corr_type,
              use_only_lbls = ["Onset","Frequency","Amount","Binge","AUDIT_quick","AUDIT"], # , 'Onset'
              val_type='binarize')

Unnamed: 0,Onset,Frequency,Amount,Binge,AUDIT_quick,AUDIT
Positive Affect Score,0.03,0.17,0.23,0.12,0.19,0.15
Negative Affect Score,0.06,-0.18,-0.2,-0.12,-0.19,-0.15

Unnamed: 0,Onset,Frequency,Amount,Binge,AUDIT_quick,AUDIT
Positive Affect Score,0.3363,0.0,0.0,0.0008,0.0,0.0
Negative Affect Score,0.0879,0.0,0.0,0.001,0.0,0.0


Unnamed: 0,Onset,Frequency,Amount,Binge,AUDIT_quick,AUDIT
Positive Affect Score,0.03,0.14,0.18,0.09,0.16,0.11
Negative Affect Score,0.04,-0.16,-0.18,-0.1,-0.16,-0.13

Unnamed: 0,Onset,Frequency,Amount,Binge,AUDIT_quick,AUDIT
Positive Affect Score,0.2928,0.0,0.0,0.0013,0.0,0.0001
Negative Affect Score,0.1275,0.0,0.0,0.0006,0.0,0.0


Unnamed: 0,Onset,Frequency,Amount,Binge,AUDIT_quick,AUDIT
Positive Affect Score,27.19,55.93,75.89,43.51,63.5,54.23
Negative Affect Score,20.81,61.54,59.07,43.11,61.8,58.79

Unnamed: 0,Onset,Frequency,Amount,Binge,AUDIT_quick,AUDIT
Positive Affect Score,0.9229,0.0386,0.0003,0.3243,0.0104,0.0532
Negative Affect Score,0.8938,0.0006,0.0008,0.0445,0.0004,0.0013


## Demographics
- sex,  handedness, height, weight, ~age~
- site (location), ~ethnicity~
- ~puberty maturity index~

In [12]:
# first generate a file with all this information
BMI_FILE = {
    'BL':'/ritter/share/data/IMAGEN/IMAGEN_RAW/2.7/BL/psytools/IMAGEN-IMGN_NI_DATA_RC5-BASIC_DIGEST.csv',
    'FU2':'/ritter/share/data/IMAGEN/IMAGEN_RAW/2.7/FU2/psytools/IMAGEN-IMGN_NI_DATA_FU2-IMAGEN_DIGEST.csv',
    'FU3':'/ritter/share/data/IMAGEN/IMAGEN_RAW/2.7/FU3/psytools/IMAGEN-IMGN_NI_DATA_FU3-IMAGEN_DIGEST.csv',
        }


dataset = Imagen(exclude_holdout=False)

df = pd.DataFrame()
for tp in dataset.df.columns.get_level_values(0).unique():
    # get 'sex', 'site', 'handedness', 'gender' information
    dfi = dataset.df[tp][['sex', 'site', 'handedness', 'gender']]
    # get 'BMI'
    df_bmi = pd.read_csv(BMI_FILE[tp], usecols=["User code", 'ni_height', 'ni_mass'], dtype={"User code":str})
    df_bmi["ID"] = df_bmi["User code"].str.replace("-I", "").astype(int)
    df_bmi = df_bmi.set_index("ID")
    dfi["Height"] = df_bmi['ni_height'].apply(lambda x:x*0.01 if x>0 else np.NaN)
    dfi["Weight"] = df_bmi['ni_mass'].apply(lambda x:x if x>0 else np.NaN)
    
    dfi.columns = map(str.title, dfi.columns)
    # add session information
    dfi['Session'] = tp
    # concatenate
    df = pd.concat([df, dfi])
    
df.to_csv(INSTRUMENTS_DIR+'all_DEMO.csv')

In [13]:
for corr_type in ['pearsonr','kendalltau', 'chi2']: 
    show_corr_with_instrument(instrument='DEMO',
                          corr_type=corr_type, session='FU3',
                          ignore_cols=['Session'], # dont ignore Sex and Site
                          use_only_lbls = ["Onset","Frequency","Amount","Binge","AUDIT_quick","AUDIT"])

Unnamed: 0,Onset,Frequency,Amount,Binge,AUDIT_quick,AUDIT
Sex,0.01,0.2,0.14,0.23,0.21,0.19
Site,-0.01,0.06,0.05,-0.05,0.03,0.03
Handedness,0.03,-0.01,0.04,0.05,0.01,0.02
Gender,0.02,0.2,0.16,0.23,0.22,0.21
Height,0.01,0.15,0.11,0.17,0.17,0.18
Weight,-0.0,0.04,0.13,0.13,0.11,0.07

Unnamed: 0,Onset,Frequency,Amount,Binge,AUDIT_quick,AUDIT
Sex,0.6579,0.0,0.0,0.0,0.0,0.0
Site,0.7947,0.0492,0.1185,0.1169,0.2687,0.2829
Handedness,0.3791,0.778,0.2032,0.1029,0.63,0.4601
Gender,0.4034,0.0,0.0,0.0,0.0,0.0
Height,0.7203,0.0,0.0008,0.0,0.0,0.0
Weight,0.9311,0.1853,0.0001,0.0001,0.0012,0.0352


Unnamed: 0,Onset,Frequency,Amount,Binge,AUDIT_quick,AUDIT
Sex,0.01,0.19,0.13,0.22,0.2,0.18
Site,-0.0,0.05,0.03,-0.04,0.02,0.02
Handedness,0.02,-0.01,0.04,0.05,0.02,0.02
Gender,0.03,0.19,0.15,0.22,0.2,0.2
Height,-0.01,0.15,0.09,0.15,0.14,0.15
Weight,0.01,0.06,0.14,0.13,0.13,0.1

Unnamed: 0,Onset,Frequency,Amount,Binge,AUDIT_quick,AUDIT
Sex,0.5889,0.0,0.0,0.0,0.0,0.0
Site,0.8739,0.0537,0.1707,0.0903,0.3278,0.3248
Handedness,0.4725,0.8359,0.1314,0.0881,0.5567,0.3613
Gender,0.3558,0.0,0.0,0.0,0.0,0.0
Height,0.6828,0.0,0.0003,0.0,0.0,0.0
Weight,0.6315,0.0174,0.0,0.0,0.0,0.0001


Unnamed: 0,Onset,Frequency,Amount,Binge,AUDIT_quick,AUDIT
Sex,0.84,51.71,24.73,67.72,52.86,44.0
Site,23.87,67.04,171.29,236.38,109.64,106.27
Handedness,7.2,4.75,8.53,6.38,4.83,5.99
Gender,2.73,53.51,37.91,70.11,58.46,54.13
Height,258.02,290.04,329.92,309.56,322.46,347.84
Weight,549.58,599.78,569.85,570.09,616.35,605.8

Unnamed: 0,Onset,Frequency,Amount,Binge,AUDIT_quick,AUDIT
Sex,0.658,0.0,0.0,0.0,0.0,0.0
Site,0.0475,0.0,0.0,0.0,0.0,0.0
Handedness,0.3024,0.5759,0.202,0.3819,0.5664,0.4239
Gender,0.604,0.0,0.0,0.0,0.0,0.0
Height,0.5922,0.1299,0.0036,0.0282,0.0081,0.0004
Weight,0.7616,0.2208,0.541,0.5383,0.1075,0.1732


## Socioeconomic

### Economic
  - parental education (composed of mother’s and father’s education score), 
  - neighborhood-related factors (composed of neighborhood score and home inadequacy score), and 
  - income-related variables (composed of financial difficulties score, financial crisis score, and family stress unemployment score). 
  - parental income level, parental homeownership, and parental level of education. 

(TO DO) Paper : https://pubmed.ncbi.nlm.nih.gov/32444868/

### CTQ
The Childhood Trauma Questionnaire (CTQ, Bernstein & Fink, 1997) is a 28-item self-report inventory that provides brief, reliable, and valid screening for histories of abuse and neglect. It inquires about five types of maltreatment - emotional, physical, and sexual abuse, and emotional and physical neglect. Also included is a 3 item Minimization /Denial scale for detecting false-negative trauma reports. The CTQ has been widely and successfully employed in several studies including gene-neuroimaging studies (Caspi et al., 2010). Suitable for adolescents and adults 12+ years.

In [14]:
# show_corr_with_instrument(instrument='CTQ', session='FU3',
#           use_only_lbls = ["Onset","Frequency","Amount","Binge","AUDIT_quick","AUDIT"], # , 'Onset'
#           val_type='raw')

In [15]:
# show_corr_with_instrument(instrument='CTQ', session='FU3',
#           use_only_lbls = ["Onset","Frequency","Amount","Binge","AUDIT_quick","AUDIT"], # , 'Onset'
#           val_type='categorize')

In [16]:
for corr_type in ['pearsonr','kendalltau', 'chi2']: 
    show_corr_with_instrument(instrument='CTQ',
                          corr_type=corr_type, session='FU3',
                          use_only_lbls = ["Onset","Frequency","Amount","Binge","AUDIT_quick","AUDIT"],
                          val_type='binarize')

Unnamed: 0,Onset,Frequency,Amount,Binge,AUDIT_quick,AUDIT
Emotional abuse sum,0.07,-0.02,0.11,0.07,0.05,0.1
Physical abuse sum,0.06,0.01,0.09,0.04,0.02,0.07
Sexual abuse sum,0.06,0.03,0.03,0.06,0.03,0.05
Emotional neglect sum,-0.03,0.06,-0.08,-0.05,-0.02,-0.03
Physical neglect sum,-0.01,0.07,0.02,0.06,0.06,0.05
Denial sum,-0.1,-0.02,-0.1,-0.14,-0.06,-0.1
Denial q1,-0.08,0.06,-0.04,-0.09,0.01,-0.03
Denial q2,-0.1,-0.06,-0.14,-0.14,-0.09,-0.11
Denial q3,-0.09,-0.06,-0.07,-0.12,-0.08,-0.1

Unnamed: 0,Onset,Frequency,Amount,Binge,AUDIT_quick,AUDIT
Emotional abuse sum,0.0703,0.5277,0.0059,0.0874,0.2092,0.005
Physical abuse sum,0.1059,0.7455,0.0351,0.2686,0.5504,0.0474
Sexual abuse sum,0.1267,0.3597,0.451,0.1146,0.3964,0.1447
Emotional neglect sum,0.3826,0.0943,0.0413,0.1589,0.6355,0.4985
Physical neglect sum,0.7572,0.0487,0.6278,0.1214,0.1131,0.1456
Denial sum,0.0049,0.6213,0.0147,0.0004,0.1126,0.0091
Denial q1,0.0345,0.0824,0.2946,0.0236,0.7036,0.3725
Denial q2,0.0076,0.1155,0.0008,0.0002,0.0156,0.0024
Denial q3,0.019,0.1013,0.072,0.0018,0.0309,0.0055


Unnamed: 0,Onset,Frequency,Amount,Binge,AUDIT_quick,AUDIT
Emotional abuse sum,0.08,-0.01,0.11,0.1,0.09,0.12
Physical abuse sum,0.04,-0.01,0.07,0.03,0.02,0.06
Sexual abuse sum,0.04,-0.04,-0.03,0.02,-0.04,-0.01
Emotional neglect sum,-0.03,0.05,-0.08,-0.06,-0.03,-0.04
Physical neglect sum,-0.0,0.09,0.02,0.09,0.09,0.07
Denial sum,-0.1,-0.01,-0.09,-0.13,-0.06,-0.09
Denial q1,-0.08,0.05,-0.03,-0.09,0.0,-0.04
Denial q2,-0.08,-0.06,-0.13,-0.15,-0.1,-0.11
Denial q3,-0.08,-0.05,-0.07,-0.12,-0.08,-0.1

Unnamed: 0,Onset,Frequency,Amount,Binge,AUDIT_quick,AUDIT
Emotional abuse sum,0.014,0.7535,0.0034,0.004,0.0088,0.0002
Physical abuse sum,0.2314,0.7429,0.0744,0.448,0.6851,0.0897
Sexual abuse sum,0.2249,0.2449,0.4851,0.5989,0.3589,0.8143
Emotional neglect sum,0.4259,0.0936,0.0257,0.0975,0.3707,0.1847
Physical neglect sum,0.8928,0.0069,0.6121,0.0123,0.0171,0.048
Denial sum,0.0029,0.6392,0.0149,0.0001,0.0715,0.0047
Denial q1,0.0269,0.147,0.3462,0.0099,0.9258,0.2241
Denial q2,0.0166,0.0819,0.0007,0.0,0.0068,0.0011
Denial q3,0.0193,0.1129,0.0523,0.001,0.0359,0.0046


Unnamed: 0,Onset,Frequency,Amount,Binge,AUDIT_quick,AUDIT
Emotional abuse sum,21.12,14.12,32.53,21.87,27.13,28.82
Physical abuse sum,9.26,6.23,16.5,8.07,12.41,8.03
Sexual abuse sum,12.84,16.4,12.46,10.04,11.72,11.56
Emotional neglect sum,22.55,11.56,20.59,18.52,16.87,21.53
Physical neglect sum,4.55,16.78,12.98,14.76,13.26,12.76
Denial sum,21.98,9.33,24.73,19.8,11.75,11.44
Denial q1,6.66,8.3,5.54,9.24,4.52,5.36
Denial q2,8.9,8.88,14.2,20.54,11.05,15.46
Denial q3,14.68,6.9,7.76,14.09,8.03,10.22

Unnamed: 0,Onset,Frequency,Amount,Binge,AUDIT_quick,AUDIT
Emotional abuse sum,0.3304,0.7214,0.019,0.1899,0.0562,0.0362
Physical abuse sum,0.5975,0.9372,0.2233,0.7069,0.413,0.6259
Sexual abuse sum,0.4603,0.2893,0.5691,0.6909,0.4683,0.4819
Emotional neglect sum,0.2579,0.869,0.3008,0.2944,0.3943,0.1208
Physical neglect sum,0.9911,0.2095,0.4495,0.2549,0.3507,0.3865
Denial sum,0.0557,0.6741,0.025,0.0709,0.4661,0.4918
Denial q1,0.2473,0.0811,0.3538,0.0553,0.3403,0.2521
Denial q2,0.1132,0.1141,0.0144,0.001,0.0505,0.0086
Denial q3,0.0118,0.2282,0.1702,0.0151,0.1544,0.0693


### CTS
conflict tactics scale (CTS) is used in the research of family violence. Scales are based on the premise that conflict is an inevitable aspect of all human association, but that the use of coercion (including force and violence) as a conflict-resolution tactic is harmful. But widely criticized. 

In [17]:
# show_corr_with_instrument(instrument='CTS', session='FU3',
#           use_only_lbls = ["Onset","Frequency","Amount","Binge","AUDIT_quick","AUDIT"], # , 'Onset'
#           val_type='raw', vmax=0.2)

In [18]:
# show_corr_with_instrument(instrument='CTS', session='FU3',
#           use_only_lbls = ["Onset","Frequency","Amount","Binge","AUDIT_quick","AUDIT"], # , 'Onset'
#           val_type='categorize', vmax=0.2)

In [19]:
for corr_type in ['pearsonr','kendalltau', 'chi2']: 
    show_corr_with_instrument(instrument='CTS', corr_type=corr_type, session='FU3', 
          use_only_lbls = ["Onset","Frequency","Amount","Binge","AUDIT_quick","AUDIT"], # , 'Onset'
          val_type='binarize')

Unnamed: 0,Onset,Frequency,Amount,Binge,AUDIT_quick,AUDIT
Assault,-0.0,-0.02,0.03,0.03,-0.03,-0.0
Injury,-0.02,-0.03,0.01,-0.0,-0.04,-0.03
Negotiation,0.02,0.04,-0.05,0.07,0.05,0.01
Psychological Aggression,0.01,0.0,0.04,0.05,0.04,0.03
Sexual Coercion,0.01,-0.07,-0.09,-0.07,-0.06,-0.05

Unnamed: 0,Onset,Frequency,Amount,Binge,AUDIT_quick,AUDIT
Assault,0.9116,0.5669,0.3987,0.4309,0.3974,0.9454
Injury,0.6434,0.3596,0.7275,0.9942,0.2301,0.3276
Negotiation,0.6244,0.2495,0.1685,0.0396,0.1437,0.8592
Psychological Aggression,0.689,0.9601,0.2765,0.1675,0.3363,0.4491
Sexual Coercion,0.6926,0.0523,0.019,0.0612,0.0958,0.1361


Unnamed: 0,Onset,Frequency,Amount,Binge,AUDIT_quick,AUDIT
Assault,0.04,-0.03,-0.03,0.02,-0.05,-0.01
Injury,0.03,-0.05,-0.01,0.01,-0.03,-0.01
Negotiation,-0.0,0.03,-0.05,0.05,0.03,0.0
Psychological Aggression,-0.02,-0.02,0.0,0.03,0.03,0.02
Sexual Coercion,-0.01,-0.08,-0.1,-0.12,-0.1,-0.06

Unnamed: 0,Onset,Frequency,Amount,Binge,AUDIT_quick,AUDIT
Assault,0.2915,0.4144,0.4825,0.6606,0.1436,0.8065
Injury,0.4117,0.1598,0.7051,0.8001,0.4095,0.7306
Negotiation,0.9678,0.2914,0.0938,0.0904,0.255,0.9026
Psychological Aggression,0.5817,0.5837,0.9224,0.3863,0.3518,0.5152
Sexual Coercion,0.6697,0.0161,0.0048,0.0004,0.0034,0.0784


Unnamed: 0,Onset,Frequency,Amount,Binge,AUDIT_quick,AUDIT
Assault,24.6,28.97,17.61,36.42,19.36,23.48
Injury,22.12,9.47,10.33,9.63,10.32,10.27
Negotiation,214.78,192.54,215.98,213.84,216.95,219.84
Psychological Aggression,109.59,87.34,89.14,90.52,84.29,89.56
Sexual Coercion,53.8,45.52,26.07,48.38,37.0,47.57

Unnamed: 0,Onset,Frequency,Amount,Binge,AUDIT_quick,AUDIT
Assault,0.4849,0.3122,0.6128,0.0842,0.623,0.6059
Injury,0.0361,0.5786,0.5008,0.5641,0.6679,0.6721
Negotiation,0.272,0.6718,0.2084,0.2389,0.2094,0.242
Psychological Aggression,0.1454,0.6461,0.2762,0.524,0.5014,0.3752
Sexual Coercion,0.1478,0.2895,0.8331,0.1705,0.6061,0.2562


### LEQ
Life-Events Questionnaire (LEQ) is an adaptation of the Stressful Life-Event Questionnaire. Following 
domains:  Family/Parents,  Accident/lllness,  Sexuality,  Autonomy,  Deviance,  Relocation,  and  Distress. 

In [20]:
# show_corr_with_instrument(instrument='LEQ', session='FU3',
#           use_only_lbls = ["Onset","Frequency","Amount","Binge","AUDIT_quick","AUDIT"], # , 'Onset'
#           val_type='raw')

In [21]:
for corr_type in ['pearsonr','kendalltau', 'chi2']: 
    show_corr_with_instrument(instrument='LEQ', corr_type=corr_type, session='FU3',
          use_only_lbls = ["Onset","Frequency","Amount","Binge","AUDIT_quick","AUDIT"], # , 'Onset'
          val_type='binarize')

Unnamed: 0,Onset,Frequency,Amount,Binge,AUDIT_quick,AUDIT
Family valence,0.14,-0.01,-0.01,-0.02,-0.0,-0.02
Accident valence,-0.02,-0.02,-0.16,-0.1,-0.09,-0.08
Sexuality valence,-0.05,0.04,-0.05,-0.1,-0.03,-0.05
Autonomy valence,0.03,0.01,-0.01,-0.0,-0.03,-0.03
Devience valence,0.02,-0.05,0.02,-0.02,-0.01,0.0
Relocation valence,0.03,-0.02,-0.14,-0.1,-0.05,-0.06
Distress valence,0.05,-0.02,0.05,-0.0,0.02,-0.02
Noscale valence,-0.05,-0.03,-0.01,-0.06,-0.04,-0.07
Overall valence,0.03,-0.01,-0.05,-0.09,-0.05,-0.07
Family frequency,0.12,0.01,-0.01,0.09,0.01,0.05

Unnamed: 0,Onset,Frequency,Amount,Binge,AUDIT_quick,AUDIT
Family valence,0.0001,0.8796,0.7273,0.4989,0.8953,0.4702
Accident valence,0.5268,0.464,0.0,0.0047,0.0095,0.019
Sexuality valence,0.137,0.2465,0.2166,0.004,0.4261,0.1376
Autonomy valence,0.3259,0.7135,0.8835,0.9847,0.4037,0.3215
Devience valence,0.521,0.1604,0.6498,0.506,0.8665,0.8986
Relocation valence,0.3435,0.4845,0.0002,0.0062,0.1693,0.0927
Distress valence,0.1793,0.623,0.2047,0.8896,0.6557,0.5797
Noscale valence,0.1501,0.4257,0.7877,0.069,0.3292,0.0326
Overall valence,0.3389,0.7459,0.1447,0.014,0.1941,0.0352
Family frequency,0.0007,0.7502,0.8856,0.0124,0.6966,0.1431


Unnamed: 0,Onset,Frequency,Amount,Binge,AUDIT_quick,AUDIT
Family valence,0.12,-0.01,-0.04,-0.03,-0.02,-0.03
Accident valence,-0.02,-0.03,-0.15,-0.1,-0.1,-0.08
Sexuality valence,-0.04,0.04,-0.06,-0.08,-0.03,-0.04
Autonomy valence,0.03,0.01,-0.02,-0.01,-0.03,-0.02
Devience valence,0.02,-0.05,0.01,-0.01,-0.0,0.01
Relocation valence,0.03,-0.04,-0.13,-0.1,-0.06,-0.07
Distress valence,0.05,-0.02,0.02,-0.01,-0.01,-0.02
Noscale valence,-0.04,-0.03,-0.01,-0.06,-0.03,-0.07
Overall valence,0.03,-0.02,-0.08,-0.09,-0.06,-0.06
Family frequency,0.12,0.01,-0.0,0.07,0.02,0.05

Unnamed: 0,Onset,Frequency,Amount,Binge,AUDIT_quick,AUDIT
Family valence,0.0,0.6294,0.2609,0.3602,0.5926,0.3626
Accident valence,0.6022,0.3866,0.0,0.0021,0.0025,0.0074
Sexuality valence,0.1678,0.1674,0.0648,0.0075,0.3692,0.1661
Autonomy valence,0.3049,0.7072,0.6044,0.8318,0.4007,0.4167
Devience valence,0.6145,0.1252,0.8468,0.6876,0.9745,0.8093
Relocation valence,0.2866,0.2435,0.0001,0.0024,0.089,0.0317
Distress valence,0.1146,0.4963,0.6366,0.6706,0.8201,0.4434
Noscale valence,0.1499,0.3845,0.6629,0.0445,0.2739,0.0207
Overall valence,0.3001,0.5617,0.0081,0.0037,0.0612,0.0268
Family frequency,0.0002,0.6933,0.9271,0.0289,0.502,0.1051


Unnamed: 0,Onset,Frequency,Amount,Binge,AUDIT_quick,AUDIT
Family valence,33.38,16.41,19.21,23.14,13.88,22.28
Accident valence,10.9,14.11,34.87,19.39,17.7,15.29
Sexuality valence,21.3,12.96,28.74,32.93,17.02,12.27
Autonomy valence,17.2,29.06,20.32,23.23,17.51,21.34
Devience valence,12.51,9.95,6.66,9.29,15.42,13.81
Relocation valence,11.8,13.43,22.48,16.13,8.93,17.57
Distress valence,18.06,15.68,25.48,9.65,20.66,25.06
Noscale valence,10.73,11.92,11.37,14.78,8.63,15.3
Overall valence,46.61,47.29,69.63,54.87,56.68,57.22
Family frequency,16.41,0.42,0.79,7.58,1.79,3.11

Unnamed: 0,Onset,Frequency,Amount,Binge,AUDIT_quick,AUDIT
Family valence,0.0042,0.495,0.3166,0.11,0.535,0.1344
Accident valence,0.5373,0.2938,0.0003,0.0795,0.1251,0.226
Sexuality valence,0.2131,0.8406,0.0516,0.017,0.5217,0.8331
Autonomy valence,0.5762,0.065,0.3754,0.1821,0.5551,0.4382
Devience valence,0.2523,0.4449,0.7574,0.4107,0.1176,0.182
Relocation valence,0.2984,0.266,0.0209,0.096,0.539,0.0921
Distress valence,0.4515,0.6152,0.0846,0.841,0.242,0.0934
Noscale valence,0.552,0.452,0.4125,0.1926,0.734,0.2255
Overall valence,0.7522,0.7605,0.0885,0.2618,0.4493,0.4295
Family frequency,0.0025,0.9805,0.9394,0.1082,0.7737,0.5391


## Co-morbidities

### FTND

In [22]:
df = pd.read_csv(INSTRUMENTS_DIR + 'all_FTND.csv').drop(columns=['Likelihood of nicotine dependence child'])

for corr_type in ['pearsonr','kendalltau', 'chi2']: 
    show_corr_with_instrument(instrument=df, corr_type=corr_type, session='FU3',
          use_only_lbls = ["Onset","Frequency","Amount","Binge","AUDIT_quick","AUDIT"], # , 'Onset'
          val_type='binarize')

Unnamed: 0,Onset,Frequency,Amount,Binge,AUDIT_quick,AUDIT
FTND Sum,0.16,-0.02,0.11,0.12,0.12,0.14

Unnamed: 0,Onset,Frequency,Amount,Binge,AUDIT_quick,AUDIT
FTND Sum,0.0,0.6154,0.0039,0.0006,0.0007,0.0


Unnamed: 0,Onset,Frequency,Amount,Binge,AUDIT_quick,AUDIT
FTND Sum,0.19,0.01,0.12,0.14,0.14,0.15

Unnamed: 0,Onset,Frequency,Amount,Binge,AUDIT_quick,AUDIT
FTND Sum,0.0,0.7658,0.0011,0.0,0.0001,0.0


Unnamed: 0,Onset,Frequency,Amount,Binge,AUDIT_quick,AUDIT
FTND Sum,38.31,9.37,15.05,22.08,21.29,26.37

Unnamed: 0,Onset,Frequency,Amount,Binge,AUDIT_quick,AUDIT
FTND Sum,0.0,0.2269,0.0354,0.0025,0.0034,0.0004


### GEN

In [23]:
df_gen = pd.read_csv("/ritter/share/data/IMAGEN/posthoc/all_GEN.csv")

# print out all parental mental disorders diagnosed in this questionnaire
all_disorders = set()
for disorder in  (df_gen["Pd_list"].tolist() + df_gen["Md_list"].tolist()):
    disorder = eval(disorder)
    if disorder:
        all_disorders.update(disorder)
print("Parental mental disorders diagnosed in this questionnaire:")
display(all_disorders)

# df_gen = df_gen.groupby('Session').get_group('FU3')
# 1) create a column with a count of whether both parents have some mental disorder
df_gen['parental_disorder_cnts_all'] = df_gen.apply(lambda row: len(eval(row["Pd_list"]))+len(eval(row["Md_list"])),  axis=1)

# drop 'eating disorder' and 'other'
for col in ["Pd_list", "Md_list"]:
    df_gen[col] = df_gen[col].astype('object')
    for i, row in df_gen.iterrows():
        disorders = eval(row[col]) 
        if len(disorders):
            if 'Other' in disorders: disorders.remove('Other')
            if 'Eating Disorder' in disorders: disorders.remove('Eating Disorder')
#             if len(disorders) != len(eval(row[col])):
#                 print(eval(row[col]), '-->', disorders)
            df_gen.at[i, col] = list(disorders)

# 2) count of the main disorders only
df_gen['parental_disorder_cnts_main'] = df_gen.apply(lambda row: len((row["Pd_list"]))+len((row["Md_list"])),  axis=1)
df_gen['parental_disorder_exists_main'] = df_gen.apply(lambda row: bool((row["Pd_list"])) or bool((row["Md_list"])),  axis=1)
df_gen['parent_depressed'] = df_gen.apply(lambda row: bool([dis for dis in (row["Pd_list"]) if 'depress' in dis.lower()]) or \
                                          bool([dis for dis in (row["Md_list"]) if 'depress' in dis.lower()]),  axis=1).astype(int)
df_gen['parent_alcoholabuse'] = df_gen.apply(lambda row: bool([dis for dis in (row["Pd_list"]) if 'alcohol' in dis.lower()]) or \
                                          bool([dis for dis in (row["Md_list"]) if 'alcohol' in dis.lower()]),  axis=1).astype(int)

df_gen = df_gen.drop(columns=['Paternal_disorder', 'Maternal_disorder', "Pd_list", "Md_list"])
df_gen.groupby('Session').get_group('FU3').describe() #['parent_alcoholabuse'].value_counts()

Parental mental disorders diagnosed in this questionnaire:


{'Alcohol problems',
 'Anxiety Disorder',
 'Bipolar I Disorder',
 'Bipolar II Disorder',
 'Drug problems',
 'Eating Disorder',
 'Major Depression recurrent',
 'Major Depression single episode',
 'Obessive-compulsive Disroder',
 'Other',
 'Schizoaffective Disorder',
 'Schizophrenia',
 'Suicide / Suicidal Attempt'}

Unnamed: 0,ID,parental_disorder_cnts_all,parental_disorder_cnts_main,parent_depressed,parent_alcoholabuse
count,2191.0,2191.0,2191.0,2191.0,2191.0
mean,49779090.0,0.722958,4.246463,0.092652,0.031949
std,28912510.0,1.889026,1.163939,0.29001,0.175904
min,1274.0,0.0,2.0,0.0,0.0
25%,24646390.0,0.0,4.0,0.0,0.0
50%,49433850.0,0.0,4.0,0.0,0.0
75%,75100680.0,0.0,4.0,0.0,0.0
max,99954900.0,10.0,9.0,1.0,1.0


In [24]:
# show_corr_with_instrument(instrument=df_gen, session='FU3',
#           use_only_lbls = ["Onset","Frequency","Amount","Binge","AUDIT_quick","AUDIT"], # , 'Onset'
#           val_type='raw')

In [25]:
for corr_type in ['pearsonr','kendalltau', 'chi2']: 
    show_corr_with_instrument(instrument=df_gen, corr_type=corr_type, session='FU3',
          use_only_lbls = ["Onset","Frequency","Amount","Binge","AUDIT_quick","AUDIT"], # , 'Onset'
          val_type='binarize')

Unnamed: 0,Onset,Frequency,Amount,Binge,AUDIT_quick,AUDIT
parental_disorder_cnts_all,0.09,0.02,0.07,0.08,0.02,0.04
parental_disorder_cnts_main,0.05,0.0,0.05,0.06,0.01,0.02
parental_disorder_exists_main,,,,,,
parent_depressed,0.04,-0.02,0.06,0.06,0.02,0.0
parent_alcoholabuse,0.05,0.01,0.06,0.05,-0.0,-0.0

Unnamed: 0,Onset,Frequency,Amount,Binge,AUDIT_quick,AUDIT
parental_disorder_cnts_all,0.008,0.6346,0.0717,0.0169,0.577,0.2953
parental_disorder_cnts_main,0.1304,0.9341,0.2231,0.1096,0.7786,0.5024
parental_disorder_exists_main,,,,,,
parent_depressed,0.2778,0.5183,0.0836,0.1144,0.5381,0.9857
parent_alcoholabuse,0.1234,0.8825,0.0854,0.1362,0.9423,0.9047


Unnamed: 0,Onset,Frequency,Amount,Binge,AUDIT_quick,AUDIT
parental_disorder_cnts_all,0.09,0.01,0.06,0.08,0.02,0.02
parental_disorder_cnts_main,0.03,0.02,0.03,0.04,0.02,0.04
parental_disorder_exists_main,,,,,,
parent_depressed,0.04,-0.02,0.06,0.06,0.02,0.0
parent_alcoholabuse,0.05,0.01,0.06,0.05,-0.0,-0.0

Unnamed: 0,Onset,Frequency,Amount,Binge,AUDIT_quick,AUDIT
parental_disorder_cnts_all,0.0056,0.7962,0.0874,0.0197,0.659,0.5483
parental_disorder_cnts_main,0.307,0.5876,0.3535,0.2713,0.6346,0.2896
parental_disorder_exists_main,,,,,,
parent_depressed,0.2775,0.518,0.0835,0.1143,0.5377,0.9857
parent_alcoholabuse,0.1233,0.8824,0.0854,0.1361,0.9422,0.9046


Unnamed: 0,Onset,Frequency,Amount,Binge,AUDIT_quick,AUDIT
parental_disorder_cnts_all,15.9,3.01,11.72,9.91,10.5,12.05
parental_disorder_cnts_main,9.1,8.35,6.53,8.31,5.4,8.64
parental_disorder_exists_main,0.0,0.0,0.0,0.0,0.0,0.0
parent_depressed,0.91,0.27,2.51,2.09,0.22,0.01
parent_alcoholabuse,1.75,0.0,2.14,1.62,0.03,0.01

Unnamed: 0,Onset,Frequency,Amount,Binge,AUDIT_quick,AUDIT
parental_disorder_cnts_all,0.0261,0.6984,0.164,0.2712,0.2319,0.1488
parental_disorder_cnts_main,0.1681,0.2137,0.4793,0.3061,0.6115,0.2797
parental_disorder_exists_main,1.0,1.0,1.0,1.0,1.0,1.0
parent_depressed,0.3401,0.6033,0.1129,0.1478,0.6357,0.9084
parent_alcoholabuse,0.1862,0.9592,0.1439,0.2031,0.8641,0.9312


### BSI
The Brief Symptom Inventory (BSI) is the shortened version of the Symptoms Checklist-90. The BSI is a 53-item questionnaire covering nine 
psychological symptoms: 
1) Somatization, 
2) Obsession-compulsion, 
3) Interpersonal sensitivity, 
4) Depression, 
5) Anxiety, 
6) Hostility, 
7) Phobic anxiety, 
8) Paranoid ideation, and 
9) Psychoticism. 

The BSI is one of the most widely accepted psychometric screening tools of general psychopathology in both research and clinical practice

In [26]:
# show_corr_with_instrument(instrument=df_gen, session='FU3',
#           use_only_lbls = ["Onset","Frequency","Amount","Binge","AUDIT_quick","AUDIT"], # , 'Onset'
#           val_type='raw')

In [27]:
for corr_type in ['pearsonr','kendalltau', 'chi2']: 
    show_corr_with_instrument(instrument='BSI', corr_type=corr_type, session='FU3',
          use_only_lbls = ["Onset","Frequency","Amount","Binge","AUDIT_quick","AUDIT"], # , 'Onset'
          val_type='binarize')

Unnamed: 0,Onset,Frequency,Amount,Binge,AUDIT_quick,AUDIT
Somatization,0.07,-0.03,0.05,0.04,0.02,0.04
Obsession-Compulsion,0.05,-0.03,0.01,0.02,0.02,0.08
Interpersonal Sensitivity,0.03,-0.03,0.04,0.04,-0.03,0.04
Depression,0.06,0.01,0.08,0.08,0.03,0.1
Anxiety,0.09,-0.0,0.02,0.05,0.02,0.06
Hostility,0.1,-0.05,0.01,0.02,-0.03,0.0
Phobic Anxiety,0.1,-0.06,0.02,-0.0,-0.02,0.0
Paranoid Ideation,0.03,-0.06,0.05,-0.03,-0.02,0.02
Psychoticism,0.07,0.02,0.07,0.06,0.03,0.09
Positive Symptom Distress Index,0.08,-0.03,0.05,0.04,0.01,0.07

Unnamed: 0,Onset,Frequency,Amount,Binge,AUDIT_quick,AUDIT
Somatization,0.0304,0.3387,0.1621,0.2633,0.6355,0.2803
Obsession-Compulsion,0.1478,0.4136,0.7813,0.5443,0.5952,0.0208
Interpersonal Sensitivity,0.3504,0.3517,0.3423,0.3206,0.4757,0.2557
Depression,0.0674,0.8694,0.0332,0.0327,0.4149,0.0049
Anxiety,0.0109,0.9246,0.6877,0.1864,0.5883,0.0907
Hostility,0.0053,0.1249,0.793,0.667,0.4526,0.9168
Phobic Anxiety,0.0049,0.0949,0.5867,0.9903,0.5632,0.8931
Paranoid Ideation,0.3206,0.0591,0.1755,0.4056,0.6159,0.5109
Psychoticism,0.0488,0.6416,0.0808,0.073,0.4355,0.0099
Positive Symptom Distress Index,0.0178,0.3704,0.1856,0.2423,0.8448,0.0545


Unnamed: 0,Onset,Frequency,Amount,Binge,AUDIT_quick,AUDIT
Somatization,0.08,-0.02,0.04,0.04,0.01,0.05
Obsession-Compulsion,0.05,0.01,0.0,0.03,0.03,0.09
Interpersonal Sensitivity,0.03,-0.02,0.01,0.04,-0.02,0.04
Depression,0.04,0.04,0.05,0.07,0.03,0.1
Anxiety,0.09,-0.01,-0.03,0.03,0.01,0.05
Hostility,0.06,-0.02,0.0,0.04,0.0,0.04
Phobic Anxiety,0.08,-0.04,-0.01,0.01,-0.03,0.02
Paranoid Ideation,0.03,-0.05,0.02,-0.04,-0.01,0.03
Psychoticism,0.06,0.04,0.04,0.07,0.05,0.1
Positive Symptom Distress Index,0.07,-0.0,0.02,0.05,0.02,0.08

Unnamed: 0,Onset,Frequency,Amount,Binge,AUDIT_quick,AUDIT
Somatization,0.0088,0.551,0.3076,0.1801,0.7371,0.1099
Obsession-Compulsion,0.1126,0.7587,0.9039,0.2732,0.2731,0.002
Interpersonal Sensitivity,0.3015,0.5414,0.6625,0.1855,0.5037,0.2547
Depression,0.1715,0.1763,0.1664,0.0381,0.3442,0.0011
Anxiety,0.0054,0.7407,0.4175,0.3115,0.6884,0.0898
Hostility,0.0409,0.4608,0.9377,0.1913,0.9028,0.241
Phobic Anxiety,0.011,0.1875,0.6874,0.7285,0.4588,0.5996
Paranoid Ideation,0.3619,0.0982,0.5838,0.2697,0.8144,0.2655
Psychoticism,0.0724,0.1935,0.2027,0.0295,0.1133,0.0011
Positive Symptom Distress Index,0.0178,0.9626,0.5216,0.1157,0.6109,0.0058


Unnamed: 0,Onset,Frequency,Amount,Binge,AUDIT_quick,AUDIT
Somatization,30.24,23.73,15.57,13.15,17.86,30.68
Obsession-Compulsion,19.14,28.25,14.21,26.41,25.75,31.15
Interpersonal Sensitivity,22.54,18.6,18.9,13.94,7.27,5.05
Depression,36.01,18.88,26.68,28.94,23.29,42.46
Anxiety,35.09,24.93,17.09,15.09,22.9,18.91
Hostility,27.56,15.12,18.26,15.36,25.29,27.3
Phobic Anxiety,23.22,17.41,8.86,13.29,11.13,10.5
Paranoid Ideation,14.38,20.5,17.51,11.55,26.32,29.27
Psychoticism,18.47,13.92,10.48,20.0,11.2,19.82
Positive Symptom Distress Index,96.33,92.66,80.96,67.85,77.22,93.87

Unnamed: 0,Onset,Frequency,Amount,Binge,AUDIT_quick,AUDIT
Somatization,0.0247,0.2066,0.3401,0.7263,0.3978,0.0313
Obsession-Compulsion,0.5764,0.1333,0.7711,0.1912,0.2162,0.0711
Interpersonal Sensitivity,0.0944,0.1808,0.1686,0.4539,0.8877,0.9742
Depression,0.0218,0.6525,0.1446,0.1155,0.2746,0.0037
Anxiety,0.0136,0.1628,0.5173,0.5893,0.242,0.4628
Hostility,0.0163,0.3701,0.148,0.3541,0.0318,0.0176
Phobic Anxiety,0.0568,0.2949,0.7148,0.426,0.6758,0.787
Paranoid Ideation,0.5704,0.2493,0.3536,0.7746,0.0497,0.0321
Psychoticism,0.1406,0.3796,0.5735,0.0953,0.5939,0.0997
Positive Symptom Distress Index,0.1687,0.2428,0.4176,0.8103,0.6287,0.3414


### MAST

In [28]:
for corr_type in ['pearsonr','kendalltau', 'chi2']: 
    show_corr_with_instrument(instrument='MAST', corr_type=corr_type, session='FU3',
          use_only_lbls = ["Onset","Frequency","Amount","Binge","AUDIT_quick","AUDIT"], # , 'Onset'
          val_type='binarize')

[WARN]MAST flag vs Amount: Only 526 out of 2315 subjects used due to NaN values (77% dropped)
[WARN]MAST flag vs Binge: Only 564 out of 2315 subjects used due to NaN values (76% dropped)
[WARN]MAST flag vs AUDIT_quick: Only 515 out of 2315 subjects used due to NaN values (78% dropped)
[WARN]MAST total vs Amount: Only 526 out of 2315 subjects used due to NaN values (77% dropped)
[WARN]MAST total vs Binge: Only 564 out of 2315 subjects used due to NaN values (76% dropped)
[WARN]MAST total vs AUDIT_quick: Only 515 out of 2315 subjects used due to NaN values (78% dropped)
[WARN]MAST Alcohol dependency symptoms vs Amount: Only 526 out of 2315 subjects used due to NaN values (77% dropped)
[WARN]MAST Alcohol dependency symptoms vs Binge: Only 564 out of 2315 subjects used due to NaN values (76% dropped)
[WARN]MAST Alcohol dependency symptoms vs AUDIT_quick: Only 515 out of 2315 subjects used due to NaN values (78% dropped)
[WARN]MAST sum vs Amount: Only 526 out of 2315 subjects used due to Na

Unnamed: 0,Onset,Frequency,Amount,Binge,AUDIT_quick,AUDIT
MAST flag,0.02,-0.02,-0.01,0.05,0.0,0.01
MAST total,0.15,0.17,0.2,0.3,0.29,0.34
MAST Alcohol dependency symptoms,0.21,0.32,0.32,0.45,0.48,0.52
MAST sum,0.12,0.11,0.15,0.22,0.2,0.26

Unnamed: 0,Onset,Frequency,Amount,Binge,AUDIT_quick,AUDIT
MAST flag,0.6036,0.6916,0.8505,0.1938,0.9583,0.7897
MAST total,0.0002,0.0,0.0,0.0,0.0,0.0
MAST Alcohol dependency symptoms,0.0,0.0,0.0,0.0,0.0,0.0
MAST sum,0.0037,0.005,0.0006,0.0,0.0,0.0


[WARN]MAST flag vs Amount: Only 526 out of 2315 subjects used due to NaN values (77% dropped)
[WARN]MAST flag vs Binge: Only 564 out of 2315 subjects used due to NaN values (76% dropped)
[WARN]MAST flag vs AUDIT_quick: Only 515 out of 2315 subjects used due to NaN values (78% dropped)
[WARN]MAST total vs Amount: Only 526 out of 2315 subjects used due to NaN values (77% dropped)
[WARN]MAST total vs Binge: Only 564 out of 2315 subjects used due to NaN values (76% dropped)
[WARN]MAST total vs AUDIT_quick: Only 515 out of 2315 subjects used due to NaN values (78% dropped)
[WARN]MAST Alcohol dependency symptoms vs Amount: Only 526 out of 2315 subjects used due to NaN values (77% dropped)
[WARN]MAST Alcohol dependency symptoms vs Binge: Only 564 out of 2315 subjects used due to NaN values (76% dropped)
[WARN]MAST Alcohol dependency symptoms vs AUDIT_quick: Only 515 out of 2315 subjects used due to NaN values (78% dropped)
[WARN]MAST sum vs Amount: Only 526 out of 2315 subjects used due to Na

Unnamed: 0,Onset,Frequency,Amount,Binge,AUDIT_quick,AUDIT
MAST flag,0.02,-0.02,-0.01,0.05,0.0,0.01
MAST total,0.14,0.25,0.23,0.36,0.36,0.42
MAST Alcohol dependency symptoms,0.18,0.29,0.28,0.41,0.43,0.46
MAST sum,0.11,0.18,0.16,0.29,0.27,0.33

Unnamed: 0,Onset,Frequency,Amount,Binge,AUDIT_quick,AUDIT
MAST flag,0.6033,0.6913,0.8502,0.1935,0.9582,0.7894
MAST total,0.0,0.0,0.0,0.0,0.0,0.0
MAST Alcohol dependency symptoms,0.0,0.0,0.0,0.0,0.0,0.0
MAST sum,0.0015,0.0,0.0,0.0,0.0,0.0


[WARN]MAST flag vs Amount: Only 526 out of 2315 subjects used due to NaN values (77% dropped)
[WARN]MAST flag vs Binge: Only 564 out of 2315 subjects used due to NaN values (76% dropped)
[WARN]MAST flag vs AUDIT_quick: Only 515 out of 2315 subjects used due to NaN values (78% dropped)
[WARN]MAST total vs Amount: Only 526 out of 2315 subjects used due to NaN values (77% dropped)
[WARN]MAST total vs Binge: Only 564 out of 2315 subjects used due to NaN values (76% dropped)
[WARN]MAST total vs AUDIT_quick: Only 515 out of 2315 subjects used due to NaN values (78% dropped)
[WARN]MAST Alcohol dependency symptoms vs Amount: Only 526 out of 2315 subjects used due to NaN values (77% dropped)
[WARN]MAST Alcohol dependency symptoms vs Binge: Only 564 out of 2315 subjects used due to NaN values (76% dropped)
[WARN]MAST Alcohol dependency symptoms vs AUDIT_quick: Only 515 out of 2315 subjects used due to NaN values (78% dropped)
[WARN]MAST sum vs Amount: Only 526 out of 2315 subjects used due to Na

Unnamed: 0,Onset,Frequency,Amount,Binge,AUDIT_quick,AUDIT
MAST flag,0.15,0.08,0.0,1.37,0.01,0.02
MAST total,46.65,101.32,80.09,147.5,156.61,218.6
MAST Alcohol dependency symptoms,28.93,92.48,65.62,121.33,158.39,193.45
MAST sum,53.35,89.25,60.49,109.32,111.42,161.67

Unnamed: 0,Onset,Frequency,Amount,Binge,AUDIT_quick,AUDIT
MAST flag,0.6999,0.7837,0.9648,0.2421,0.9393,0.8875
MAST total,0.0037,0.0,0.0,0.0,0.0,0.0
MAST Alcohol dependency symptoms,0.0001,0.0,0.0,0.0,0.0,0.0
MAST sum,0.0003,0.0,0.0001,0.0,0.0,0.0
