# Marital Depression Analysis

<img src = "https://www.mydr.com.au/wp-content/uploads/2018/10/what_is_depression.jpeg" height = "200" width = "1000">


An analysis of depression among married individuals living in Istanbul.
The dataset contains details such as gender, education, working status and marriage details. Responses to the Beck Depression Inventory questions are included in columns B1-B21.

[Dataset Link](https://www.kaggle.com/babyoda/depression-in-married-couples)

Thanks to [@babyoda](https://www.kaggle.com/babyoda) for the dataset

## Beck's Depression Inventory
The Beck Depression Inventory (BDI), created by Aaron T. Beck, is a 21-question multiple-choice self-report inventory and one of the most widely used psychometric tests for measuring the severity of depression. Its development marked a shift among mental health professionals, who had until then viewed depression from a psychodynamic perspective rather than as rooted in the patient's own thoughts.

Each of the 21 questions offers four options scored from 0 to 3, so the total score ranges from a minimum of 0 to a maximum of 63. A higher BDI score indicates more severe depression.

Some of the questions are :
1. - 0 I do not feel sad
   - 1 I feel sad
   - 2 I am sad all the time and I can't snap out of it.
   - 3 I am so sad and unhappy that I can't stand it. 
   
   
2. - 0 I am not particularly discouraged about the future.
   - 1 I feel discouraged about the future.
   - 2 I feel I have nothing to look forward to.
   - 3 I feel the future is hopeless and that things cannot improve. 
  
For more details on the questions [visit here](https://www.ismanet.org/doctoryourspirit/pdfs/Beck-Depression-Inventory-BDI.pdf)

## BDI Score
Based on the BDI score, the patient is categorised as follows :
- 0-10     : These ups and downs are considered normal
- 11-16    : Mild mood disturbance
- 17-20    : Borderline clinical depression
- 21-30    : Moderate depression
- 31-40    : Severe depression
- Over 40  : Extreme depression 
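
The bands above can be turned into a lookup. The following is a minimal sketch and not part of the original notebook: the names `BDI_BINS`, `BDI_LABELS` and `bdi_category` are hypothetical, and a score of exactly 10 is assumed to belong to the first band.

```python
import pandas as pd

# Upper edge of each band; a score of exactly 10 is assumed to fall in the first band.
BDI_BINS = [-1, 10, 16, 20, 30, 40, 63]
BDI_LABELS = [
    "Normal ups and downs",
    "Mild mood disturbance",
    "Borderline clinical depression",
    "Moderate depression",
    "Severe depression",
    "Extreme depression",
]

def bdi_category(scores):
    """Map BDI totals (0-63) to the severity bands listed above."""
    return pd.cut(pd.Series(scores), bins=BDI_BINS, labels=BDI_LABELS)

print(bdi_category([0, 10, 11, 20, 35, 61]).tolist())
```

Because `pd.cut` uses right-closed intervals by default, each upper edge (10, 16, 20, 30, 40) stays in the lower of the two neighbouring bands.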

## Standard Imports

In [None]:
import numpy as np
import pandas as pd
import matplotlib.pyplot as plt
import seaborn as sns

Ignoring harmless warnings

In [None]:
import warnings
warnings.filterwarnings('ignore')

In [None]:
sns.set_style("darkgrid")

## Loading Data

In [None]:
data = pd.read_csv("../input/depression-in-married-couples/Dataset 1.csv")

## Data Cleaning 

In [None]:
data.head()

Calculating BDI Score

In [None]:
def bdi(rows):
    # Each row holds one respondent's answers to the BDI questions; the score is their sum.
    return np.asarray(rows).sum(axis=1)

In [None]:
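# Drop the identifier and demographic columns so that only the BDI answers (B1-B21) remain.
nonItemColumns = ['No', 'Gender', 'Education', 'Working Status', 'Marriage Style', 'Status of Having a Child']
data['BDI Score'] = bdi(data.drop(labels = nonItemColumns, axis = 1).values)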

In [None]:
# Keep only the demographic columns and the computed score.
data = data[['Gender', 'Education', 'Working Status', 'Marriage Style', 'Status of Having a Child', 'BDI Score']]
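
Before trusting the totals, it may be worth checking that every answer lies in the valid 0-3 range. The sketch below runs on a small made-up frame rather than the real file. It assumes the item columns are named `B1` through `B21`, as described in the introduction; `ITEM_COLS` and `score_bdi` are hypothetical names.

```python
import numpy as np
import pandas as pd

ITEM_COLS = [f"B{i}" for i in range(1, 22)]  # assumed column names B1..B21

def score_bdi(df):
    """Return the row-wise BDI total after checking every answer is 0-3."""
    items = df[ITEM_COLS]
    # Missing or out-of-range answers fail this check.
    if not items.isin([0, 1, 2, 3]).all().all():
        raise ValueError("BDI answers must be integers from 0 to 3")
    return items.sum(axis=1)

# Made-up answers for three respondents, for illustration only.
toy = pd.DataFrame(
    np.array([[0] * 21, [1] * 21, [3] * 21]),
    columns=ITEM_COLS,
)
totals = score_bdi(toy)
print(totals.tolist())  # [0, 21, 63]
```

The three toy rows cover the minimum (0) and maximum (63) totals, which makes it easy to see that the sum stays within the documented range.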

## Exploratory Data Analysis

Creating a separate copy for Data Visualization

In [None]:
depr = data.copy()

Defining keys for Encoded Categorical variables

In [None]:
genderKey = {1 : 'Female', 2:'Male'}
educationKey = {1 : 'Primary', 2 : "High School", 3 : "Bachelors", 4 : "Masters and Above"}
workingKey = {1 : "Employed" , 2 : 'Unemployed'}
marriageKey = {1 : "Arranged", 2 : "Love"}
childKey = {1 : "Yes", 2 : "No"}
keys = [genderKey,educationKey,workingKey,marriageKey,childKey]

Listing the categorical columns so that their encoded values can be converted back to readable labels

In [None]:
categorical = ['Gender', 'Education', 'Working Status', 'Marriage Style', 'Status of Having a Child']

In [None]:
# Replace each numeric code with its label; the order of keys matches the order of categorical
for column, key in zip(categorical, keys):
    depr[column] = depr[column].map(key)

Taking a look at our cleaned data

In [None]:
depr.head()

In [None]:
depr.info()

In [None]:
depr[['BDI Score']].describe()

The dataset has 433 rows, so it is a small dataset

In [None]:
plt.figure(figsize=(10,6))
sns.histplot(depr['BDI Score'],kde = False);  # histplot replaces the deprecated distplot

#### Some observations from the BDI distribution
- The most common score is 0, indicating no depression (which is a good sign)
- However, the mean BDI score is 10, pulled upward by high-scoring outliers
- The first quartile is 3 and the third quartile is 15, while scores reach as high as 61, which points to the presence of outliers
- Exactly two rows have the maximum BDI score in the data (61), which suggests there is one extremely depressed couple
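
One conventional way to make "outlier" concrete is Tukey's 1.5 × IQR rule. This is a sketch that uses the quartiles quoted above (3 and 15) and is not a step from the original analysis; `tukey_fences` is a hypothetical helper.

```python
def tukey_fences(q1, q3, k=1.5):
    """Return the (lower, upper) outlier fences for the given quartiles."""
    iqr = q3 - q1
    return q1 - k * iqr, q3 + k * iqr

lower, upper = tukey_fences(3, 15)
print(lower, upper)  # -15.0 33.0
```

Under this rule, scores above 33 would be flagged as outliers. The lower fence is below 0, so no low score can be flagged.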

###### Most depressed couple

In [None]:
depr[depr['BDI Score'] == 61]

In [None]:
# catplot is figure-level and creates its own figure, so the size is set with height and aspect
sns.catplot(x = 'Gender', y = 'BDI Score',data = depr,height = 6,aspect = 14/6);

Females are more densely concentrated at higher BDI scores, suggesting that, in this sample, **females are more depressed than males**

In [None]:
plt.figure(figsize=(10,6))
sns.boxplot(y = 'Education', x = 'BDI Score',data = depr);

**Less educated individuals tend to be more depressed than those with a higher level of education**, though there are exceptions: the two most depressed respondents hold Masters and Bachelors degrees

In [None]:
plt.figure(figsize=(10,6))
# histplot replaces the deprecated distplot; hue draws one density-scaled histogram with a KDE curve per group
sns.histplot(data = depr, x = 'BDI Score', hue = 'Working Status', stat = 'density', common_norm = False, kde = True);

As might be expected, **unemployed people tend to be more depressed than employed people**

In [None]:
plt.figure(figsize=(10,6))
sns.barplot(x = 'Marriage Style', y = 'BDI Score',data = depr,estimator=np.median,ci = None);

**Couples in love marriages have a lower median BDI score, and so appear happier, than couples in arranged marriages**

In [None]:
# catplot is figure-level and creates its own figure, so the size is set with height and aspect
sns.catplot(x = 'Status of Having a Child', y = 'BDI Score',data = depr,kind = 'violin',height = 6,aspect = 10/6);

**Similarly, couples who have children tend to be happier than those who do not**
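
The group comparisons in this section can also be checked numerically instead of read off the plots. Below is a minimal sketch on made-up rows: the values are illustrative and not taken from the dataset, and `median_bdi_by` is a hypothetical helper.

```python
import pandas as pd

def median_bdi_by(df, column, score="BDI Score"):
    """Median score and group size for each level of a categorical column."""
    return df.groupby(column)[score].agg(["median", "count"])

# Made-up rows for illustration only.
toy = pd.DataFrame({
    "Marriage Style": ["Love", "Love", "Arranged", "Arranged", "Arranged"],
    "BDI Score": [2, 6, 5, 12, 20],
})
summary = median_bdi_by(toy, "Marriage Style")
print(summary)
```

On the real data, the same call could presumably be made with `depr` and any of the categorical columns, for example `median_bdi_by(depr, 'Gender')`. Reporting the group size next to the median helps show how many respondents each comparison rests on.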