# **Predicting Depression: Machine Learning Challenge**

### **Data Download**


The dataset for this competition (both train and test) was gathered through a broad survey designed to explore the factors influencing depression risk in adults. Conducted anonymously from January to June 2023, the survey spanned multiple cities and included individuals from diverse professional and demographic backgrounds. Participants, aged between 18 and 60, voluntarily shared information on factors such as age, gender, location, education, job satisfaction, study/work hours, and family medical history. No professional mental health evaluations or diagnostic tests were involved.

The primary variable, 'Depression', indicates whether an individual is at risk of depression, with 'Yes' or 'No' responses based on lifestyle and demographic inputs. The dataset is intended to help identify how everyday factors could be linked to mental health risks, making it a valuable resource for developing machine learning models focused on predicting mental health outcomes.

***Files***
- `train.csv` - the training dataset; the `Depression` column is the binary target (either 0 or 1)
- `test.csv` - the test dataset; your objective is to predict the `Depression` target for each row
- `sample_submission.csv` - a sample submission file in the correct format
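
As a rough sketch of how the three files relate, the helper below checks that a table of predictions has the same columns and row count as `sample_submission.csv`. The function name `matches_sample_format` is hypothetical, and the column names in the toy frames (`id`, `Depression`) are assumed for illustration; the real names come from the sample file itself.

```python
import pandas as pd


def matches_sample_format(submission: pd.DataFrame, sample: pd.DataFrame) -> bool:
    """Return True if `submission` has the same columns (in order) and row count as `sample`."""
    same_columns = list(submission.columns) == list(sample.columns)
    same_length = len(submission) == len(sample)
    return same_columns and same_length


# Toy frames standing in for sample_submission.csv and a model's predictions.
# The column names here are assumed, not taken from the competition files.
sample = pd.DataFrame({"id": [1, 2, 3], "Depression": [0, 0, 0]})
predictions = pd.DataFrame({"id": [1, 2, 3], "Depression": [1, 0, 1]})
print(matches_sample_format(predictions, sample))
```

In practice `sample` would be read from the extracted `sample_submission.csv` once the download steps have been run.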

In [1]:
import numpy as np
import pandas as pd
import json

In [2]:
# Load the Kaggle API credentials (username and key) from a local JSON file
with open('../data/kaggle/kaggle.json') as f:
    data = json.load(f)
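
A credentials file with a missing field only fails later, at authentication time. One possible way to fail earlier is sketched below; the helper name `load_kaggle_credentials` is hypothetical, and the demo writes a throwaway file with placeholder values rather than reading real credentials.

```python
import json
import os
import tempfile


def load_kaggle_credentials(path):
    """Read a kaggle.json-style file and return (username, key)."""
    with open(path) as f:
        creds = json.load(f)
    # Report missing fields up front instead of failing during authentication
    missing = [field for field in ("username", "key") if field not in creds]
    if missing:
        raise KeyError(f"{path} is missing fields: {missing}")
    return creds["username"], creds["key"]


# Demo with a throwaway file holding placeholder values (not real credentials)
with tempfile.TemporaryDirectory() as tmp:
    demo_path = os.path.join(tmp, "kaggle.json")
    with open(demo_path, "w") as f:
        json.dump({"username": "example_user", "key": "example_key"}, f)
    username, key = load_kaggle_credentials(demo_path)

print(username)
```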

In [3]:
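import os

# Expose the credentials as environment variables, which the Kaggle client reads when authenticating
os.environ['KAGGLE_USERNAME'] = data['username']
os.environ['KAGGLE_KEY'] = data['key']

# Imported after the variables are set so the client can find the credentials
from kaggle.api.kaggle_api_extended import KaggleApi

api = KaggleApi()
api.authenticate()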

### **Depression Prediction Dataset Download**

In [4]:
api.competition_download_files('predicting-depression-machine-learning-challenge', path='../data/kaggle')
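
Re-running the cell above downloads the archive again each time. A minimal sketch of skipping the download when the archive is already on disk follows. It assumes the archive is saved as `<competition>.zip` inside the target directory, which matches the path used in the extraction step. The helper `download_if_missing` and the `_StubApi` class are hypothetical; the stub only stands in for the authenticated client so the sketch runs offline.

```python
import os
import tempfile


def download_if_missing(api, competition, dest_dir):
    """Download the competition archive unless <dest_dir>/<competition>.zip already exists.

    Returns True if a download was triggered, False if the archive was already present.
    """
    zip_path = os.path.join(dest_dir, f"{competition}.zip")
    if os.path.exists(zip_path):
        return False
    os.makedirs(dest_dir, exist_ok=True)
    api.competition_download_files(competition, path=dest_dir)
    return True


class _StubApi:
    """Offline stand-in for the authenticated client; it only creates an empty archive file."""

    def __init__(self):
        self.calls = 0

    def competition_download_files(self, competition, path):
        self.calls += 1
        open(os.path.join(path, f"{competition}.zip"), "wb").close()


with tempfile.TemporaryDirectory() as tmp:
    stub = _StubApi()
    first = download_if_missing(stub, "demo-competition", tmp)   # triggers a download
    second = download_if_missing(stub, "demo-competition", tmp)  # archive already present

print(first, second, stub.calls)
```

With the real client, the call would be `download_if_missing(api, 'predicting-depression-machine-learning-challenge', '../data/kaggle')`.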

### **Extract the Dataset**

In [5]:
import zipfile

with zipfile.ZipFile("../data/kaggle/predicting-depression-machine-learning-challenge.zip", 'r') as zip_ref:
    zip_ref.extractall("../data/extracted_data")
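
Once the archive is extracted, the files listed earlier can be read with pandas. The sketch below assumes `train.csv` and `test.csv` sit directly in the extraction directory; the helper name `load_competition_data` is hypothetical, and the demo uses toy files with assumed `id` and `Age` columns so that it runs without the real data.

```python
import os
import tempfile

import pandas as pd


def load_competition_data(data_dir):
    """Read train.csv and test.csv from `data_dir` and return them as (train, test)."""
    train = pd.read_csv(os.path.join(data_dir, "train.csv"))
    test = pd.read_csv(os.path.join(data_dir, "test.csv"))
    if "Depression" not in train.columns:
        raise ValueError("train.csv is expected to contain the 'Depression' target column")
    return train, test


# Offline demo with toy files; the 'id' and 'Age' columns are assumed for illustration
with tempfile.TemporaryDirectory() as tmp:
    pd.DataFrame({"id": [1, 2, 3], "Age": [25, 40, 31], "Depression": [0, 1, 0]}).to_csv(
        os.path.join(tmp, "train.csv"), index=False
    )
    pd.DataFrame({"id": [4, 5], "Age": [52, 19]}).to_csv(
        os.path.join(tmp, "test.csv"), index=False
    )
    train, test = load_competition_data(tmp)

print(train.shape, test.shape)
print(train["Depression"].value_counts().to_dict())
```

For the real data, the call would be `train, test = load_competition_data("../data/extracted_data")`; checking the target's value counts right away shows how balanced the two classes are.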