🚀 Addiction Risk Prediction

This repository provides a comprehensive machine learning pipeline designed to predict addiction risk using behavioral, psychological, and demographic data. The solution empowers early identification of high-risk individuals and supports data-driven intervention strategies.

🎯 Project Objectives

The goal of this project is to predict the likelihood of addiction (Yes/No) based on user-specific inputs including:

Substance usage (Alcohol, Cannabis, Tobacco, etc.)
Age of first use and current age
Frequency of usage
Reported stress levels and diagnosed mental health conditions
Coping mechanisms and presence of support systems

Through this, we aim to:

Understand patterns of substance exposure, especially in adolescence or pre-teen years
Explore behavioral risk factors contributing to addiction
Enable predictive modeling to assist mental health professionals

📊 Dataset Overview

The dataset was collected firsthand using Google Forms, ensuring authentic, real-world insights into individual behavioral patterns.

📎 Google Form Link (Original Survey)

📁 Files Provided

Dataset.xlsx – Raw collected data
Cleaned_Encoded_Dataset.xlsx – Preprocessed dataset used for modeling
New.xlsx – Data used for inference or testing predictions

⚙️ How to Run the Project

✅ Step-by-Step Instructions

Launch the Notebook
- Open Addiction_Risk_Prediction.ipynb
- Click “Open in Colab” for browser-based execution
Upload the Required Files
- Download and upload the following to Colab:
  - Dataset.xlsx
  - Cleaned_Encoded_Dataset.xlsx
  - New.xlsx
Run the Notebook
- Execute each cell sequentially to perform:
  - Data preprocessing
  - Feature engineering
  - Model training and evaluation
  - Final binary prediction (Addiction: Yes / No)

🤖 Machine Learning Approach

The project initially uses multiclass classification on the substances_used variable, which categorizes different types of substances. Post-training, the focus shifts to a binary classification task to determine whether an individual is at risk of addiction.

🔍 Models Benchmarked

Random Forest ✅ (Best Performing Model)
Support Vector Machine (SVM)
XGBoost
Logistic Regression
K-Nearest Neighbors
Decision Tree
Naive Bayes

🧠 Ensemble Model

We also developed a stacked ensemble model to boost performance:

Base Models: Random Forest, SVM, XGBoost
Meta-Learner: Logistic Regression

Performance was evaluated using:

Accuracy
Precision
Recall
F1-score

📌 Real-World Insight

Our findings suggest that:

Individuals are often exposed to addictive substances during pre-teen and adolescent years
Stress levels, inadequate coping mechanisms, and mental health diagnoses are significant predictors
Early intervention can be guided using such predictive models

📈 Future Roadmap

Deploy as a REST API or interactive dashboard
Integrate interpretability tools (e.g., SHAP, LIME)
Automate data ingestion pipelines for real-time risk prediction
Extend dataset for temporal analysis or time series modeling

📬 Contact

For contributions, queries, or to collaborate on health-tech initiatives, reach out via GitHub Issues.

📎 License

This project is licensed under the MIT License. Please review the LICENSE file for details.

Developed with an intent to support mental health awareness, data literacy, and predictive analytics in public health.

Name		Name	Last commit message	Last commit date
Latest commit History 4 Commits
Addiction_Risk_Prediction.ipynb		Addiction_Risk_Prediction.ipynb
Cleaned_Dataset_Encoded.xlsx		Cleaned_Dataset_Encoded.xlsx
Dataset.xlsx		Dataset.xlsx
New.xlsx		New.xlsx
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

🚀 Addiction Risk Prediction

🎯 Project Objectives

📊 Dataset Overview

📁 Files Provided

⚙️ How to Run the Project

✅ Step-by-Step Instructions

🤖 Machine Learning Approach

🔍 Models Benchmarked

🧠 Ensemble Model

📌 Real-World Insight

📈 Future Roadmap

📬 Contact

📎 License

About

Uh oh!

Releases

Packages

Languages

CodEEBuzZ/Addiction-Risk-Prediction-Using-Python

Folders and files

Latest commit

History

Repository files navigation

🚀 Addiction Risk Prediction

🎯 Project Objectives

📊 Dataset Overview

📁 Files Provided

⚙️ How to Run the Project

✅ Step-by-Step Instructions

🤖 Machine Learning Approach

🔍 Models Benchmarked

🧠 Ensemble Model

📌 Real-World Insight

📈 Future Roadmap

📬 Contact

📎 License

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages