This project analyzes mobile device usage patterns to classify users into behavior classes (from light to extreme usage). It covers the full data science lifecycleβfrom data cleaning and EDA to building and deploying a machine learning model.
To explore, model, and predict user behavior based on mobile device usage data using a Random Forest Classifier and visualize insights through a Power BI dashboard and Flask web app.
- Source: Kaggle Dataset
- Records: 700 user entries
- Features:
- Device Model
- Operating System
- App Usage Time (minutes/day)
- Screen On Time (hours/day)
- Battery Drain (mAh/day)
- Number of Installed Apps
- Data Usage (MB/day)
- Age
- Gender
- User Behavior Class (Target Variable)
- Data Inspection β Overview of features, missing values, data types
- Data Cleaning β Removing duplicates, handling outliers
- Preprocessing β Label encoding categorical data
- Exploratory Data Analysis (EDA) β Visualizations and insights on usage patterns
- Feature Correlation β Using heatmaps to study relationships
- Model Building β RandomForestClassifier (scikit-learn)
- Model Evaluation β Accuracy, classification report, confusion matrix
- Deployment β Flask web app with Heroku
- Dashboard β Power BI report for visualization
- The classifier demonstrated high accuracy and performance across all metrics.
- Strong correlation observed between screen-on time, app usage, battery drain, and behavior class.
- Flask app allows live user behavior class prediction based on inputs.
- Power BI dashboard provides dynamic data exploration and insights.
- π Power BI Dashboard: Power BI Report (If Available)
| Purpose | Tool/Library |
|---|---|
| Data Manipulation | Pandas |
| Data Visualization | Matplotlib, Seaborn |
| Machine Learning | Scikit-learn |
| Web Deployment | Flask, Heroku |
| Dashboard Visualization | Power BI |