<a href="https://colab.research.google.com/github/Ash99-commits/ML_Health_Insurance_Customers_CrossSell_Prediction/blob/main/Health2Vehicle_CrossSell_Insurance_Prediction_by_Ashwani_Kumar_Patra.ipynb" target="_parent"><img src="https://colab.research.google.com/assets/colab-badge.svg" alt="Open In Colab"/></a>

# Project Name: **Health Insurance Customers Cross-Sell Prediction (Classification)**

# Problem Statement

Our client, a leading insurance provider, wants to broaden its portfolio by offering Vehicle Insurance to its existing Health Insurance customers. The key business challenge is identifying which policyholders are most likely to buy the new product. Addressing it requires a predictive, data-driven approach that forecasts customer interest and guides targeted marketing.

The project develops a machine learning model that estimates the probability that a customer will purchase Vehicle Insurance. The model uses demographic details (age, gender, region), vehicle-related factors (vehicle age, damage history), and attributes of the customer's current health insurance policy (annual premium, sales channel, and vintage). By uncovering patterns in these attributes, the model aims to identify the customers with the highest conversion potential.

# Project Summary

The project follows a structured process:

* Exploratory Data Analysis (EDA): To extract meaningful insights about customer behavior and product engagement.

* Feature Engineering: To enhance and refine data for optimal predictive power.

* Model Development & Validation: To build a reliable predictive analytics solution for business use.

Upon deployment, the solution is expected to improve marketing efficiency by directing outreach to the customers most likely to respond. This should reduce acquisition costs, improve the customer experience, and raise revenue through higher conversion rates.

Ultimately, this project delivers a scalable, data-driven decision-support system that empowers the insurance company to make informed, targeted, and impactful marketing decisions for its Vehicle Insurance expansion strategy.

# Dataset Features/Columns Description

**id:** Identifier for each customer.

**Gender:** Gender of the policyholder.

**Age:** Age of the policyholder.

**Driving_License:** Indicates whether the policyholder has a driving license (1 for yes, 0 for no).

**Region_Code:** Coded representation of the region of the policyholder.

**Previously_Insured:** Indicates whether the policyholder already has vehicle insurance (1 for yes, 0 for no).

**Vehicle_Age:** Age bracket of the vehicle, stored as text (for example `< 1 Year`, `1-2 Year`, `> 2 Years`).

**Vehicle_Damage:** Indicates whether the vehicle was previously damaged (Yes or No).

**Annual_Premium:** The amount of premium paid annually by the policyholder.

**Policy_Sales_Channel:** Code for the channel of outreach to the policyholder.

**Vintage:** Number of days the policyholder has been associated with the company.

**Response:** Target variable indicating whether the policyholder is interested in vehicle insurance (1 for interested, 0 for not interested).
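As a minimal sketch of how the text-valued columns described above might be prepared for a classifier, the snippet below encodes `Gender`, `Vehicle_Age`, and `Vehicle_Damage` as integers. The five rows are copied from this notebook's `data.head()` output; the mapping dictionaries and the names `sample`, `encoded`, and `VEHICLE_AGE_ORDER` are illustrative assumptions, not necessarily the encoding used later in the project.

```python
import pandas as pd

# Five rows copied from the data.head() output in this notebook.
sample = pd.DataFrame({
    "Gender": ["Male", "Male", "Male", "Male", "Female"],
    "Vehicle_Age": ["> 2 Years", "1-2 Year", "> 2 Years", "< 1 Year", "< 1 Year"],
    "Vehicle_Damage": ["Yes", "No", "Yes", "No", "No"],
    "Response": [1, 0, 1, 0, 0],
})

# Assumed ordinal mapping: older vehicles get larger codes.
VEHICLE_AGE_ORDER = {"< 1 Year": 0, "1-2 Year": 1, "> 2 Years": 2}

# Replace each text column with its integer code; other columns are untouched.
encoded = sample.assign(
    Gender=sample["Gender"].map({"Female": 0, "Male": 1}),
    Vehicle_Age=sample["Vehicle_Age"].map(VEHICLE_AGE_ORDER),
    Vehicle_Damage=sample["Vehicle_Damage"].map({"No": 0, "Yes": 1}),
)

# Share of interested customers in the sample (mean of a 0/1 target).
response_rate = encoded["Response"].mean()
print(encoded)
print(f"Response rate in sample: {response_rate:.2f}")
```

Any category missing from a mapping dictionary becomes `NaN` under `Series.map`, so it is worth checking `encoded.isna().sum()` before applying a mapping like this to the full dataset.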

# Importing Data

In [1]:
# Importing the necessary libraries

import pandas as pd
import numpy as np
import matplotlib.pyplot as plt
import seaborn as sns

In [2]:
# Importing the dataset from my GitHub repository

data = pd.read_csv('https://raw.githubusercontent.com/Ash99-commits/ML_Health_Insurance_Customers_CrossSell_Prediction/main/Health_Insurance_Customers_Dataset.csv')

In [None]:
# Peeking at the first five rows of the dataset

data.head()

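| Unnamed: 0 | id | Gender | Age | Driving_License | Region_Code | Previously_Insured | Vehicle_Age | Vehicle_Damage | Annual_Premium | Policy_Sales_Channel | Vintage | Response |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 1 | Male | 44 | 1 | 28.0 | 0 | > 2 Years | Yes | 40454.0 | 26.0 | 217 | 1 |
| 1 | 2 | Male | 76 | 1 | 3.0 | 0 | 1-2 Year | No | 33536.0 | 26.0 | 183 | 0 |
| 2 | 3 | Male | 47 | 1 | 28.0 | 0 | > 2 Years | Yes | 38294.0 | 26.0 | 27 | 1 |
| 3 | 4 | Male | 21 | 1 | 11.0 | 1 | < 1 Year | No | 28619.0 | 152.0 | 203 | 0 |
| 4 | 5 | Female | 29 | 1 | 41.0 | 1 | < 1 Year | No | 27496.0 | 152.0 | 39 | 0 |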
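The `Unnamed: 0` column in the output above looks like a row index that was written into the CSV when the file was saved. That is an inference from the output, not something the source states. The sketch below shows two ways it could be handled, assuming the file's first header field is empty (which is how pandas arrives at the name `Unnamed: 0`). It uses an inline two-row copy of the data, held in the illustrative variable `csv_text`, so that it runs without network access.

```python
import io
import pandas as pd

# Two rows copied from the data.head() output, including the stray index column.
# The empty first header field is an assumption about the raw file.
csv_text = (
    ",id,Gender,Age,Driving_License,Region_Code,Previously_Insured,"
    "Vehicle_Age,Vehicle_Damage,Annual_Premium,Policy_Sales_Channel,Vintage,Response\n"
    "0,1,Male,44,1,28.0,0,> 2 Years,Yes,40454.0,26.0,217,1\n"
    "1,2,Male,76,1,3.0,0,1-2 Year,No,33536.0,26.0,183,0\n"
)

# Option 1: read the file as-is, then drop the column after loading.
raw = pd.read_csv(io.StringIO(csv_text))
dropped = raw.drop(columns=["Unnamed: 0"])

# Option 2: tell pandas at read time that the first column is the index.
indexed = pd.read_csv(io.StringIO(csv_text), index_col=0)

print(list(dropped.columns))
print(list(indexed.columns))
```

Both options leave the same twelve data columns. With the real file, the same `index_col=0` argument could be passed to the `pd.read_csv` call that loads the dataset.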
