## Alphagoal Customer Subscription Model

### Business Understanding

#### Problem Statement
Current marketing strategies often involve reaching out to a large pool of clients without a clear understanding of their likelihood to subscribe. This approach results in wasted resources and low conversion rates. A predictive model can provide a data-driven solution to identify potential subscribers more effectively.

#### Objectives
- The primary business goal is to enhance the effectiveness of marketing campaigns for term deposits by predicting the likelihood of a client subscribing to one. By leveraging operational data, the organization aims to target the most promising clients, thereby improving campaign efficiency, reducing costs, and increasing subscription rates.
- The key objective is to build a machine learning model that predicts whether a given customer will subscribe to a service, based on their activities.

#### Key Stakeholders

- Marketing Team (End Users of the Product): Use insights from the model to design targeted marketing campaigns, optimize outreach efforts, and monitor campaign performance to increase subscription rates and ensure customer satisfaction.

- Sales Team: Leverage model predictions to prioritize leads and improve follow-up efficiency to enhance client engagement and close deals effectively.

- Data Science Team: Develop and maintain the predictive model. Collect, clean, and analyze data; build and validate the model; and provide actionable insights to ensure technical accuracy, scalability, and usability of the model.

- IT and Data Engineering Team: Support system integration and data pipelines. Provide infrastructure, ensure data availability, and integrate the model into existing systems to maintain smooth technical operations and scalability.

- Executive Management: Decision-makers and project sponsors. Define strategic objectives, allocate resources, and monitor project impact to align the project outcomes with business goals, such as cost reduction and revenue growth.

#### Success Metrics
- Business Success: Increase in term deposit subscriptions and reduced marketing costs.
- Analytical Success: A predictive model that reaches at least 90% accuracy, precision, and recall when identifying potential subscribers.
- Deployment Success: Integration of the predictive model into the marketing workflow to guide decision-making.
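
The analytical target above can be checked directly from a confusion matrix. The following is a minimal sketch, assuming made-up counts and two hypothetical helpers (`classification_metrics`, `meets_target`) that are not part of the project. It only illustrates how the three metrics are computed and compared against the 90% threshold.

```python
def classification_metrics(tp: int, fp: int, fn: int, tn: int) -> dict:
    """Compute accuracy, precision, and recall from confusion-matrix counts."""
    total = tp + fp + fn + tn
    return {
        "accuracy": (tp + tn) / total if total else 0.0,
        "precision": tp / (tp + fp) if (tp + fp) else 0.0,
        "recall": tp / (tp + fn) if (tp + fn) else 0.0,
    }


def meets_target(metrics: dict, threshold: float = 0.90) -> bool:
    """Return True only if every metric reaches the threshold."""
    return all(value >= threshold for value in metrics.values())


# Hypothetical counts for illustration only (not project results).
metrics = classification_metrics(tp=90, fp=10, fn=30, tn=870)
target_met = meets_target(metrics)
```

With these illustrative counts, accuracy is high while recall falls short, so the target is not met. This is why the three metrics are checked together rather than accuracy alone.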

#### Hypothesis Statement
- **Null Hypothesis (H_0)**: The number of campaigns sent to a client does not significantly impact their likelihood of subscribing to a term deposit.
- **Alternative Hypothesis (H_a)**: The number of campaigns sent to a client significantly impacts their likelihood of subscribing to a term deposit.
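
One way to test these hypotheses is a chi-square test of independence. The sketch below is an assumption about how the test could be set up, not the project's chosen method: it splits clients into two groups (contacted once versus contacted more than once) and uses made-up counts. The helper `chi_square_2x2` is hypothetical and relies only on the standard library.

```python
import math


def chi_square_2x2(a: int, b: int, c: int, d: int) -> tuple[float, float]:
    """Pearson chi-square test of independence for a 2x2 table.

    Table layout (rows = contact group, columns = outcome):
        [[a, b],   # group 1: subscribed, not subscribed
         [c, d]]   # group 2: subscribed, not subscribed
    Returns (statistic, p_value) for 1 degree of freedom,
    without continuity correction.
    """
    n = a + b + c + d
    row1, row2 = a + b, c + d
    col1, col2 = a + c, b + d
    if 0 in (row1, row2, col1, col2):
        raise ValueError("every row and column needs at least one observation")
    stat = n * (a * d - b * c) ** 2 / (row1 * row2 * col1 * col2)
    # For 1 degree of freedom, the chi-square survival function
    # reduces to erfc(sqrt(stat / 2)).
    p_value = math.erfc(math.sqrt(stat / 2))
    return stat, p_value


# Hypothetical counts for illustration only:
# contacted once: 150 subscribed, 850 did not
# contacted more than once: 100 subscribed, 900 did not
stat, p_value = chi_square_2x2(150, 850, 100, 900)
reject_null = p_value < 0.05
```

Collapsing `campaign` into two groups discards information. A logistic regression on the raw contact count is an alternative that keeps it.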

#### Business Questions
1. How does the number of campaigns affect the likelihood of a client subscribing to a term deposit?
2. What client characteristics (e.g., age, job, marital status, education) are most strongly associated with subscribing to a term deposit?
3. Does the timing of the campaigns (e.g., contact day, month) influence subscription rates?
4. Is there a diminishing return on additional campaigns (i.e., after a certain point, additional campaigns do not increase subscription rates)?
5. How do past campaign outcomes (poutcome) affect the likelihood of subscription in future campaigns?
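
Questions 1 and 4 can be explored by tabulating the subscription rate for each value of `campaign`. The following is a minimal sketch under stated assumptions: rows are dictionaries with string values (as produced by `csv.DictReader`), the helper name `subscription_rate_by_campaign` and the `cap` parameter are hypothetical, and the sample rows are made up.

```python
from collections import defaultdict


def subscription_rate_by_campaign(rows, cap=5):
    """Return {contact_count: subscription_rate}.

    Contact counts of `cap` or more are pooled into the `cap` bucket
    so that sparse high counts do not produce noisy rates.
    """
    totals = defaultdict(int)
    subscribed = defaultdict(int)
    for row in rows:
        bucket = min(int(row["campaign"]), cap)
        totals[bucket] += 1
        if row["y"] == "yes":
            subscribed[bucket] += 1
    return {k: subscribed[k] / totals[k] for k in sorted(totals)}


# Made-up rows for illustration only.
sample = [
    {"campaign": "1", "y": "yes"},
    {"campaign": "1", "y": "no"},
    {"campaign": "2", "y": "no"},
    {"campaign": "2", "y": "no"},
    {"campaign": "2", "y": "yes"},
    {"campaign": "2", "y": "no"},
    {"campaign": "7", "y": "no"},
]
rates = subscription_rate_by_campaign(sample)
```

A rate that flattens or falls as the contact count grows would be consistent with diminishing returns, although a pattern in observational data like this does not by itself establish a causal effect.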

#### Source of Data 
There are four datasets:
- bank-additional-full.csv with all examples (41,188) and 20 inputs, ordered by date (from May 2008 to November 2010), very close to the data analyzed.
- bank-additional.csv with 10% of the examples (4,119), randomly selected from bank-additional-full.csv, and 20 inputs.
- bank-full.csv with all examples and 17 inputs, ordered by date (an older version of the dataset with fewer inputs).
- bank.csv with 10% of the examples and 17 inputs, randomly selected from bank-full.csv (an older version of the dataset with fewer inputs).
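
The files can be read with the standard `csv` module. The sketch below assumes the local copies are semicolon-delimited with quoted text fields, as in the UCI distribution of this dataset. Adjust `delimiter` if they are not. The helper `load_bank_rows` and the inline sample are hypothetical.

```python
import csv
import io


def load_bank_rows(source, delimiter=";"):
    """Read a bank marketing CSV from an open text file into a list of dicts."""
    return list(csv.DictReader(source, delimiter=delimiter))


# Small inline sample standing in for a real file (illustrative values).
sample_text = (
    '"age";"job";"campaign";"y"\n'
    '58;"management";1;"no"\n'
    '33;"technician";2;"yes"\n'
)
rows = load_bank_rows(io.StringIO(sample_text))

# With a real file (path is an assumption):
# with open("bank-full.csv", newline="", encoding="utf-8") as f:
#     rows = load_bank_rows(f)
```

All values come back as strings, so numeric columns still need to be converted before analysis.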
#### Data Dictionary


| **Variable Name** | **Description**                                                  | **Data Type** | **Possible Values**                                                                                              |
|--------------------|------------------------------------------------------------------|---------------|------------------------------------------------------------------------------------------------------------------|
| **age**            | Age of the client                                               | Numeric       | Any positive integer (e.g., 18, 35, etc.)                                                                        |
| **job**            | Type of job                                                     | Categorical   | "admin.", "unknown", "unemployed", "management", "housemaid", "entrepreneur", "student", "blue-collar", "self-employed", "retired", "technician", "services" |
| **marital**        | Marital status of the client                                    | Categorical   | "married", "divorced", "single"                                                                                  |
| **education**      | Level of education                                              | Categorical   | "unknown", "secondary", "primary", "tertiary"                                                                    |
| **default**        | Has credit in default?                                          | Binary        | "yes", "no"                                                                                                      |
| **balance**        | Average yearly balance of the client (in euros)                 | Numeric       | Any real number (e.g., -500, 0, 1500)                                                                            |
| **housing**        | Does the client have a housing loan?                            | Binary        | "yes", "no"                                                                                                      |
| **loan**           | Does the client have a personal loan?                           | Binary        | "yes", "no"                                                                                                      |
| **contact**        | Contact communication type used                                 | Categorical   | "unknown", "telephone", "cellular"                                                                               |
| **day**            | Last contact day of the month                                   | Numeric       | 1–31                                                                                                             |
| **month**          | Last contact month of the year                                  | Categorical   | "jan", "feb", "mar", ..., "nov", "dec"                                                                           |
| **duration**       | Duration of the last contact, in seconds                        | Numeric       | Any non-negative integer (e.g., 0, 120, 300)                                                                     |
| **campaign**       | Number of contacts performed during this campaign               | Numeric       | Any positive integer (e.g., 1, 2, 10)                                                                            |
| **pdays**          | Number of days since the client was last contacted in a previous campaign | Numeric       | -1 (not previously contacted), or any positive integer (e.g., 5, 30, etc.)                                       |
| **previous**       | Number of contacts performed before this campaign               | Numeric       | Any non-negative integer (e.g., 0, 1, 5)                                                                         |
| **poutcome**       | Outcome of the previous marketing campaign                      | Categorical   | "unknown", "other", "failure", "success"                                                                         |
| **y**              | Target variable: has the client subscribed to a term deposit?   | Binary        | "yes", "no"                                                                                                      |
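
The data dictionary above implies a simple typing step before modeling. The following is a minimal sketch, assuming each record arrives as a dictionary of strings. The helper `encode_row` and the derived field `previously_contacted` are hypothetical and only illustrate how the numeric columns, the yes/no columns, and the `pdays` sentinel of -1 could be handled.

```python
NUMERIC_INT = ("age", "day", "duration", "campaign", "pdays", "previous")
BINARY = ("default", "housing", "loan", "y")
YES_NO = {"yes": 1, "no": 0}


def encode_row(row: dict) -> dict:
    """Convert one raw record (all strings) into typed values."""
    out = dict(row)
    for name in NUMERIC_INT:
        out[name] = int(row[name])
    # balance is described as a real number, so keep it as a float.
    out["balance"] = float(row["balance"])
    for name in BINARY:
        # Raises KeyError on any value other than "yes" or "no".
        out[name] = YES_NO[row[name]]
    # pdays uses -1 as a sentinel for "not previously contacted".
    out["previously_contacted"] = int(out["pdays"] != -1)
    return out


# Made-up record for illustration only.
raw = {
    "age": "41", "job": "technician", "marital": "married",
    "education": "secondary", "default": "no", "balance": "-120",
    "housing": "yes", "loan": "no", "contact": "cellular",
    "day": "5", "month": "may", "duration": "180", "campaign": "2",
    "pdays": "-1", "previous": "0", "poutcome": "unknown", "y": "no",
}
encoded = encode_row(raw)
```

Categorical columns such as `job` and `month` are left as strings here. How they are encoded depends on the model chosen later.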



#### Imports

```python
print("hello")
```

```
hello
```


#### Data Understanding

#### Data Preparation

#### Modeling & Evaluation

#### Persist Model