# Build a Scoring Model - OpenClassrooms Project P4 - AI Engineer

### To Grant Consumer Loans

To grant consumer loans, banks use several variables to assess the risk of customer default. This section lists the most important variables and the ratios commonly computed by banks for this type of financing, then summarizes regulatory constraints such as the usury rate and maximum credit limits.

#### Relevant Variables for Consumer Credit Scoring

**Socio-Demographic Variables:**
- **SK_ID_CURR:** Unique identifier of the client.
- **CODE_GENDER:** Gender of the client.
- **DAYS_BIRTH:** Age of the client in days at the time of application, stored as a negative number.
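
Because `DAYS_BIRTH` is a negative day count, it is usually converted into a positive age in years before analysis. The following is a minimal sketch of that conversion; the helper name `age_in_years` and the 365.25-day year are assumptions made for illustration and are not part of the dataset.

```python
DAYS_PER_YEAR = 365.25  # assumption: average year length, leap years included


def age_in_years(days_birth: float) -> float:
    """Convert a negative day offset (DAYS_BIRTH) into a positive age in years."""
    return -days_birth / DAYS_PER_YEAR


# Made-up value: a client born 12,000 days before the application.
print(round(age_in_years(-12000), 1))
```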

**Financial Variables:**
- **AMT_INCOME_TOTAL:** Total income of the client.
- **AMT_CREDIT:** Amount of credit requested.
- **AMT_ANNUITY:** Credit annuity.
- **AMT_GOODS_PRICE:** Price of goods for which the credit is requested.
- **NAME_INCOME_TYPE:** Type of income of the client.
- **NAME_FAMILY_STATUS:** Family status of the client.
- **NAME_HOUSING_TYPE:** Type of housing of the client.

**Stability Variables:**
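- **DAYS_EMPLOYED:** Number of days before the application that the client started their current job (negative). In the Kaggle data, clients without current employment typically appear with a large positive placeholder value (365243) rather than 0, so this column needs cleaning before use.
- **DAYS_REGISTRATION:** Number of days before the application that the client last changed their registration (negative).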
- **OCCUPATION_TYPE:** Type of employment of the client.

**Behavioral Variables:**
- **CNT_CHILDREN:** Number of children the client has.
- **CNT_FAM_MEMBERS:** Number of family members of the client.
- **FLAG_OWN_CAR:** Binary indicator of whether the client owns a car.
- **FLAG_OWN_REALTY:** Binary indicator of whether the client owns real estate.

**External Scores:**
- **EXT_SOURCE_1, EXT_SOURCE_2, EXT_SOURCE_3:** External scores from various sources.

#### Examples of Other Ratios and Calculations Used

**Relevant business ratios:**
- Amount borrowed / Price of the purchased good: `AMT_CREDIT / AMT_GOODS_PRICE`
- Amount of annuities / Amount borrowed: `AMT_ANNUITY / AMT_CREDIT`
- Amount of annuities / Annual income: `AMT_ANNUITY / AMT_INCOME_TOTAL`
- Employment duration / Age: `DAYS_EMPLOYED / DAYS_BIRTH`
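
The ratios above can be sketched as plain functions over one application record. This is only an illustration: the output names (`CREDIT_TO_GOODS` and so on) and the example values are made up, and returning `None` for an undefined ratio is one possible convention among others.

```python
from typing import Optional


def safe_ratio(numerator: Optional[float], denominator: Optional[float]) -> Optional[float]:
    """Return numerator / denominator, or None when the ratio is undefined."""
    if numerator is None or denominator is None or denominator == 0:
        return None
    return numerator / denominator


def credit_ratios(row: dict) -> dict:
    """Compute the four business ratios for one application record."""
    return {
        # Output keys are hypothetical names chosen for this sketch.
        "CREDIT_TO_GOODS": safe_ratio(row.get("AMT_CREDIT"), row.get("AMT_GOODS_PRICE")),
        "ANNUITY_TO_CREDIT": safe_ratio(row.get("AMT_ANNUITY"), row.get("AMT_CREDIT")),
        "ANNUITY_TO_INCOME": safe_ratio(row.get("AMT_ANNUITY"), row.get("AMT_INCOME_TOTAL")),
        "EMPLOYED_TO_AGE": safe_ratio(row.get("DAYS_EMPLOYED"), row.get("DAYS_BIRTH")),
    }


# Made-up record, for illustration only.
example = {
    "AMT_CREDIT": 400000.0,
    "AMT_GOODS_PRICE": 350000.0,
    "AMT_ANNUITY": 25000.0,
    "AMT_INCOME_TOTAL": 200000.0,
    "DAYS_EMPLOYED": -2000,
    "DAYS_BIRTH": -12000,
}
ratios = credit_ratios(example)
print(ratios)
```

Since `DAYS_EMPLOYED` and `DAYS_BIRTH` are both negative, their ratio is positive and reads as the share of the client's life spent in the current job. With a DataFrame library, the same ratios would be written as column-wise divisions.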


#### Regulatory Requirements in France

**Usury Rate:**
Banks in France must ensure that the proposed interest rate does not exceed the usury rate set by the Banque de France. The usury rate is calculated quarterly and is the maximum APR (annual percentage rate) allowed for loans, which includes the base interest rate, fees, and insurance costs. If the APR exceeds the usury rate, the loan is considered illegal and is subject to severe penalties, including fines and imprisonment.

**Credit Ceilings:**
Banks must respect maximum credit limits based on the type of credit and the client's profile. These limits are set to ensure that clients do not become over-indebted and to maintain financial stability.

**Repayment Capacity:**
Banks must assess the client's repayment capacity based on their income and expenses. The general rule is that the client's debt-to-income ratio should not exceed 33%, although this can be higher in exceptional cases. Loans should not be granted if the data shows that the client cannot reasonably repay the loan without financial hardship.
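
As a rough illustration of the repayment-capacity rule, the sketch below computes a debt-to-income ratio and compares it with the 33% guideline mentioned above. The function names, the monthly granularity, and the example amounts are assumptions; a real assessment would also account for living expenses and for the exceptional cases mentioned above.

```python
MAX_DEBT_TO_INCOME = 0.33  # the general 33% guideline described in the text


def debt_to_income(monthly_debt_payments: float, monthly_income: float) -> float:
    """Share of monthly income absorbed by debt repayments."""
    if monthly_income <= 0:
        raise ValueError("monthly_income must be positive")
    return monthly_debt_payments / monthly_income


def within_repayment_capacity(
    existing_payments: float,
    new_payment: float,
    monthly_income: float,
    limit: float = MAX_DEBT_TO_INCOME,
) -> bool:
    """True when existing plus new repayments stay at or below the limit."""
    return debt_to_income(existing_payments + new_payment, monthly_income) <= limit


# Made-up figures: 300 already repaid each month, 3,000 of monthly income.
print(within_repayment_capacity(300.0, 500.0, 3000.0))  # 800 / 3000 is below 33%
print(within_repayment_capacity(300.0, 800.0, 3000.0))  # 1100 / 3000 is above 33%
```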

### Current Usury Rate as of June 2024

As of June 2024, the usury rates for consumer loans are set quarterly by the Banque de France. The current usury rate for consumer loans depends on the amount borrowed and the loan duration. For precise rates, you can refer to the latest publication by the Banque de France or consult their [official website](https://www.banque-france.fr).

The variables listed above do not provide the usury rate. They supply some of the inputs (such as the credit amount and the annuity) needed to calculate a loan's APR (annual percentage rate), but they are not sufficient on their own. The APR combines the nominal interest rate, fees, and insurance costs, and it must be compared against the usury threshold to ensure compliance.
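
The compliance check can be sketched as a lookup of the applicable ceiling followed by a comparison with the APR. The amount brackets and the ceilings below are placeholders invented for this example, not official figures; the real values must be taken from the Banque de France publication for the relevant quarter.

```python
from bisect import bisect_left

# Hypothetical brackets and ceilings, for illustration only.
BRACKET_UPPER_BOUNDS = [3000.0, 6000.0]  # upper bound of each amount bracket, in euros
USURY_CEILINGS = [0.20, 0.15, 0.10]      # one ceiling per bracket, plus one for larger loans


def usury_ceiling(amount: float) -> float:
    """Return the (placeholder) usury ceiling applicable to a loan amount."""
    return USURY_CEILINGS[bisect_left(BRACKET_UPPER_BOUNDS, amount)]


def is_compliant(apr: float, amount: float) -> bool:
    """True when the APR does not exceed the ceiling for this amount."""
    return apr <= usury_ceiling(amount)


print(is_compliant(0.12, 5000.0))   # compared with the middle-bracket ceiling
print(is_compliant(0.12, 10000.0))  # compared with the large-loan ceiling
```

The loan duration, which the text also mentions as a factor, is left out of this sketch to keep the lookup one-dimensional.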

### Calculating the Interest Rate (APR) of a Loan

From the provided variables, it is not possible to directly determine the interest rate of the loan. The listed variables are primarily socio-demographic, financial, and behavioral characteristics of the clients, as well as stability indicators and external scores. To calculate the interest rate of a loan, we need specific information about the terms of the loan itself, such as the total amount to be repaid, associated fees, and the duration of the loan.
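
To make the missing piece concrete, the sketch below recovers a periodic interest rate from a principal, a constant payment, and a number of payments, using the standard constant-annuity formula and a bisection search. The number of payments is exactly the input that the listed variables do not provide. The function names are hypothetical, the result is a nominal periodic rate rather than an APR (fees and insurance are ignored), and the search is arbitrarily bounded to rates between 0% and 100% per period.

```python
def annuity_payment(principal: float, rate: float, n_payments: int) -> float:
    """Constant payment that repays `principal` over `n_payments` periods."""
    if rate == 0:
        return principal / n_payments
    return principal * rate / (1 - (1 + rate) ** -n_payments)


def implied_periodic_rate(
    principal: float, payment: float, n_payments: int, tol: float = 1e-10
) -> float:
    """Find by bisection the periodic rate whose annuity equals `payment`."""
    if payment * n_payments < principal:
        raise ValueError("total repayments are below the principal")
    low, high = 0.0, 1.0  # assumption: the rate lies between 0% and 100% per period
    while high - low > tol:
        mid = (low + high) / 2
        # The annuity grows with the rate, so a too-small annuity means the rate is too low.
        if annuity_payment(principal, mid, n_payments) < payment:
            low = mid
        else:
            high = mid
    return (low + high) / 2


# Made-up loan: 10,000 repaid in 24 monthly payments at 1% per month.
payment = annuity_payment(10000.0, 0.01, 24)
monthly_rate = implied_periodic_rate(10000.0, payment, 24)
print(round(monthly_rate, 6))
```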

**<span style="color:red;">This information is missing from the dataset; in a professional context, its absence would prevent accurate loan scoring.</span>**


## Load Data

The data used for this project come from Kaggle and are available in the competition [Home Credit Default Risk - Can you predict how capable each applicant is of repaying a loan?]

### Data Description
The data come from several sources and relate to clients' loan applications. They are split across several tables containing information about the client, the loan, and the client's previous credit history. The key elements are:
- application_train.csv: the main table, containing information about the client's loan application.
- SK_ID_CURR: the unique identifier of the client, used to link the tables.

![Home Credit data relations](https://storage.googleapis.com/kaggle-media/competitions/home-credit/home_credit.png)
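
The way `SK_ID_CURR` links the main table to a credit-history table can be sketched with the standard library alone. The two CSV snippets below are tiny in-memory stand-ins with made-up rows, and the derived field `N_PREVIOUS_CREDITS` is a hypothetical name; with the real files, the same logic would read from disk instead of from strings.

```python
import csv
import io
from collections import defaultdict

# Made-up rows standing in for the main table and a credit-history table.
APPLICATIONS_CSV = """SK_ID_CURR,AMT_CREDIT
100001,400000
100002,150000
"""
PREVIOUS_CREDITS_CSV = """SK_ID_CURR,AMT_CREDIT_SUM
100001,50000
100001,20000
"""


def read_rows(text: str) -> list:
    """Parse CSV text into a list of dictionaries keyed by column name."""
    return list(csv.DictReader(io.StringIO(text)))


def count_by_client(rows: list) -> dict:
    """Count how many rows each SK_ID_CURR has in a secondary table."""
    counts = defaultdict(int)
    for row in rows:
        counts[row["SK_ID_CURR"]] += 1
    return counts


applications = read_rows(APPLICATIONS_CSV)
history_counts = count_by_client(read_rows(PREVIOUS_CREDITS_CSV))

# Attach the aggregate to each application; clients with no history get 0.
for app in applications:
    app["N_PREVIOUS_CREDITS"] = history_counts.get(app["SK_ID_CURR"], 0)

print(applications)
```

Aggregating the secondary table before joining keeps one row per client in the main table, which is the shape a scoring model expects.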