# Credit Risk Adjudication Model Documentation

This notebook is a simple example of how organisations could standardize their approach to AI by defining a series of steps that any data science team ought to address before model validation. Although not exhaustive, it shows that most of the questions raised by the IVU process could be addressed upfront, at the model development phase, to reduce friction between regulatory-imposed silos, increase the model validation success rate, and dramatically reduce the time from exploration to productionization of AI use cases.

<br>

<a href="https://www.ey.com/en_ca" target="_blank">
  <img src='https://raw.githubusercontent.com/databricks-industry-solutions/fsi-mrm-generation/main/templates/figs/ey_logo.png' width=150> 
</a>

## Executive Summary

*Note: Provide a summary of the model purpose, the scope of usage, and the target portfolio. The
model risk ranking, the model limitations, and any future developments or enhancements (if applicable)
should be highlighted as well.*

**[Your text goes here]**

## 1 Introduction

### 1.1 Model Background and Initiation

*Note: Describe the history behind the model development. If applicable, describe at a high level the significant changes that were previously made and provide the reason for the current development. Clarify whether this is a new model or whether significant changes are being made to an existing model. This section should also describe the composition of the model working group and how it was established, if applicable.*

**[Your text goes here]**

### 1.2 Model Purpose

*Note: Describe the model purpose and scope of usage. More precisely, clarify what the model is designed for, along with the model users, the model interdependencies, and any limitations to the model use.*

**[Your text goes here]**

### 1.3 Model Portfolio

*Note: Describe the underlying business product(s) and portfolio of the model. Provide any relevant information about the recent significant changes and trends in the portfolio. Outline key statistics on the portfolio evolution, regarding e.g., the number of borrowers, the amount of exposures, etc.*

**[Your text goes here]**

### 1.4 Model Risk Rating

*Note: Provide and justify the model risk ranking.*

**[Your text goes here]**

### 1.5 Model Log of Changes (if applicable)

*Note: Briefly describe all the previous (minor and major) changes that were made to the model,
including the date, justification and how they improved the model performance or use.*

**[Your text goes here]**

### 1.6 Business-Driven Risk Considerations

*Note: Explain the business risks that are explored and assessed during the model development process, and how they are accounted for in the final model (outputs). Describe and justify any mitigation action (plan) that helps reduce the business-driven risk.*

**[Your text goes here]**

### 1.7 Economic and Market Outlook

*Note: Explain how the current and forward-looking overall economic conditions may impact the business line and subsequently the model outcome.*

**[Your text goes here]**

### 1.8 Model Development Process

*Note: Describe the overall model development process, the different milestones of the process, along with the roles and responsibilities of the stakeholders involved at each of these key steps.*

**[Your text goes here]**

## 2 Data

### 2.1 Borrower Definition

*Note: Describe the borrower categories of the model portfolio/population. For instance, state whether the model applies to borrowers with a certain range of exposure, within a geographical area, or with a minimum/maximum of total assets (e.g., when the model also applies to SMEs). Outline the borrower identification process in the databases as well.*

**[Your text goes here]**

### 2.2 Data Sources

#### 2.2.1 Internal Data Sources
*Note: Describe the internal data sources, as well as their appropriateness for the model purpose and model population.*

**[Your text goes here]**

#### 2.2.2 External Data Sources
*Note: Describe the external data sources, as well as their consistency with the internal definitions and practices and their appropriateness for the model purpose and model population.*

**[Your text goes here]**

### 2.3 Data Historical Coverage and Suitability
*Note: Describe the data extraction process, along with the period spanned by the data and the statistics on the extracted observations. The section should evidence not only that the extracted data reflects the business practices and experiences, but also that it is suitable for the model purpose, modeling methodology and modeling assumptions.*

**[Your text goes here]**

### 2.4 Modeling Timeframes

#### 2.4.1 Timeframe Concepts
*Note: Explain the different concepts of the modeling timeframes used for the model development, specifically the observation period, the lag period, along with the performance period.*

**[Your text goes here]**

#### 2.4.2 Determination of the Performance and Lag Periods

*Note: Describe how the lag and performance periods were determined, including the judgemental considerations that were used. Provide a justification of the selections and their consistency with the model product and the observed borrowers’ experience.*

**[Your text goes here]**

#### 2.4.3 Modeling Timeframes

*Note: Describe the different modeling timeframes that were finally selected (i.e., the corresponding periods to the concepts explained in Section 2.4.1) for the model development and validation.*

**[Your text goes here]**

### 2.5 Target Variable Definition

*Note: Define the target variable (i.e., bad/default) and the other possible outcomes such as good and intermediate status.*

**[Your text goes here]**

In [0]:
# [Code goes here to show target variable distribution. The doc generation tool will be configured to pick the result of this cell]
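
A minimal sketch of what such a cell could compute, assuming the target is encoded as `1` for bad/default and `0` for good. The helper name `target_distribution` and the sample labels are hypothetical and use only the Python standard library; a real notebook would read the labels from the modeling dataset.

```python
from collections import Counter


def target_distribution(labels):
    """Return {label: (count, share)} for a sequence of target labels."""
    counts = Counter(labels)
    total = sum(counts.values())
    if total == 0:
        raise ValueError("no observations supplied")
    return {label: (n, n / total) for label, n in counts.items()}


# Hypothetical labels: 1 = bad/default, 0 = good (assumed encoding)
sample_labels = [0, 0, 0, 1, 0, 0, 1, 0, 0, 0]
distribution = target_distribution(sample_labels)

for label, (n, share) in sorted(distribution.items()):
    print(f"label={label} count={n} share={share:.1%}")
```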

### 2.6 Modeling Populations

#### 2.6.1 Eligible Population

*Note: Describe and provide statistics on all the extracted population that is qualified for the model development, without any treatment yet (e.g., without exclusions).*

**[Your text goes here]**

In [0]:
# [Code goes here to show statistics about the population. The doc generation tool will be configured to pick the result of this cell]

#### 2.6.2 Good-Bad Observations

*Note: Describe and provide statistics on observations that can be classified as good or bad observations.*

**[Your text goes here]**

In [0]:
# [Code goes here to show statistics about the good/bad observations. The doc generation tool will be configured to pick the result of this cell]

#### 2.6.3 Indeterminate Observations

*Note: Describe and provide statistics on observations that cannot be classified as good or bad observations.*

**[Your text goes here]**

In [0]:
# [Code goes here to show statistics about the indeterminate observations. The doc generation tool will be configured to pick the result of this cell]

#### 2.6.4 Statistically Inferred Performance Data

*Note: Describe the observations whose performance could not be observed (e.g., indeterminate observations) and the reject inference technique used to infer their performance. The reasons supporting the selected technique, along with the population considered, should be described as well.*

**[Your text goes here]**

In [0]:
# [Code goes here to show statistics. The doc generation tool will be configured to pick the result of this cell]

### 2.7 Data Exclusions and Treatment

*Note: Describe exclusions and any treatments (e.g., outlier and missing value treatment, and application of floors and caps) applied to the data, along with the supporting justification.*

**[Your text goes here]**
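
As an illustration of the treatments mentioned above, the sketch below imputes missing values and then applies a floor and a cap. The function name `treat_values`, the thresholds, and the fill value are hypothetical; actual values would come from the documented and justified treatment rules.

```python
def treat_values(values, floor, cap, fill_value):
    """Impute missing values (None), then clip each value to [floor, cap]."""
    treated = []
    for value in values:
        if value is None:
            value = fill_value  # missing value treatment (assumed: constant fill)
        treated.append(min(max(value, floor), cap))  # apply floor, then cap
    return treated


# Hypothetical raw variable with a missing value, an outlier and a negative entry
raw_income = [25_000, None, 1_200_000, 48_000, -500]
treated_income = treat_values(raw_income, floor=0, cap=250_000, fill_value=40_000)
print(treated_income)
```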

### 2.8 Sampling Methodology
*Note: Describe the data sampling methodology, along with the supporting justification. Provide evidence that the in-sample and out-of-sample populations have enough data points to allow for reliable
statistical test results.*

**[Your text goes here]**

### 2.9 Modeling Data Assessment
*Note: Describe the final dataset that will be used for the model development. Describe the data quality using statistics and graphs, along with any data limitations and their potential impact on the model
output.*

**[Your text goes here]**

## 3 Model Development

### 3.1 Methodology Selection
*Note: Describe the modeling methodology selection process. More specifically, first present and compare the different alternatives through the literature and industry practice review, and then explain
the rationale behind the selected approach. In addition, outline the mathematical definitions and equations, along with the assumptions and limitations of the selected modeling methodology.*

**[Your text goes here]**

### 3.2 Model Segmentation

*Note: Describe the model segmentation process, including the judgemental considerations, the statistical analyses, and the supporting rationale for the selected segments.*

**[Your text goes here]**

### 3.3 Model Variable Selection

*Note: Describe the variable selection process, from the initial list to the selected variables. The statistical analyses with their results and the business considerations should be described in the corresponding sub-sections below. Only relevant and applicable sub-sections should be documented. Additional analyses or tests may be added.*

**[Your text goes here]**

#### 3.3.1 Variable Reduction

*Note: Describe the first step of the variable selection process, which primarily consists of narrowing down the initial list of variables. Describe each of the topics listed below that is applicable to your approach, and add any others that were used.*
<br/><br/>

* *Business Considerations*
* *WOE Analysis*
* *Binning Process*
* *Information Value Analysis*
* *Variable Clustering*
* *Additional Considerations (if applicable)*

**[Your text goes here]**

In [0]:
# [Code goes here to display data and plots to support the variable selection methods and results. The doc generation tool will be configured to pick the result of this cell]
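
For the WOE and Information Value analyses listed above, a minimal sketch is given here. It assumes the convention WOE = ln(share of goods / share of bads) per bin (some teams use the inverse ratio) and that binning has already been done. The function name `woe_iv` and the bin counts are hypothetical.

```python
import math


def woe_iv(bins):
    """bins maps bin name -> (n_good, n_bad). Returns (woe_by_bin, iv)."""
    total_good = sum(good for good, _ in bins.values())
    total_bad = sum(bad for _, bad in bins.values())
    woe = {}
    iv = 0.0
    for name, (good, bad) in bins.items():
        if good == 0 or bad == 0:
            # WOE is undefined for empty cells; such bins are usually merged first
            raise ValueError(f"bin {name!r} has no goods or no bads")
        dist_good = good / total_good
        dist_bad = bad / total_bad
        woe[name] = math.log(dist_good / dist_bad)
        iv += (dist_good - dist_bad) * woe[name]
    return woe, iv


# Hypothetical binned counts for one candidate variable: (goods, bads)
example_bins = {"low": (400, 10), "medium": (300, 30), "high": (100, 60)}
woe_by_bin, information_value = woe_iv(example_bins)

for name, value in woe_by_bin.items():
    print(f"{name}: WOE={value:.3f}")
print(f"IV={information_value:.3f}")
```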

#### 3.3.2 Final Variable Reduction

*Note: Describe the additional step(s) to select and refine the model variables after the initial list is reduced.*
<br/><br/>

* *Univariate Analysis*
* *Multivariate and Multicollinearity Analysis*
* *Additional Considerations (if applicable)*

**[Your text goes here]**

In [0]:
# [Code goes here to display data and plots to support the final variable selection methods and results. The doc generation tool will be configured to pick the result of this cell]
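
One simple way to support the multicollinearity analysis is to flag highly correlated variable pairs. The sketch below uses pairwise Pearson correlation as a stand-in; it is not a full multicollinearity test such as a variance inflation factor analysis. The `0.8` threshold, the function names, and the sample columns are illustrative assumptions.

```python
import math
from itertools import combinations


def pearson(x, y):
    """Pearson correlation coefficient of two equal-length numeric sequences."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    var_x = sum((a - mean_x) ** 2 for a in x)
    var_y = sum((b - mean_y) ** 2 for b in y)
    return cov / math.sqrt(var_x * var_y)


def correlated_pairs(columns, threshold=0.8):
    """Return (name_a, name_b, r) for pairs with |r| above the threshold."""
    flagged = []
    for (name_a, a), (name_b, b) in combinations(columns.items(), 2):
        r = pearson(a, b)
        if abs(r) > threshold:
            flagged.append((name_a, name_b, r))
    return flagged


# Hypothetical candidate variables
example_columns = {
    "utilization": [0.1, 0.4, 0.5, 0.7, 0.9],
    "balance": [100, 410, 480, 720, 880],
    "age": [45, 23, 51, 37, 30],
}
flagged_pairs = correlated_pairs(example_columns)
print(flagged_pairs)
```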

### 3.4 Model Estimation

*Note: Describe the model estimation methodology and its suitability for the model purpose and the modeling data. Outline and interpret the specifications and estimation results.*

**[Your text goes here]**

### 3.5 Model Scaling

*Note: Describe the model scaling process. More specifically, cover the selection of the scaling equations and parameters, as well as the expert judgements that were considered. Display and interpret the model final results.*

**[Your text goes here]**
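
A commonly used scaling scheme maps good:bad odds to a score through a base score, base odds, and "points to double the odds" (PDO). The sketch below assumes that scheme: score = offset + factor × ln(odds), with factor = PDO / ln(2). The parameter values (600, 50, 20) and function names are hypothetical and not prescribed by this template.

```python
import math


def scaling_parameters(base_score, base_odds, pdo):
    """Derive factor and offset so that base_odds maps to base_score
    and doubling the odds adds pdo points."""
    factor = pdo / math.log(2)
    offset = base_score - factor * math.log(base_odds)
    return factor, offset


def score_from_odds(odds, factor, offset):
    """Convert good:bad odds into a scaled score."""
    return offset + factor * math.log(odds)


# Hypothetical scaling choices: 600 points at odds of 50:1, 20 points to double the odds
factor, offset = scaling_parameters(base_score=600, base_odds=50, pdo=20)
score_at_base = score_from_odds(50, factor, offset)
score_at_double = score_from_odds(100, factor, offset)
print(round(score_at_base, 2), round(score_at_double, 2))
```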

## 4 Model Performance Assessment

*Note: Thoroughly assess the model performance in this section. Each sub-section is designed to cover a particular dimension that is assessed, outline the analysis or statistical test that is performed, and
provide the interpretation of the results. Keep only relevant and applicable sub-sections. Add further analyses or tests as needed.*

**[Your text goes here]**

### 4.1 Output Analysis
**[Your text goes here]**

### 4.2 Discriminatory Power Testing

**[Your text goes here]**

In [0]:
# [Code goes here to display data and plots to support the tests and results. The doc generation tool will be configured to pick the result of this cell]

#### 4.2.1 Accuracy Ratio Test

**[Your text goes here]**

In [0]:
# [Code goes here to display data and plots to support the tests and results. The doc generation tool will be configured to pick the result of this cell]
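
A minimal sketch of an accuracy ratio calculation, using the identity AR = 2 × AUC − 1. It assumes a higher score means higher risk (for example a predicted probability of default) and labels of `1` for bad and `0` for good. The pairwise AUC below is quadratic in sample size and is meant for illustration only; the function names and data are hypothetical.

```python
def auc(scores, labels):
    """Probability that a random bad scores higher than a random good (ties count 0.5)."""
    bad = [s for s, y in zip(scores, labels) if y == 1]
    good = [s for s, y in zip(scores, labels) if y == 0]
    if not bad or not good:
        raise ValueError("both good and bad observations are required")
    wins = 0.0
    for b in bad:
        for g in good:
            if b > g:
                wins += 1.0
            elif b == g:
                wins += 0.5
    return wins / (len(bad) * len(good))


def accuracy_ratio(scores, labels):
    """Accuracy ratio (Gini coefficient) derived from the AUC."""
    return 2.0 * auc(scores, labels) - 1.0


# Hypothetical PD estimates and observed outcomes (1 = bad, 0 = good)
pd_estimates = [0.05, 0.10, 0.20, 0.30, 0.60, 0.80]
outcomes = [0, 0, 1, 0, 1, 1]
ar = accuracy_ratio(pd_estimates, outcomes)
print(f"AR={ar:.4f}")
```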

#### 4.2.2 Kolmogorov-Smirnov Test

**[Your text goes here]**

In [0]:
# [Code goes here to display data and plots to support the tests and results. The doc generation tool will be configured to pick the result of this cell]
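
A minimal sketch of the Kolmogorov-Smirnov statistic as it is typically used for scorecards: the maximum gap between the cumulative score distributions of bads and goods. The function name `ks_statistic` and the sample data are hypothetical, and labels are assumed to be `1` for bad and `0` for good.

```python
def ks_statistic(scores, labels):
    """Maximum absolute difference between the empirical CDFs of bad and good scores."""
    bad = [s for s, y in zip(scores, labels) if y == 1]
    good = [s for s, y in zip(scores, labels) if y == 0]
    if not bad or not good:
        raise ValueError("both good and bad observations are required")
    max_gap = 0.0
    for threshold in sorted(set(scores)):
        cdf_bad = sum(1 for s in bad if s <= threshold) / len(bad)
        cdf_good = sum(1 for s in good if s <= threshold) / len(good)
        max_gap = max(max_gap, abs(cdf_bad - cdf_good))
    return max_gap


# Hypothetical PD estimates and observed outcomes (1 = bad, 0 = good)
pd_estimates = [0.05, 0.10, 0.20, 0.30, 0.60, 0.80]
outcomes = [0, 0, 1, 0, 1, 1]
ks = ks_statistic(pd_estimates, outcomes)
print(f"KS={ks:.4f}")
```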

### 4.3 Sensitivity Analysis


**[Your text goes here]**

In [0]:
# [Code goes here to display data and plots to support the tests and results. The doc generation tool will be configured to pick the result of this cell]

### 4.4 Population Stability Analysis

**[Your text goes here]**

In [0]:
# [Code goes here to display data and plots to support the tests and results. The doc generation tool will be configured to pick the result of this cell]
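
Population stability is often summarized with the Population Stability Index (PSI), computed over score bands as the sum of (actual share − expected share) × ln(actual share / expected share). The sketch below assumes that definition; the function name, band counts, and the choice of bands are hypothetical, and any acceptance thresholds would be set by the institution.

```python
import math


def population_stability_index(expected_counts, actual_counts):
    """PSI between a reference (expected) and a recent (actual) distribution over the same bands."""
    if len(expected_counts) != len(actual_counts):
        raise ValueError("both distributions must use the same bands")
    total_expected = sum(expected_counts)
    total_actual = sum(actual_counts)
    psi = 0.0
    for expected, actual in zip(expected_counts, actual_counts):
        if expected == 0 or actual == 0:
            # The log ratio is undefined for empty bands; merge bands beforehand
            raise ValueError("empty band encountered")
        share_expected = expected / total_expected
        share_actual = actual / total_actual
        psi += (share_actual - share_expected) * math.log(share_actual / share_expected)
    return psi


# Hypothetical counts per score band: development sample vs. recent population
development_counts = [100, 200, 400, 200, 100]
recent_counts = [80, 180, 400, 220, 120]
psi_value = population_stability_index(development_counts, recent_counts)
print(f"PSI={psi_value:.4f}")
```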

### 4.5 Benchmarking


**[Your text goes here]**

In [0]:
# [Code goes here to display data and plots to support the tests and results. The doc generation tool will be configured to pick the result of this cell]

## 5 Model Assumptions and Limitations

### 5.1 Model Assumptions
*Note: Describe the key assumptions made throughout the model development process and provide evidence to support their reasonableness and soundness.*


**[Your text goes here]**

### 5.2 Model Limitations
*Note: Describe the key model limitations, their potential impact on the model, as well as the corresponding mitigation action plan(s) to reduce the model risk.*

**[Your text goes here]**

## 6 Model Ongoing Monitoring

### 6.1 Ongoing Performance Assessment
*Note: Describe the ongoing model performance monitoring plan. Cover the statistical tests (including e.g., the frequency and acceptance thresholds) that will be performed on an ongoing basis to
ensure the model is still performing adequately.*

**[Your text goes here]**

### 6.2 Documentation Review
*Note: Describe the conditions or types of model changes that trigger the model documentation review, as well as the key components that need to be reviewed.*

**[Your text goes here]**

## 7 References

**[Your text goes here]**

<div style="text-align: center; margin-top: 30px;">
    <img src='https://raw.githubusercontent.com/databricks-industry-solutions/fsi-mrm-generation/main/templates/figs/ey_logo.png' height="50px">
    <br>
    <br>
    <em>Disclaimer: The views and opinions expressed in this blog are those of the authors and do not necessarily reflect the policy or position of EY.</em>
</div>