This repository is dedicated to my upcoming book, 📕 𝐀𝐩𝐩𝐥𝐢𝐞𝐝 𝐃𝐚𝐭𝐚 𝐒𝐜𝐢𝐞𝐧𝐜𝐞 𝐟𝐨𝐫 𝐂𝐫𝐞𝐝𝐢𝐭 𝐑𝐢𝐬𝐤 and other topics related to credit risk modeling. It will be regularly updated with GitHub pages, slides, and PDF documents covering various modeling subjects.
The motivation behind writing this book and creating the repository stems from the observed gap between academic literature, industry practices, and the evolving landscape of data science. While there's been a notable increase in literature on credit risk modeling, discrepancies persist. The evolution of data science has led to significant automation in processes. Still, it has also brought the risk of overreliance on pre-programmed procedures, sometimes leading to the misuse of statistical methods. Moreover, many practitioners entering credit risk modeling often overlook fundamental principles, hindering their professional development. Hence, the repository aims to serve as a centralized hub for continuous education and consolidating essential concepts.
The repository and book will encompass practical examples utilizing both R
and Python
.
Below are links providing an overview of the repository's main topics, which include summaries from the book and insights gleaned from practical experience.
- Consequences of Violating the Normality Assumption for OLS Regression (pdf,
R
&Python
code) - Consequences of Heteroscedasticity for OLS Regression (pdf,
R
&Python
code) - Consequences of Multicollinearity for OLS Regression (pdf,
R
&Python
code) - Consequences of Autocorrelation for OLS Regression (pdf,
R
&Python
code) - Power Play: Probability of Default Predictive Ability Testing (pdf, presentation)
- Nested Dummy Encoding (pdf, presentation)
- Marginal Information Value (pdf, presentation)