Skip to content

KeatonSpiller/Statistical-Learning

Repository files navigation

Download the PowerPoint to see the analysis breakdown.

Summary

Using the German Credit dataset to statistically model Default Rates. This involved exploratory analysis, variable selection, and statistical modeling. In order to analyze the default rate of the german people I compared logistic regression, KNN, LDA and QDA models for the best error rate, sensitivity, specificity, and AUC. LDA had the best model for this project with the lowest error rate, highest accuracy with the best balance of AUC.

Statistical-Learning

Statistical Learning

  • Requirements
  • R Markdown
  1. Load the germancredit.csv data

  2. Exploratory Analysis of data

  3. Building a reasonably “good” logistic regression model

  4. Analyzing regression coefficients and error rate

  5. Fit KNN model and find error rate, sensitivity, specificity, and AUC

  6. Fit LDA model and find error rate, sensitivity, specificity, and AUC

  7. Fit QDA model and find error rate, sensitivity, specificity, and AUC

  8. Comparing every classifier

  9. Cross validation LOOCV and k-fold analysis

  10. Ridge and Lasso analysis

About

Statistical Learning

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages