Explorative data analysis of loan data in the US, 2005-2014
October 2017, Christopher Buss
In this project, I explored load data from Prosper, a US-based lending platform. The data set contains data from 113,937 loans and 84 variables. The objectives of the analysis was to summarize the data to determine (1) the relationship between the various variables of interest and (2) how the interest rates for individuals loans can be predicted with the available data.I used a wide range of plots to explore the data and find relationships.
- prosperLoanData.csv: main data file, can be obtained from: https://www.google.com/url?q=https://s3.amazonaws.com/udacity-hosted-downloads/ud651/prosperLoanData.csv&sa=D&ust=1509024323909000&usg=AFQjCNEyzeoOYi0AJAm3ZLHU3L_Ke5o-CA
- prosper_data_variables.xlms: variable definitions
- prosper_analysis.Rmd: data exploration and answers to the questions
- prosper_analysis.html: html knitted from prosper_analysis.Rmd