Applied Machine Learning
Machine learning is the study and application of algorithms that learn from and make predictions on data. From search results to self-driving cars, it has manifested itself in all areas of our lives and is one of the most exciting and fast-growing fields of research in the world of data science.
This two-day course will provide an overview of using R for supervised learning. The session will step through the process of building, visualizing, testing, and comparing models that are focused on prediction.
The goal of the course is to provide a thorough workflow in R that can be used with many different regression or classification techniques. Case studies on real data will be used to illustrate the functionality and several different predictive models are illustrated. The course focuses on both low- and high-level approaches to modeling using the tidyverse and uses several types of models for illustration.
Attendees will be able to use the tidymodels packages to create, tune, fit, visualize, and assess models created for the purpose of prediction.
Is this course for me?
This course requires basic familiarity with R and the tidyverse.
We will have RStudio server pro instances with all of the packages installed as well as the above GitHub repository available.
If you would like to run R locally, the installation instructions are:
install.packages( c( 'AmesHousing', 'C50', 'devtools', 'discrim', 'earth', 'ggthemes', 'glmnet', # See important note below 'klaR', 'lubridate', 'modeldata', 'party', 'pROC', 'rpart', 'stringr', 'textfeatures', 'tidymodels' ), repos = "http://cran.rstudio.com" ) devtools::install_github(c( "tidymodels/tidymodels", "tidymodels/tune", "tidymodels/textrecipes", "koalaverse/vip", "gadenbuie/countdown" ))
Important note! A new version of
glmnet was released on 2019-11-09. Although it states that it depends on R (≥ 3.5.0), it may not install on R versions < 3.6.0.
We will be on-site at least 30min before the workshop commences in case you need any help getting packages installed. Prior to this, you can email
email@example.com with questions.
We don’t provide the
Rmd files for the slides mostly because they are complex and we don’t support them. However, we do get requests for people who would like to use them as a template so we provide
Part_1.Rmd if you want to use this format for your presentations.
|09:00 - 10:30||Session 1|
|10:30 - 11:00||Coffee break|
|11:00 - 12:30||Session 2|
|12:30 - 13:30||Lunch break|
|13:30 - 15:00||Session 3|
|15:00 - 15:30||Coffee break|
|15:30 - 17:00||Session 4|
Max Kuhn and Davis Vaughan
This work is licensed under a Creative Commons Attribution 4.0 International License.