Skip to content

robertzk/r-book

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

34 Commits
 
 
 
 
 
 
 
 

Repository files navigation

Classifier Development in R

This book is intended to serve as an introduction to production-level classifier development in the R programming language. The sections denoted by * are optional.

  • Introduction

  • Data Preparation

  • Modeling

    • Some manual exercises
      • A linear regression
      • A GBM model
    • Tundra (docs/turning models into R objects)
      • A linear regression tundra model
      • A GBM tundra model
      • Understanding training parameters
      • Using prediction parameters
      • The philosophy of tundra
      • * Writing our own GLM tundra container
    • Stagerunner
      • An example of the full modeling process
      • Introducing stagerunner: parametrizing our modeling process
      • Quick detour: other use cases for stagerunner
      • * Advanced features of stagerunner objects
      • * Debugging stagerunner objects
      • Interactive stagerunners and caching
      • An ensemble of stagerunners
    • Syberia
      • Introduction
      • The import stage
      • The data stage
      • The model stage
      • The export stage
      • * The evaluation stage
      • Testing your syberia models
      • Re-factoring complicated models with Ramd
  • Deployment

    • Microserver
      • Launching a microserver on EC2
      • Deploying your syberia model to S3
    • Validation
      • The philosophy of validation
      • Testing alpha versus beta operations
    • Scaling with nginx

About

Classifier Development in R

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published