Skip to content

galiya/mr4ds

 
 

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Microsoft R for Data Science Workshop

Join the chat at https://gitter.im/mr4ds/Lobby

Welcome to the Microsoft R for Data Science Course Repository. You can find the latest materials from the workshop here, and links for course materials from prior iterations of the course ca be found in the version pane. While this course is intended for data scientists and analysts interested in the Microsoft R programming stack (i.e., Microsoft employees in the Algorithms and Data Science group), other programmers might find the material useful as well.

Class Links

  • course webpage
  • gitter page
    • we are going to try and use gitter as a discussion forum for anything related to the course materials, and Microsoft R Server more generally
  • Course wiki
    • the course wiki contains some instructions on how to install the class applications locally
    • it also contains the course syllabus
  • Class Playlist
    • As your instructor, I'll also be your workshop dj. Feel free to make requests.

Course Outline

Please refer to the course syllabus for the full syllabus. The goal of this course is to cover the following modules, although some of the latter modules may be repalced for a hackathon/office hours.

  • Topics:
    • R Fundamentals
    • Data Manipulation with dplyr
    • Data Manipulation with dplyrXdf
    • Modeling and Scoring with Microsoft R
    • Parallel Computing with the RevoScaleR package
    • Deploying Models with the AzureML package
    • RxSpark and R APIs for Spark

DSVMs

We will use DSVMs (Data Science Virtual Machines) from the Azure marketplace to run the course materials. For the Spark training, we will use Spark HDInsight Premium clusters, also from Azure. If you are interested in running these materials in a different environment, see the course wiki for instructions.

Credentials

  • I'll send you your credentials by email

About

R and Microsoft R Workflows for Data Science

Resources

License

Stars

Watchers

Forks

Packages

No packages published

Languages

  • HTML 82.0%
  • Jupyter Notebook 12.6%
  • JavaScript 3.0%
  • CSS 2.3%
  • R 0.1%
  • Shell 0.0%