Microsoft R with Spark on HDInsight
This is a one-day workshop on Using R Server on Spark. Student can find course modules as rmarkdown documents in the Student-Resources directory. Instructions on how to deliver the course can be found in the Instructor-Resources directory. It is usually expected that the student has already completed the Microsoft R for Data Science Course.
Tutorial Cluster Deployment Instructions
Click the “Deploy to Azure” button
Fill in the form and click “Purchase”. IMPORTANT: Set Cluster Login User Name = "admin" and Ssh User Name = "sshuser". Here is an example:
Wait 30-40 minutes for the cluster to deploy
We will run our R scripts using the RStudio IDE. To launch RStudio in your browser, from the cluster overview in the Azure portal, click "R Server dashboards" and then "R Studio server". At the first login screen, enter "admin" and the password you supplied. At the second login screen, enter "sshuser" and the password you supplied.
Once in RStudio, go to the Files pane in the lower right-hand corner and click on "SparkMLADS" and then "Code". Here you will find the directories for the hands-on tutorial scripts.
Resources for re-delivery can be found in the Instructor Resources folder.