Code and Slides from Spark on Azure presentation at Silicon Valley Code Camp 2015
1 titanic_logistic_regression.ipynb
2 titanic_decision_tree.ipynb
3 titanic_randomforest_grid_search.ipynb
4 titanic_dataframes.ipynb
5 titanic_pipelines.ipynb
README.md
spark_on_azure.pptx
svcc_submission1.csv
svcc_submission2.csv


SparkOnAzure

Code and slides from the Spark on Azure presentation at Silicon Valley Code Camp 2015. The first three Jupyter notebooks (1-3) can be run sequentially to generate progressively better submissions for the Titanic competition on kaggle.com.

The last two notebooks illustrate Spark SQL and DataFrames, as well as an (incomplete) Spark ML pipeline, which will be the topic of future presentations at the Bay Area Azure meetup (http://www.meetup.com/bayazure).

@EugeneChuvyrov
