PyCon2016

Code and Presentation for PyCon2016

Join the chat at https://gitter.im/shagunsodhani/PyCon2016

Topic

Big Data Analysis using PySpark

Presentation

Local Setup

  • Follow the instructions on iota to set up the execution environment, then download and process the raw Stack Exchange data.
  • cd notebook
  • ./run.sh

Databricks Community Edition

Dataset

  • The processed data used for the demo can be downloaded here.
  • To get the latest version of the data, follow the instructions on iota to set up the execution environment, then download and process the raw Stack Exchange data.
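
As a rough illustration of what "processing the raw data" can involve, here is a minimal sketch that parses one record in the style of a Stack Exchange dump file. It is not the code used by iota or by the notebooks in this repo: the sample row, the `parse_row` helper, the chosen attributes, and the `<tag>` encoding of the `Tags` field are all assumptions made for the example.

```python
import xml.etree.ElementTree as ET

# Hypothetical sample row, shaped like a line from a Stack Exchange Posts.xml dump.
# Assumption: tags are encoded as "<tag1><tag2>" (XML-escaped in the file).
SAMPLE_ROW = (
    '<row Id="1" PostTypeId="1" Score="5" '
    'Tags="&lt;python&gt;&lt;apache-spark&gt;" />'
)


def parse_row(line):
    """Turn one <row .../> line into a dict, or return None for non-row lines."""
    line = line.strip()
    if not line.startswith("<row"):
        return None
    attrs = ET.fromstring(line).attrib
    # After XML unescaping, Tags looks like "<python><apache-spark>".
    raw_tags = attrs.get("Tags", "")
    tags = raw_tags.replace("<", " ").replace(">", " ").split()
    return {
        "id": int(attrs["Id"]),
        "post_type": int(attrs["PostTypeId"]),
        "score": int(attrs.get("Score", 0)),
        "tags": tags,
    }


record = parse_row(SAMPLE_ROW)
print(record)
```

With PySpark, a helper like this could be applied line by line, for example with `sc.textFile(path).map(parse_row)` followed by a filter that drops `None` values, where `sc` is an existing `SparkContext` and `path` points at a dump file.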

More Notebooks

  • This repo contains only the notebooks that will be demoed at PyCon2016.
  • For more notebooks related to Stack Exchange data, check out iota.

Attribution

The image showing the workflow to import the notebooks was created by Databricks and is licensed under CC BY-NC-ND 4.0: https://creativecommons.org/licenses/by-nc-nd/4.0/