PyCon2016

Code and Presentation for PyCon2016

Join the chat at https://gitter.im/shagunsodhani/PyCon2016

Topic

Big Data Analysis using PySpark

Presentation

Local Setup

  • Follow the instructions on iota to set up the execution environment, then download and process the raw Stack Exchange data.
  • cd notebook
  • ./run.sh
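
The last two steps can fail quietly if the repository layout is not what `run.sh` expects, so a small pre-flight check may help. The sketch below is not part of this repo: the helper names `check_local_setup` and `launch` are hypothetical, and it assumes only what the steps above state, namely that `run.sh` lives inside the `notebook` directory and is run from there.

```python
import os
import subprocess
from pathlib import Path


def check_local_setup(repo_root="."):
    """Return a list of problems that would stop `./run.sh` from starting.

    Hypothetical helper: it only checks the layout implied by the setup
    steps (a `notebook` directory containing an executable `run.sh`).
    """
    notebook_dir = Path(repo_root) / "notebook"
    script = notebook_dir / "run.sh"
    problems = []
    if not notebook_dir.is_dir():
        problems.append(f"missing directory: {notebook_dir}")
    elif not script.is_file():
        problems.append(f"missing script: {script}")
    elif not os.access(script, os.X_OK):
        problems.append(f"script is not executable: {script}")
    return problems


def launch(repo_root="."):
    """Rough equivalent of `cd notebook` followed by `./run.sh`."""
    problems = check_local_setup(repo_root)
    if problems:
        raise RuntimeError("; ".join(problems))
    # Run from inside notebook/ so relative paths in run.sh resolve
    # the same way as in the manual steps.
    subprocess.run(["./run.sh"], cwd=Path(repo_root) / "notebook", check=True)
```

Calling `launch()` from the repository root should behave like the manual steps, but raises a readable error first if the directory or script is missing.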

Databricks Community Edition

Dataset

  • The processed data used for the demo can be downloaded here.
  • To get the latest version of the data, follow the instructions on iota to set up the execution environment, then download and process the raw Stack Exchange data.

More Notebooks

  • This repo contains only the notebooks that will be demoed at PyCon2016.
  • For more notebooks related to Stack Exchange data, check out iota.

Attribution

The image showing the workflow to import the notebooks was created by Databricks and is licensed under CC BY-NC-ND 4.0: https://creativecommons.org/licenses/by-nc-nd/4.0/