Skip to content
master
Go to file
Code

Latest commit

 

Git stats

Files

Permalink
Failed to load latest commit information.
Type
Name
Latest commit message
Commit time
 
 
 
 
 
 
 
 

README.md

PyCon2016

Code and Presentation for PyCon2016

Join the chat at https://gitter.im/shagunsodhani/PyCon2016

Topic

Big Data Analysis using PySpark

Presentation

Local Setup

  • Follow the instructions on iota to setup execution environment and get and process the raw StackExchange Data.
  • cd notebook
  • ./run.sh

Databricks Community Edition

Dataset

  • Processed data (which is used for demo) can be downloaded here.
  • For getting the latest version of data, follow the instructions on iota to setup execution environment and get and process the raw StackExchange Data.

More Notebooks

  • This repo only contains notebooks which will be demoed at PyCon2016.
  • For more notebooks related to Stack Exchange data, check out iota

Attribution

The image showing the workflow to import the notebooks is created by Databricks. Licence at https://creativecommons.org/licenses/by-nc-nd/4.0/

About

Code and Presentation for PyCon2016

Resources

License

Releases

No releases published
You can’t perform that action at this time.