🌐 Interactive Workshop on GeoAnalysis using PySpark
Switch branches/tags
Nothing to show
Clone or download
Fetching latest commit…
Cannot retrieve the latest commit at this time.
Permalink
Failed to load latest commit information.
code
data
docker
images
tests/unit
work-flow
.gitignore
001-data.ipynb
002-what-are-we-doing.ipynb
003-geopandas-and-spark.ipynb
004-counting-museums.ipynb
005-join-udfs-and-spatial-predicates.ipynb
006-spatial-spark.ipynb
007-Analysis-of-all-Cities.ipynb
README.md
circle.yml
jars
slides.md

README.md

Docker Image Test Status:

CircleCI

A Small Course on Big Data - GeoAnalysis using PySpark

House Keeping

Who's Here?

I love staying in touch here's a link to a form where you can add your details for me to stay in touch with you. I also love feedback good and bad! I love to get better at my job. So as we go though this course I want you to keep in mind that I will ask you to provide some feedback afterwards. You can keep it anonymous of choose to tell me who you are. See feedback form here: Feedback Form

  • Who is using Spark in Production?
  • Who is doing Geospatial Analysis using Spark?
  • Who is a programmer?
  • Who is a Data Janitor... err I mean Scientist 😄
  • Who is a hedge fund manager? ... here's my number 181821113 (bank account number, that is!)
  • Who is doing something else? I have missed?

Introduction

This workshop will introduce you to Apache Spark via the exciting domain of Geospatial Analysis.

Setup

Dependencies:

See: docker/README.md

Data

If you use docker the data will automatically downloaded into the work-flow folder. See docker/README.md