Skip to content

joaocerca/PySpark-Scripts

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

10 Commits
 
 
 
 
 
 
 
 
 
 

Repository files navigation

PySpark-Scripts

Collection of pyspark scripts

This repository is a place where I share my scripts created while learning Spark, within the Hortonworks Sandbox.

US Congress Members

It explores a bit the data compiled by the website FiveThirtyEight, related to the members of US Congress, from 1947 to 2014. I used PySpark SQL. The source link.
USCongress_age.py

International Football Stats

With the International football results dataset, compiled by Mart Jürisoo, I explored some of the metrics of the Scottish National Team. For the moment, I am not taking into consideration the shootouts data, however, the file is also included in the source files folder. The source link.
football_stats.py

About

Collection of pyspark scripts

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages