A portfolio of data science projects analysing open source data.
Data Playground |
---|
World Bank Group Finances Analysing World Bank Group Finances data using sodapy to establish an API connection. |
World Bank Group Finances: Zimbabwe Analysis Analysing World Bank Group Finances data. Focus: Zimbabwe. |
World Bank Group Finances: Disbursed Amount Analysis Analysing World Bank Group Finances data. Focus: Disbursed Amounts. |
Product Analytics Case Study: Telecommunications Analysing the product data from a telecommunications company. |
Fifa18 Ultimate Team Data Analysis Analysing fantasy football stats. |
arXiv Submissions & Downloads Simulating time series for arXiv submissions and downloads. Includes a second Jupyter notebook which explores the spatial aspect of the submissions and downloads. |
Backblaze Hard Drive Failure Rates Looking at hard drive failure data from Backblaze. |
Amazon Unlocked Phone Reviews Exploring sentiments from unlocked phones sold on Amazon. |
When Does Praying In Public Make Others Uncomfortable? Exploring the data behind the story When does praying in public make others uncomfortable?. Survey was conducted from 29 July - 1 August 2016 and the data can be found in this repo or the original fivethirtyeight repo. |
Trump Tweets Exploring the data behind the story The World’s Favorite Donald Trump Tweets. |
Twitter Ratio An in-depth look at the data behind the story The Worst Tweeter In Politics Isn't Trump. We look at the topics of interest based on sentiments shared by U.S. senators, the 44th U.S. President, Barack Obama and the 45th President, Donald Trump on Twitter. |
Tweets Predictor Building a resuable machine learning pipeline that can predict the author of a particular sentiment. We also make use of Dask , a Python module for scaling machine learning models. |
Bayesian Inference in Practice Gaining a deeper understanding of Bayesian inference through complete (end-to-end) data analytics projects. |
311 Service Requests Data Pipeline A simple data pipeline for retrieving data for 311 service calls and storing them on a Postgresql database. |
Natural Language Processing (NLP) with Tensorflow In this repo, we explore some deep learning concepts using Tensorflow. |
Loan Prediction Challenge In this project, we implement end-to-end data science, from exploration, feature engineering, feature selection, modeling, and optimisation. |
DvD Rental Pipeline A data pipeline to show the most populary rented out movies. |