Skip to content
This repository has been archived by the owner on Jan 16, 2023. It is now read-only.

wildlyclassyprince/data-playground

Repository files navigation

Data Playground

A portfolio of data science projects analysing open source data.

Data Playground
World Bank Group Finances
Analysing World Bank Group Finances data using sodapy to establish an API connection.
World Bank Group Finances: Zimbabwe Analysis
Analysing World Bank Group Finances data. Focus: Zimbabwe.
World Bank Group Finances: Disbursed Amount Analysis
Analysing World Bank Group Finances data. Focus: Disbursed Amounts.
Product Analytics Case Study: Telecommunications
Analysing the product data from a telecommunications company.
Fifa18 Ultimate Team Data Analysis
Analysing fantasy football stats.
arXiv Submissions & Downloads
Simulating time series for arXiv submissions and downloads. Includes a second Jupyter notebook which explores the spatial aspect of the submissions and downloads.
Backblaze Hard Drive Failure Rates
Looking at hard drive failure data from Backblaze.
Amazon Unlocked Phone Reviews
Exploring sentiments from unlocked phones sold on Amazon.
When Does Praying In Public Make Others Uncomfortable?
Exploring the data behind the story When does praying in public make others uncomfortable?. Survey was conducted from 29 July - 1 August 2016 and the data can be found in this repo or the original fivethirtyeight repo.
Trump Tweets
Exploring the data behind the story The World’s Favorite Donald Trump Tweets.
Twitter Ratio
An in-depth look at the data behind the story The Worst Tweeter In Politics Isn't Trump. We look at the topics of interest based on sentiments shared by U.S. senators, the 44th U.S. President, Barack Obama and the 45th President, Donald Trump on Twitter.
Tweets Predictor
Building a resuable machine learning pipeline that can predict the author of a particular sentiment. We also make use of Dask, a Python module for scaling machine learning models.
Bayesian Inference in Practice
Gaining a deeper understanding of Bayesian inference through complete (end-to-end) data analytics projects.
311 Service Requests Data Pipeline
A simple data pipeline for retrieving data for 311 service calls and storing them on a Postgresql database.
Natural Language Processing (NLP) with Tensorflow
In this repo, we explore some deep learning concepts using Tensorflow.
Loan Prediction Challenge
In this project, we implement end-to-end data science, from exploration, feature engineering, feature selection, modeling, and optimisation.
DvD Rental Pipeline
A data pipeline to show the most populary rented out movies.