Skip to content

Performed the Extract, Transform and Load (ETL) process to create a data pipeline on movie datasets using Python, Pandas, Jupyter Notebook and PostgreSQL.

Notifications You must be signed in to change notification settings

nhafer88/Movies_ETL

Repository files navigation

etl_banner

Amazing Prime Hackathon

Project Overview

This analysis project provides a visualization for predicting which low-budget movies being released will become popular. Data from Wikipedia (movies released since 1990) and Movielens/Kaggle (movie ratings) were utilized in this project. The tasks in this project:

  • Extract the data from the data sources
  • Tranform the data in clean data set using Python and Pandas
  • Load the data set into a SQL table

SQL Queries

Row count for Movies Table

movies_query

Row count for Ratings Table

ratings_query

About

Performed the Extract, Transform and Load (ETL) process to create a data pipeline on movie datasets using Python, Pandas, Jupyter Notebook and PostgreSQL.

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published