Movies - Extract, Transform, Load

Overview

Amazing Prime would like to automate a pipeline which takes in movie data from 3 sources (Wikipedia, Kaggle and MovieLens) and performs an extract, transform, load process to a PostgreSQL database. Existing code from a Hackathon was refactored using Python in a Jupyter Notebook to create one function to perform this operation. The outputs are 2 tables in a movie_data database titled: movies and ratings.

Table: movies

The table has 22 columns and 6052 rows

Table: ratings

The table has 5 columns and 26024289 rows

Name		Name	Last commit message	Last commit date
Latest commit History 47 Commits
Resources		Resources
module 8 code		module 8 code
.gitattributes		.gitattributes
.gitignore		.gitignore
ETL_clean_kaggle_data.ipynb		ETL_clean_kaggle_data.ipynb
ETL_clean_wiki_movies.ipynb		ETL_clean_wiki_movies.ipynb
ETL_create_database.ipynb		ETL_create_database.ipynb
ETL_function_test.ipynb		ETL_function_test.ipynb
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Resources

Resources

module 8 code

module 8 code

.gitattributes

.gitattributes

.gitignore

.gitignore

ETL_clean_kaggle_data.ipynb

ETL_clean_kaggle_data.ipynb

ETL_clean_wiki_movies.ipynb

ETL_clean_wiki_movies.ipynb

ETL_create_database.ipynb

ETL_create_database.ipynb

ETL_function_test.ipynb

ETL_function_test.ipynb

README.md

README.md

Repository files navigation

Movies - Extract, Transform, Load

Overview

Table: movies

Table: ratings

About

Releases

Packages

Languages

lnshewmo/Movies-ETL

Folders and files

Latest commit

History

Repository files navigation

Movies - Extract, Transform, Load

Overview

Table: movies

Table: ratings

About

Resources

Stars

Watchers

Forks

Languages