Movies-ETL

Module 8 of Data Analytics Bootcamp

Overview

A company called Amazing Prime is hosting a hackathon and has asked us to prepare the datasets that the participants will work with. The data was gathered from Wikipedia and Kaggle. The main goal is to write a function that cleans these large datasets and merges them together.

We use Python and pandas in Jupyter Notebook, along with SQL in pgAdmin 4, to clean a significant amount of data from Wikipedia and Kaggle. We first read in all the data and clean it in Jupyter Notebook, then merge the datasets and load the result into a SQL database.
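The workflow above (read, clean, merge, load) can be sketched roughly as follows. This is a minimal illustration and not the code from the notebooks: the function name `clean_and_merge`, the column names (`imdb_id`, `title`, `director`, `budget`), and the sample rows are all hypothetical, and an in-memory SQLite database stands in for the SQL database managed through pgAdmin 4 so that the example is self-contained.

```python
import sqlite3

import pandas as pd


def clean_and_merge(wiki_rows, kaggle_rows):
    """Clean two movie tables and merge them on a shared ID column.

    Hypothetical helper: the column names are assumptions made for
    illustration, not the schema used in the notebooks.
    """
    wiki_df = pd.DataFrame(wiki_rows)
    kaggle_df = pd.DataFrame(kaggle_rows)

    # Drop rows that lack the join key, then drop duplicate movies.
    wiki_df = wiki_df.dropna(subset=["imdb_id"]).drop_duplicates(subset="imdb_id")
    kaggle_df = kaggle_df.dropna(subset=["imdb_id"]).drop_duplicates(subset="imdb_id")

    # Convert budget to a number; values that cannot be parsed become NaN.
    kaggle_df["budget"] = pd.to_numeric(kaggle_df["budget"], errors="coerce")

    # Keep only movies present in both sources.
    return wiki_df.merge(
        kaggle_df, on="imdb_id", how="inner", suffixes=("_wiki", "_kaggle")
    )


# Made-up sample rows standing in for the Wikipedia and Kaggle data.
wiki_rows = [
    {"imdb_id": "tt0000001", "title": "Example A", "director": "Director A"},
    {"imdb_id": "tt0000001", "title": "Example A", "director": "Director A"},
    {"imdb_id": None, "title": "No ID", "director": "Director B"},
    {"imdb_id": "tt0000002", "title": "Example B", "director": "Director C"},
]
kaggle_rows = [
    {"imdb_id": "tt0000001", "title": "Example A", "budget": "1000000"},
    {"imdb_id": "tt0000002", "title": "Example B", "budget": "unknown"},
    {"imdb_id": "tt0000003", "title": "Example C", "budget": "500"},
]

movies_df = clean_and_merge(wiki_rows, kaggle_rows)

# Load the merged table into SQL (SQLite here, as a stand-in).
conn = sqlite3.connect(":memory:")
movies_df.to_sql("movies", conn, index=False, if_exists="replace")
row_count = conn.execute("SELECT COUNT(*) FROM movies").fetchone()[0]
print(row_count)
```

An inner merge is only one possible choice; it keeps movies found in both sources, which may or may not match what the notebooks do.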

Resources
.gitignore
ETL_clean_kaggle_data.ipynb
ETL_clean_wiki_movies.ipynb
ETL_create_database.ipynb
ETL_function_test.ipynb
README.md


mdbinger/Movies-ETL
