Skip to content

Udacity Data Engineering Nanodegree Capstone Project

Notifications You must be signed in to change notification settings

Jia-0308/Capstone_Project

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

4 Commits
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Capstone_Project

Project Summary

This projects aggregate several data sources (immigration, temperature, demographic and airport codes), which aims to help the immigration office to better understand the pattern of migrations.

The project follows the follow steps:

  • Step 1: Scope the Project and Gather Data
  • Step 2: Explore and Assess the Data
  • Step 3: Define the Data Model
  • Step 4: Run ETL to Model the Data
  • Step 5: Complete Project Write Up

Datasets

  1. I94 Immigration Data: This data comes from the US National Tourism and Trade Office. A data dictionary is included in the workspace. This is where the data comes from. There's a sample file so you can take a look at the data in csv format before reading it all in. You do not have to use the entire dataset, just use what you need to accomplish the goal you set at the beginning of the project.
  2. World Temperature Data: This dataset came from Kaggle. You can read more about it here.
  3. U.S. City Demographic Data: This data comes from OpenSoft. You can read more about it here.
  4. Airport Code Table: This is a simple table of airport codes and corresponding cities. It comes from here.

Conceptual Data Model

image

About

Udacity Data Engineering Nanodegree Capstone Project

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published