Data Modelling to analyse impact of immigrants on the climate of USA

The aim of this project is to perform the data modelling required to analyse the behavioural impact of immigrants on the climate of different US states. To accomplish this, we build a data pipeline from two large datasets: one containing immigration data and one containing temperature data. The goal is an ELT database that is well optimized for running queries and other analytical operations, so that facts and insights can be gathered from this data.

Project Dependencies

Python 3.x
PySpark >= 2.4.x
NumPy
Pandas
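
As a quick sanity check before opening the notebook, the snippet below is a minimal sketch that reports which of the listed packages cannot be imported in the current environment. The function name `missing_modules` is hypothetical and not part of the project; the sketch only checks that the packages are present and does not verify the PySpark version.

```python
import sys
from importlib.util import find_spec

# Import names for the packages listed above (PySpark, NumPy, Pandas).
REQUIRED_MODULES = ("pyspark", "numpy", "pandas")


def missing_modules(modules=REQUIRED_MODULES):
    """Return the names in `modules` that cannot be found in this environment."""
    # find_spec returns None when a top-level module is not installed.
    return [name for name in modules if find_spec(name) is None]


if __name__ == "__main__":
    if sys.version_info < (3,):
        print("Python 3.x is required")
    missing = missing_modules()
    if missing:
        print("Missing packages:", ", ".join(missing))
    else:
        print("All listed dependencies are importable")
```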

Datasets

Two datasets are used; their details are provided below:

Climate Change Dataset: This dataset comes from Kaggle and contains the average temperature of different cities across the globe.
Immigrants Dataset: This dataset comes from the US National Tourism and Trade Office and contains information about immigrants' arrivals.

Running the Project

Clone the project to your local machine.
Install all the required dependencies.
Open the Jupyter notebook named Project.ipynb and run all cells.
The modelled data will be processed and written to the <BASE_PROJECT_PATH>/output directory.
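
Once the notebook has finished, one way to confirm that something was written is to list the contents of the output directory. The sketch below assumes only the `output` directory name stated above; the helper `list_output_files` is hypothetical, and because this README does not state the format of the output files, it only lists them and does not read them.

```python
from pathlib import Path


def list_output_files(base_project_path):
    """Return the files under <base_project_path>/output as sorted relative paths."""
    output_dir = Path(base_project_path) / "output"
    if not output_dir.is_dir():
        # Nothing has been written yet, or the base path is wrong.
        return []
    return sorted(
        p.relative_to(output_dir).as_posix()
        for p in output_dir.rglob("*")
        if p.is_file()
    )


if __name__ == "__main__":
    # "." assumes the script is run from the project root; adjust as needed.
    for name in list_output_files("."):
        print(name)
```

An empty result means the output directory does not exist yet or contains no files, which usually indicates that the notebook did not run to completion.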

Author

@maheshjindal
