Skip to content

guc-met/DE-W22

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

32 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Data-Engineering-W22

This repository contains the notebooks for the Data Engineering course.

Lab
#
Topic Lab
Notebook
Exercise
Solutions Notebook
1 Explore Your Data Lab 1
Open In Colab
2 Data Visualization Lab 2
Open In Colab
Lab 2 Solution
Open In Colab
3 Data Visualization II Lab 3
Open In Colab
Lab 3 Solution
Open In Colab
4 Data Cleaning Lab 4
Open In Colab
Lab 4 Solution
Open In Colab
5 Data Transformation Lab 5
Open In Colab
Lab 5 Solution
Open In Colab
6 Data Integration Lab 6
Open In Colab
Lab 6 Solution
Open In Colab
7 Data Tidying Lab 7
Open In Colab
Lab 7 Solution
Open In Colab
8 Airflow Slides

Prerequisites

This repository requires that you have:-

Installation of Prerequisites

Easy way (More HD space, less hassle)

Install Anaconda then just run Jupyter.

Hard way (Less HD space, more hassle)

Install Python3+

Make sure Python and pip are added to environment variables Python

From your Linux, Mac, or Windows terminal, verify that both are installed correctly.

$ python --version
$ pip --version

Using the same terminal install numpy, matplotlib, pillow and notebook

$ pip install numpy matplotlib pillow notebook

Alternative way (Cloud but you have to upload the data)

Click on the Open in Colab next to the notebook you would like to exercise.

Upload the data needed as Data.zip using the following command

from google.colab import files
uploaded = files.upload()

Extract the zipped folder into the cloud using the following command

!unzip [foldername].zip

How To Run

From your terminal, run this command then navigate to the .ipynb file you would like to exercise

jupyter notebook

License

MIT License

About

Data Engineering Course W22

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published