Building a Data Warehouse using AWS Redshift

Project structure

The project contains the following files:

  • blob/data-pipeline-architecture.drawio: a diagram illustrating how data is loaded from the Sparkify source into the Redshift data warehouse, with a visualization tool on top for exploring the data.
  • create_tables.py: drops the tables in Amazon Redshift if they already exist and recreates them. This script must be run before each run of the ETL script that loads data into Redshift.
  • sql_queries.py: contains all the SQL commands; data is loaded into Redshift from S3 (see the sketch after this list).
  • README.md: provides information about the project and step-by-step instructions for running it.
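
A minimal sketch of what sql_queries.py might look like; the staging table, its columns, the S3 path, and the IAM role below are illustrative assumptions, not the repository's actual values:

  # sql_queries.py -- hypothetical excerpt; names and ARNs are placeholders
  staging_events_table_drop = "DROP TABLE IF EXISTS staging_events;"

  staging_events_table_create = """
      CREATE TABLE IF NOT EXISTS staging_events (
          artist VARCHAR,
          song   VARCHAR,
          ts     BIGINT
      );
  """

  # Redshift COPY loads the staged files directly from S3
  staging_events_copy = """
      COPY staging_events
      FROM 's3://sparkify-data-lake/log_data'
      IAM_ROLE 'arn:aws:iam::123456789012:role/redshift-s3-read'
      FORMAT AS JSON 'auto';
  """

  drop_table_queries = [staging_events_table_drop]
  create_table_queries = [staging_events_table_create]
  copy_table_queries = [staging_events_copy]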

Run The Project

Synchronize data:

  • Synchronize data from Sparkify to the data lake (S3) using the AWS CLI s3 sync command.

Create tables in Redshift:

  • Run the create_tables.py script.
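
The two steps above can be run from a terminal; the local data folder and bucket name below are assumptions for illustration:

  # hypothetical local data folder and S3 bucket
  aws s3 sync ./sparkify-data s3://sparkify-data-lake/

  # drop and recreate the warehouse tables before loading
  python create_tables.py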

Scripts - Draft

Create a connection to the Redshift data warehouse:

  import psycopg2

  # Connect to the Redshift cluster endpoint (replace with your own cluster, user and password)
  conn = psycopg2.connect(
      "host='ktdl-redshift-cluster-1.cfl4l9luhdaw.ap-southeast-1.redshift.amazonaws.com' "
      "dbname=dev user=clphan password=Clphan259 port=5439"
  )
  cur = conn.cursor()

  # Run a query against the warehouse, e.g. one of the statements from sql_queries.py
  query = "SELECT 1;"  # placeholder query
  cur.execute(query)
  conn.commit()

  conn.close()

Connect to Redshift using psql. The example below uses dummy credentials; the cluster endpoint, database name, and user are placeholders:
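
  # placeholder cluster endpoint, database, and user; psql prompts for the password
  psql -h example-cluster.abc123xyz.ap-southeast-1.redshift.amazonaws.com -p 5439 -d dev -U awsuser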