Skip to content

sarathchandrikak/ETL-Bank-Transcation

Repository files navigation

ETL-Bank-Transcation

Data Analysis of bank transaction data

Steps Performed:

Extracting the transactional data from a given MySQL RDS server to HDFS(EC2) instance using Sqoop.

Transforming the transactional data according to the given target schema using PySpark. 

This transformed data is to be loaded to an S3 bucket.

Creating the Redshift tables according to the given schema.

Loading the data from Amazon S3 to Redshift tables.

Performing the analysis queries.

Releases

No releases published

Packages

No packages published