Skip to content

This project builds a pipeline to analyze Superstore sales data using the power of AWS. It transforms the data to make it ready for exploration. Querying the transformed data using SQL queries to uncover trends and patterns. Analyzing results and creates easy-to-understand visualizations, providing clear insights into Superstore sales performance.

Notifications You must be signed in to change notification settings

bhavanachitragar/Superstore-Data-Analysis-using-AWS

Repository files navigation

Superstore Data Analysis using AWS


This project builds a pipeline to analyze Superstore sales data using the power of AWS. It transforms the data to make it ready for exploration. Querying the transformed data using SQL queries to uncover trends and patterns. Analyzing results and creates easy-to-understand visualizations, providing clear insights into Superstore sales performance.

AWS services used:

  • IAM
  • S3
  • AWS Glue
  • AWS Athena
  • AWS QuickSight

Screenshot 2024-05-22 205016

1. IAM (Identity and Access Management):

Creating an IAM user which defines permissions for users and applications to access and manage data in other services like S3, Glue, Athena, and QuickSight.

2. S3 (Simple Storage Service):

S3 Bucket serves as the data storage repository where raw data is uploaded before processing. Created different folders which helps Crawler for Partition.

3. AWS Glue:

Glue helps in extract, transform, and load (ETL) data. Running a Crawler to create a Data Catalog.

4. AWS Athena:

Athena enables querying the transformed data stored in S3 by Glue. It helped in running SQL queries.

Screenshot 2024-05-22 181525

Screenshot 2024-05-22 181538

5. Amazon QuickSight:

QuickSight helps in creating visualizations and dashboards from data sources. It used the results from Athena’s analysis of the data for data visualization.

Sanpshots:

Screenshot 2024-05-22 181446

Screenshot 2024-05-22 203727


Kaggle dataset: https://www.kaggle.com/datasets/vivek468/superstore-dataset-final

About

This project builds a pipeline to analyze Superstore sales data using the power of AWS. It transforms the data to make it ready for exploration. Querying the transformed data using SQL queries to uncover trends and patterns. Analyzing results and creates easy-to-understand visualizations, providing clear insights into Superstore sales performance.

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published