A PySpark-based data pipeline for cleaning, validating, and analyzing retail sales data using AWS S3, Glue, and Athena
Updated Aug 2, 2025 - Python