The primary objective of the project is to build a Bash Command Line tool that performs useful data preparation tasks such as cleaning, truncating, and sorting data.
-
Updated
Jan 5, 2023 - Dockerfile
The primary objective of the project is to build a Bash Command Line tool that performs useful data preparation tasks such as cleaning, truncating, and sorting data.
Data-Driven Software Engineering Studies
Custom development Spark cluster + a Python SnowFlake connector. Docker image. (Current versions Spark: 3.2.1 Hadoop: 3.2 Python: 3.9 Snowflake connector: 2.7.4)
Add a description, image, and links to the dataengineering topic page so that developers can more easily learn about it.
To associate your repository with the dataengineering topic, visit your repo's landing page and select "manage topics."