Reusable Python classes that extend open-source PySpark capabilities. Implementation examples are available in the notebooks of the repo https://github.com/bennyaustin/synapse-dataplatform
Updated Nov 1, 2024 - Python
End-to-end ETL pipeline in the Microsoft Azure cloud - (Jun '24 - Jul '24)
Development of a Data Pipeline using Azure Synapse
An end-to-end data engineering pipeline that fetches data from Wikipedia, cleans and transforms it with Apache Airflow, and stores it in Azure Data Lake. Further processing takes place in Azure Data Factory, Azure Synapse and Tableau.
A B2B solution and architecture, including code for an Azure Function App and SQL Server, plus integration with Azure Synapse and Event Hubs.
Containerized tool for load testing Azure SQL Database and Azure Synapse Analytics SQL pool
Python script that cleans the JSON document used by an Azure Data Factory or Azure Synapse Copy Activity to specify the source-sink mapping.
Azure ARM Templates + Synapse Analytics Playground
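As an illustration of the kind of cleanup the Copy Activity mapping script listed above might perform, here is a minimal sketch. It assumes the mapping document is a TabularTranslator object with a "mappings" list of source/sink column entries, and that "cleaning" means keeping only the column names. The function name clean_mapping, the KEEP_KEYS set and the sample columns are hypothetical and are not taken from that repository.

```python
import json
from typing import Any

# Keys kept on each source/sink column entry; everything else is dropped.
# This rule is an assumption for illustration, not the repository's logic.
KEEP_KEYS = {"name"}


def clean_mapping(translator: dict[str, Any]) -> dict[str, Any]:
    """Return a copy of a TabularTranslator with only column names kept."""
    cleaned = []
    for entry in translator.get("mappings", []):
        cleaned.append({
            side: {k: v for k, v in entry.get(side, {}).items() if k in KEEP_KEYS}
            for side in ("source", "sink")
        })
    return {"type": translator.get("type", "TabularTranslator"), "mappings": cleaned}


if __name__ == "__main__":
    # Hypothetical mapping with extra type metadata on each column.
    raw = {
        "type": "TabularTranslator",
        "mappings": [
            {
                "source": {"name": "Id", "type": "Int32", "physicalType": "int"},
                "sink": {"name": "CustomerID", "type": "Int32"},
            }
        ],
    }
    print(json.dumps(clean_mapping(raw), indent=2))
```

Returning a new dictionary rather than editing in place leaves the original mapping available for comparison.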