The Extract Load Transform (ELT) framework is a metadata-based batch orchestration framework for modern data platforms, implemented using Azure PaaS data services. Common ingestion and transformation patterns are available out of the box, and the reusable code can be easily extended to cater to custom patterns.

bennyaustin/elt-framework

elt-framework

The Extract Load Transform (ELT) framework is a metadata-driven orchestration framework designed for modern cloud data platforms. It simplifies ingestion and transformation pipelines, giving a consistent development experience and easier maintenance. The framework supports batch ingestion and has been extensively tested with Microsoft Fabric and Azure managed services such as Azure Databricks and Azure Synapse. It uses an ANSI-compatible control database as the metadata repository.
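To make the metadata-driven idea concrete, here is a minimal sketch in Python. It is not the framework's implementation: the `ingest_definition` table, its columns, and the handler functions are hypothetical names chosen for illustration, and an in-memory SQLite database stands in for the control database. The point is only that rows of metadata, rather than hard-coded pipelines, decide what gets ingested.

```python
import sqlite3
from typing import Callable, Dict, List


def build_control_db() -> sqlite3.Connection:
    """Create a stand-in control database (hypothetical schema, not the framework's)."""
    conn = sqlite3.connect(":memory:")
    conn.execute(
        "CREATE TABLE ingest_definition ("
        " id INTEGER PRIMARY KEY,"
        " source_type TEXT NOT NULL,"
        " entity_name TEXT NOT NULL,"
        " destination TEXT NOT NULL)"
    )
    # Each row describes one ingestion; no connection strings are stored.
    conn.executemany(
        "INSERT INTO ingest_definition (source_type, entity_name, destination)"
        " VALUES (?, ?, ?)",
        [
            ("database", "sales.orders", "raw/orders"),
            ("rest_api", "customers", "raw/customers"),
        ],
    )
    conn.commit()
    return conn


# One generic handler per source type; these stand in for reusable pipelines.
def ingest_database(entity: str, destination: str) -> str:
    return f"copied table {entity} to {destination}"


def ingest_rest_api(entity: str, destination: str) -> str:
    return f"fetched endpoint {entity} to {destination}"


HANDLERS: Dict[str, Callable[[str, str], str]] = {
    "database": ingest_database,
    "rest_api": ingest_rest_api,
}


def run_batch(conn: sqlite3.Connection) -> List[str]:
    """Read the metadata rows and dispatch each one to the matching handler."""
    rows = conn.execute(
        "SELECT source_type, entity_name, destination"
        " FROM ingest_definition ORDER BY id"
    ).fetchall()
    return [HANDLERS[source_type](entity, dest) for source_type, entity, dest in rows]


if __name__ == "__main__":
    for line in run_batch(build_control_db()):
        print(line)
```

In this sketch, adding a new ingestion means inserting a row, not writing a new pipeline; supporting a new source type means registering one more handler.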

Key Features:

  • Configurable and Extendable: Easily adapt the framework to meet specific needs.
  • Data Source Agnostic: Ingest data from various sources such as databases, Delta Lake, REST APIs, flat files, JSON, and XML, without storing connection strings as metadata.
  • Delta and Full Loads: Support for both incremental and full data loads.
  • Re-run and Retry Capability: Automatically handle failures without manual intervention.
  • In-built Audit Tracking: Track data processing activities with built-in audit capabilities.
  • Extended Audit Capability: Enhance audit tracking with Azure PaaS services like Diagnostic Logging.
  • Eliminates Manual Data Patching: Streamline data processing by removing the need for manual interventions.
  • Data Lineage Support: Maintain data lineage throughout the data lifecycle.
  • Level1 and Level2 Transformations: Support for one-to-many and many-to-many transformations.
  • On-demand Pipeline and Transformation Management: Enable or disable pipelines and transformations as needed.
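Three of the features above (delta versus full loads, retry on failure, and on-demand enable/disable) can be illustrated with a short Python sketch. This is an assumption-laden illustration, not the framework's code: `IngestInstance`, its fields, the integer watermark, and the `write` callback are all hypothetical, and a real delta load would typically filter on a timestamp or change-tracking column at the source.

```python
from dataclasses import dataclass
from typing import Callable, List, Optional, Sequence


@dataclass
class IngestInstance:
    """Hypothetical metadata record describing one ingestion run."""
    entity: str
    load_type: str                    # "full" or "delta"
    enabled: bool = True              # on-demand switch
    watermark: Optional[int] = None   # highest key loaded so far
    max_retries: int = 2              # extra attempts after the first failure


def select_rows(instance: IngestInstance, source_rows: Sequence[int]) -> List[int]:
    """Full load takes everything; delta load takes only rows above the watermark."""
    if instance.load_type == "full" or instance.watermark is None:
        return list(source_rows)
    return [row for row in source_rows if row > instance.watermark]


def run_instance(
    instance: IngestInstance,
    source_rows: Sequence[int],
    write: Callable[[List[int]], None],
) -> str:
    """Run one ingestion, retrying transient failures before giving up."""
    if not instance.enabled:
        return "skipped"
    batch = select_rows(instance, source_rows)
    for attempt in range(instance.max_retries + 1):
        try:
            write(batch)
        except RuntimeError:
            continue  # treated as transient in this sketch; try again
        if batch:
            # Advance the watermark only after a successful write.
            instance.watermark = max(batch)
        return f"succeeded on attempt {attempt + 1} with {len(batch)} rows"
    return "failed"
```

Because the watermark moves only after a successful write, a re-run after a failure picks up the same rows again instead of silently skipping them, which is the behaviour that removes the need for manual patching in this simplified model.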

The framework includes several reusable artifacts such as data source-specific Data Factory pipelines, Spark notebooks, and Logic apps, which can be readily used or extended for custom use cases.

For extensive documentation, visit our Wiki.

Implementation References
