Skip to content

Python package and custom runtime to use in Azure Databricks as part of Ingenii's Data Platform

License

Notifications You must be signed in to change notification settings

ingenii-solutions/azure-data-platform-databricks-runtime

Repository files navigation

Ingenii Databricks Platform

Maintainer License Contributing

Details

Intermediate Images

  • Base OS Repository: databricks-runtime-base-os
  • Base OS Version: 0.1.0
  • Base Python Repository: databricks-runtime-base-python
  • Base Python Version: 0.1.0

Overview

This image is used with Databricks' Container Services to customise the cluster runtime in the engineering cluster of in the Ingenii Data Platform. This contains an installation of dbt and Ingenii's python package for data engineering.

Data Pipeline Overview

For an overview of the data pipeline and the stages it goes through, please refer to the Data Pipeline documentation

dbt Integration

For reading files and testing data we use dbt as a framework. For an explanation on how we use dbt and how to set up your own data sources, please refer to the Ingenii Data Engineering Example repository.

Contributions

About

Python package and custom runtime to use in Azure Databricks as part of Ingenii's Data Platform

Topics

Resources

License

Stars

Watchers

Forks

Packages

No packages published

Contributors 4

  •  
  •  
  •  
  •  

Languages