Rucal-Data-Solutions/datalakefoundation

Datalake(house) Foundation

Datalake Foundation is a library that processes data to make it ready for transformation (your business logic). It is founded on data lakehouse principles and therefore fits well within a data lakehouse architecture. It takes a data slice (Parquet) from bronze and processes it according to the configuration (metadata).

Implemented features:

  • Processing
    • Full load
    • Delta processing
  • Metadata
    • JSON metadata (JsonMetadataSettings)
    • SQL Server database (SqlMetadataSettings)
  • Data factory
    • Item generator (for loops)

This documentation is a work in progress; please DM me for details.

Slice (Bronze) location:
<root_folder>/bronze/<connection>/<entity_name>/<slice_file>

Silver location:
<root_folder>/silver/<connection>/<entity_name>/<slice_file>
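The two layouts above differ only in the layer folder, so a slice's silver location can be derived from its bronze location. The following is a minimal Python sketch of that convention, not part of the library itself: the function names `slice_path` and `silver_path_for`, and the example values (`/mnt/datalake`, `crm`, `customer`, `slice.parquet`), are hypothetical and chosen only for illustration.

```python
from pathlib import PurePosixPath

# Layer folder names taken from the path layouts documented above.
LAYERS = ("bronze", "silver")


def slice_path(root_folder, layer, connection, entity_name, slice_file):
    """Build <root_folder>/<layer>/<connection>/<entity_name>/<slice_file>."""
    if layer not in LAYERS:
        raise ValueError(f"unknown layer: {layer!r}")
    return PurePosixPath(root_folder) / layer / connection / entity_name / slice_file


def silver_path_for(bronze_path):
    """Map a bronze slice path to the silver path with the same trailing parts."""
    parts = PurePosixPath(bronze_path).parts
    if len(parts) < 4:
        raise ValueError(f"path too short to be a slice path: {bronze_path!r}")
    # The last four parts are layer, connection, entity name and slice file;
    # everything before them is treated as the root folder.
    *root_parts, layer, connection, entity_name, slice_file = parts
    if layer != "bronze":
        raise ValueError(f"not a bronze slice path: {bronze_path!r}")
    return PurePosixPath(*root_parts) / "silver" / connection / entity_name / slice_file


if __name__ == "__main__":
    # Hypothetical example values, for illustration only.
    bronze = slice_path("/mnt/datalake", "bronze", "crm", "customer", "slice.parquet")
    print(bronze)
    print(silver_path_for(bronze))
```

`PurePosixPath` is used so that the forward-slash layout is preserved regardless of the operating system the sketch runs on.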