Skip to content

Release life cycle for data specifically, as opposed to software development life cycle

License

Notifications You must be signed in to change notification settings

data-engineering-helpers/data-life-cycle

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

8 Commits
 
 
 
 

Repository files navigation

Data Release Life Cycle

Table of Content (ToC)

Overview

This project intends to document requirements and referential material about data life cycle, in particular to differentiate it from the software delivery.

Even though the members of the GitHub organization may be employed by some companies, they speak on their personal behalf and do not represent these companies.

References

Articles

Dev/Stage/Prod is the Wrong Pattern for Data Pipelines

Putting the Write-Audit-Publish Pattern (WAP) into Practice with lakeFS

How to Implement Write-Audit-Publish (WAP)

Virtual Data Environments

Books

Continuous Delivery

  • Title: Continuous Delivery: Reliable Software Releases through Build, Test, and Deployment Automation
  • Authors: Jez Humble and David Farley
  • Date: 27 Jul. 2010
  • ASIN:‎ 0321601912
  • Publisher: ‎Addison-Wesley Professional; 1st edition
  • ISBN-10: ‎ 9780321601919
  • ISBN-13: ‎ 978-0321601919
  • Link to the book home page: https://martinfowler.com/bliki/ContinuousDelivery.html

Frameworks / tools

LakeFS

lakeFS is an open-source tool that transforms your object storage into a Git-like repository. It enables you to manage your data lake the way you manage your code.

With lakeFS you can build repeatable, atomic, and versioned data lake operations - from complex ETL jobs to data science and analytics.

lakeFS supports AWS S3, Azure Blob Storage, and Google Cloud Storage as its underlying storage service. It is API compatible with S3 and works seamlessly with all modern data frameworks such as Spark, Hive, AWS Athena, DuckDB, and Presto.

For more information, see the documentation.

About

Release life cycle for data specifically, as opposed to software development life cycle

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published