Using ModularPipelines for Data Transformation #528

Answered by thomhurst
rlisnoff-css asked this question in Q&A
This library was designed primarily as a CI/CD library, but it should be generic enough to run any sort of pipeline. In essence, it's just an orchestrator for your jobs: it handles concurrency and the dependencies between modules for you.

You define a module for each action you want your code to perform, and then declare which of your other modules it relies on, so that it waits for them to finish before starting.
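To make the idea concrete, here is a minimal, language-agnostic sketch of that orchestration pattern, written in Python with `asyncio`. It is not the ModularPipelines API: `Module`, `run_pipeline`, and the three sample actions are hypothetical names invented for illustration, and handing each module the results of its dependencies is an assumption of this sketch.

```python
import asyncio
from typing import Any, Awaitable, Callable, Dict, List, Sequence

# Hypothetical types for illustration only; not the ModularPipelines API.
Action = Callable[[Dict[str, Any]], Awaitable[Any]]


class Module:
    """One unit of work plus the names of the modules it must wait for."""

    def __init__(self, name: str, action: Action, depends_on: Sequence[str] = ()):
        self.name = name
        self.action = action
        self.depends_on = list(depends_on)


async def run_pipeline(modules: List[Module]) -> Dict[str, Any]:
    """Run every module concurrently, each one waiting on its dependencies.

    Assumes all dependency names exist and that there are no cycles
    (a cycle would make the modules wait on each other forever).
    """
    tasks: Dict[str, "asyncio.Task[Any]"] = {}

    async def run(module: Module) -> Any:
        # Awaiting a dependency's task blocks this module until it finishes.
        inputs = {dep: await tasks[dep] for dep in module.depends_on}
        return await module.action(inputs)

    # Tasks only start running once we await below, so every entry in
    # `tasks` exists before any module looks up its dependencies.
    for module in modules:
        tasks[module.name] = asyncio.create_task(run(module))

    results = await asyncio.gather(*tasks.values())
    return dict(zip(tasks.keys(), results))


async def extract(_: Dict[str, Any]) -> List[int]:
    return [1, 2, 3]


async def double(inputs: Dict[str, Any]) -> List[int]:
    return [x * 2 for x in inputs["extract"]]


async def total(inputs: Dict[str, Any]) -> int:
    return sum(inputs["double"])


def demo() -> Dict[str, Any]:
    # Declaration order does not matter; dependencies decide the run order.
    modules = [
        Module("total", total, depends_on=["double"]),
        Module("extract", extract),
        Module("double", double, depends_on=["extract"]),
    ]
    return asyncio.run(run_pipeline(modules))


if __name__ == "__main__":
    print(demo())
```

Modules that don't depend on each other run concurrently, while a dependent module simply awaits the ones it needs, which is the behavior described above.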

A module is an abstract class, so you implement whatever code/action you want it to perform, and full dependency injection is supported if needed. A module's execute method simply performs the data transformation and returns the result. What you return i…

Answer selected by rlisnoff-css