[FEATURE] Skip reprocessing of entire rows based on finer-grain lineage information #1057

@georgeh0

Description

Currently, we skip reprocessing a source row when neither the row nor the flow logic has changed since the last run.

Whether the flow logic has changed is determined from the entire flow spec. This is too coarse-grained: not every spec change should trigger reprocessing (e.g. exclude_pattern changes on sources, or reordering / renaming of intermediate fields). We want to detect relevant changes at a finer granularity, based on lineage information.
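
To make the idea concrete, here is a minimal sketch of a lineage-based fingerprint. It is not the existing implementation: the `Op` class, `LISTING_ONLY_PARAMS`, `logic_fingerprint` and `needs_reprocessing` are all hypothetical names made up for illustration, and the flow is modeled as a simple graph of operations. The fingerprint of a target depends only on the operations upstream of it, and it ignores field names as well as source options that only affect which rows get listed.

```python
import hashlib
import json
from dataclasses import dataclass

# Assumption: source options that only decide WHICH rows are listed,
# not what a given row evaluates to. The name comes from the issue text.
LISTING_ONLY_PARAMS = {"exclude_pattern"}


@dataclass
class Op:
    """Hypothetical flow node: one operation producing one field."""
    name: str           # field name; cosmetic, excluded from the fingerprint
    kind: str           # e.g. "source", "split", "embed"
    params: dict        # JSON-serializable parameters of the operation
    inputs: tuple = ()  # upstream Op objects this field is derived from


def logic_fingerprint(op: Op) -> str:
    """Hash only what can change the value computed for a single row."""
    payload = {
        "kind": op.kind,
        "params": {k: v for k, v in op.params.items()
                   if k not in LISTING_ONLY_PARAMS},
        # Recurse into the lineage; argument order is kept on purpose.
        "inputs": [logic_fingerprint(i) for i in op.inputs],
    }
    encoded = json.dumps(payload, sort_keys=True).encode("utf-8")
    return hashlib.sha256(encoded).hexdigest()


def needs_reprocessing(row_hash: str, target: Op, cached) -> bool:
    """cached is the (row_hash, fingerprint) pair stored last run, or None."""
    return cached != (row_hash, logic_fingerprint(target))
```

Under this sketch, renaming `doc` to `document` or changing `exclude_pattern` on the source leaves the fingerprint of a downstream `split` unchanged, while changing the split's own parameters (say, a chunk size) changes it. A real implementation would also need to account for the version of each operation's code, which this sketch leaves out.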


❤️ Contributors, please refer to 📙Contributing Guide.
Unless the PR can be sent immediately (e.g. just a few lines of code), we recommend leaving a comment on the issue, such as "I'm working on it" or "Can I work on this issue?", to avoid duplicated work. Our Discord server is always open and friendly.