Conditional Merges [REQUEST] #97

angelosantos4 · 2023-09-20T18:35:23Z

Is your feature request related to a problem? Please describe.
When we ingest datatypes from different sources, we may run into the issue where we ingest data from an old source and one from a recent source with different properties. The current implementation builds a merge query which simply overwrites the properties on matched nodes based on which record was ingested first. I would like for there to be a way to conditionally change properties on a node.

Describe the solution you'd like
When I create an interpretation within my pipeline, I would like to declare the following:

merge_condition: latest # Where we could have different options latest being greater value wins. default=None
condition_key: date_created # key of the value we are comparing with.
condition_value: !!python/jmespath date_created #The value from the record we are pulling from

This would then modify the merge query which currently performs the following for source nodes:

MERGE(node:$node_type) WHERE node.key = $key
ON CREATE
    SET node.param = param.value
ON MATCH
    SET node.param = param.value

I would like it to create the following:

MERGE(node:$node_type) WHERE node.key = $key
ON CREATE
    SET node.param = param.value
ON MATCH
    SET node.condition = CASE WHEN $condition THEN true ELSE false END // We need a variable for the condition in some way
    SET node.param = CASE WHEN node.condition THEN param.value ELSE node.param END
    // Find a way to unset node.condition

Where condition in our case would be (date_created > node.date_created)

Describe alternatives you've considered
The alternative I can perform to ensure the recency of my data is I can schedule my pipelines such that the recent data comes in after the old data.
In my pipeline, I can create an interpreter that makes a call to the database to get the value, then conditionally write to the database (this takes too long.)

Additional context

The text was updated successfully, but these errors were encountered:

zprobst · 2023-10-19T23:40:42Z

I agree that this could be handy. Question marks around whether this is required in an ETL framework. Definitely willing to take PRs on this.

One major challenge is going to be around retaining the abstraction behind graph databases.

angelosantos4 added the enhancement New feature or request label Sep 20, 2023

angelosantos4 assigned zprobst Sep 20, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Conditional Merges [REQUEST] #97

Conditional Merges [REQUEST] #97

angelosantos4 commented Sep 20, 2023 •

edited

zprobst commented Oct 19, 2023

Conditional Merges [REQUEST] #97

Conditional Merges [REQUEST] #97

Comments

angelosantos4 commented Sep 20, 2023 • edited

zprobst commented Oct 19, 2023

angelosantos4 commented Sep 20, 2023 •

edited