perf(sql): Store child pipelines as references #3986

robzienert · 2020-11-02T18:54:32Z

Changes pipeline triggers to be stored as a reference, rather than as the full execution.
This makes individual pipelines smaller to store, trading additional SQL queries to hydrate
the full PipelineExecution.

cfieber · 2020-11-02T19:18:31Z

...rc/main/kotlin/com/netflix/spinnaker/orca/sql/pipeline/persistence/SqlExecutionRepository.kt

@@ -830,10 +832,39 @@ class SqlExecutionRepository(
        stages.forEach { storeStageInternal(ctx, it, executionId) }
      }
    } finally {
+      // Restore original object state.


the fact that we need to do this seems a bit wild to me - I wonder if it's worth just objectMapper copying the execution on the way in here so we are free to mutate it as we see fit..

unless the expectation is that the repository mutates to execution to set Ids on it or something...

anyway not super relevate to the overall change in this PR

This was an optimization. Saving the execution is a high-throughput (as far as Orca is concerned) operation, and serialization of executions is very high cost.

cfieber · 2020-11-02T19:19:12Z

the approach definitely makes sense to me

marchello2000 · 2020-11-02T19:32:49Z

orca-sql/src/main/kotlin/com/netflix/spinnaker/orca/sql/pipeline/persistence/ExecutionMapper.kt

+        // throw an exception, we'll continue to load the execution with [PipelineRefTrigger] and let downstream
+        // consumers throw exceptions if they need to. We don't want to throw here as it would break pipeline list
+        // operations, etc.
+        log.error("Attempted to load parent execution for '${execution.id}', but it no longer exists: ${trigger.parentExecutionId}")


nit: I would make this a warn so it doesn't show up sentry, etc.
Also, it's not really an error here, as you state. It's an error at a time when someone downstream needs it

marchello2000 · 2020-11-02T19:35:23Z

I remember someone mentioning that some users have (dozens/hundreds?!) parent pipelines. What happens in those cases, does the perf suffer a lot? Otherwise, i think it makes a ton of sense

robzienert · 2020-11-02T22:31:31Z

@marchello2000 Perf suffers to the point until where MySQL's (or any SQL's) buffer space runs out -- in some pathological community deployments, people are seeing pipeline payloads going as high as 1GB because of the embedded pipeline trigger. When this buffer space runs out, it causes the query to fail and, as a result, anything in Orca failing since it's an irrecoverable error until someone increases the buffer size... which is not always possible.

ajordens · 2020-11-03T22:54:34Z

This is sensible to me.

Changes pipeline triggers to be stored as a reference, rather than as the full execution. This makes individual pipelines smaller to store, trading additional SQL queries to hydrate the full `PipelineExecution`.

robzienert

This PR is now ready for review.

robzienert · 2020-11-06T20:47:17Z

...sql/src/main/kotlin/com/netflix/spinnaker/orca/sql/PipelineRefTriggerDeserializerSupplier.kt

+import com.netflix.spinnaker.orca.pipeline.model.support.mapValue
+import com.netflix.spinnaker.orca.sql.pipeline.persistence.PipelineRefTrigger
+
+class PipelineRefTriggerDeserializerSupplier : CustomTriggerDeserializerSupplier {


This needed to be made as a CustomTriggerDeserializerSupplier because TriggerDeserializer is in orca-core, whereas PipelineRefTrigger is an implementation detail of orca-sql only.

dilippai · 2020-12-10T18:25:39Z

I would encourage the community to not consider use cases like nested pipelines as pathological or abnormal. As Spinnaker adoption grows, especially in the enterprise, you will find usage patterns that are different from yours -- an open source pipelining system needs to be at least somewhat flexible to pipeline patterns that users are naturally likely to gravitate towards.

This is an issue that has become existential for us (Salesforce). I'm not sure why the decision was made to store json as a blob in Orca, but this is leading to huge performance issues culminating in failures.

Nirmalyasen · 2020-12-17T05:55:52Z

Since this is reverted back, what is the solution? Is there another PR to address the issue?

This is becoming a big issue. Also, somewhere in 1.20, the no of artifacts produced out of a deployment manifest stage has increased many folds. So, that compounds the issue of child contexts in the pipelines.

Changes pipeline triggers to be stored as a reference, rather than as the full execution. This makes individual pipelines smaller to store, trading additional SQL queries to hydrate the full `PipelineExecution`.

robzienert requested review from cfieber, plumpy, ajordens, ezimanyi, dreynaud, jonsie and ethanfrogers November 2, 2020 18:54

robzienert marked this pull request as draft November 2, 2020 18:54

cfieber reviewed Nov 2, 2020

View reviewed changes

marchello2000 reviewed Nov 2, 2020

View reviewed changes

titirigaiulian mentioned this pull request Nov 5, 2020

Large deployment pipelines with multiple children results in huge pipeline body spinnaker/spinnaker#6159

Open

jitzpop mentioned this pull request Nov 6, 2020

Orca: excessive MySQL DB usage for pipelines with many stages spinnaker/spinnaker#6006

Closed

robzienert force-pushed the child-pipeline-optimization branch from 10ca824 to 8dc6210 Compare November 6, 2020 20:39

robzienert marked this pull request as ready for review November 6, 2020 20:45

robzienert force-pushed the child-pipeline-optimization branch from 8dc6210 to 23b1561 Compare November 6, 2020 20:47

perf(sql): Store child pipelines as references

ab5c739

Changes pipeline triggers to be stored as a reference, rather than as the full execution. This makes individual pipelines smaller to store, trading additional SQL queries to hydrate the full `PipelineExecution`.

robzienert force-pushed the child-pipeline-optimization branch from 23b1561 to ab5c739 Compare November 6, 2020 20:50

robzienert commented Nov 6, 2020

View reviewed changes

Merge branch 'master' into child-pipeline-optimization

75d6278

robzienert added the ready to merge Approved and ready for merge label Nov 30, 2020

mergify bot merged commit cc93ece into spinnaker:master Nov 30, 2020

mergify bot added the auto merged Merged automatically by a bot label Nov 30, 2020

robzienert mentioned this pull request Nov 30, 2020

revert(sql): Revert storing references for child pipelines #4014

Merged

spinnakerbot added the target-release/1.24 label Dec 3, 2020

emagana-zz mentioned this pull request Dec 16, 2020

Kubernetes operations should not include the entire manifest in pipeline context spinnaker/spinnaker#5909

Open

dbyron-sf mentioned this pull request Jun 13, 2024

perf(sql): Store child pipeline execution in trigger as reference #4749

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

perf(sql): Store child pipelines as references #3986

perf(sql): Store child pipelines as references #3986

robzienert commented Nov 2, 2020 •

edited

Loading

cfieber Nov 2, 2020

robzienert Nov 2, 2020

cfieber commented Nov 2, 2020

marchello2000 Nov 2, 2020

marchello2000 commented Nov 2, 2020

robzienert commented Nov 2, 2020 •

edited

Loading

ajordens commented Nov 3, 2020

robzienert left a comment

robzienert Nov 6, 2020

dilippai commented Dec 10, 2020

Nirmalyasen commented Dec 17, 2020

perf(sql): Store child pipelines as references #3986

perf(sql): Store child pipelines as references #3986

Conversation

robzienert commented Nov 2, 2020 • edited Loading

cfieber Nov 2, 2020

Choose a reason for hiding this comment

robzienert Nov 2, 2020

Choose a reason for hiding this comment

cfieber commented Nov 2, 2020

marchello2000 Nov 2, 2020

Choose a reason for hiding this comment

marchello2000 commented Nov 2, 2020

robzienert commented Nov 2, 2020 • edited Loading

ajordens commented Nov 3, 2020

robzienert left a comment

Choose a reason for hiding this comment

robzienert Nov 6, 2020

Choose a reason for hiding this comment

dilippai commented Dec 10, 2020

Nirmalyasen commented Dec 17, 2020

robzienert commented Nov 2, 2020 •

edited

Loading

robzienert commented Nov 2, 2020 •

edited

Loading