provide a pipeline concurrency limit #1305

jstrachan · 2019-09-13T04:23:13Z

Expected Behavior

Its common in CI systems to limit the number of concurrent pipelines that can execute on a given repository and branch. e.g. process PRs concurrently, but only allow a maximum of 1 release to be performed at once in case releases clash with each other. e.g. to avoid race conditions between concurrent pipeline steps that operate on shared git repositories/buckets/kubernetes clusters.

e.g. imagine a simple pipeline of

get the next incrementing version number
spend some time building artifacts/images
use kubectl to deploy some resources using this version number or update the website/changelog

If you run this pipeline concurrently all kinds of things could happen due to the wonders of concurrency (e.g. seeing the version number go forwards then backwards).

When working on separate PRs concurrency is not usually an issue; but working on shared resources (e.g. producing a sequential stream of artifacts or updating a shared cluster) we often want to force a clean ordering on the pipelines to avoid confusion or worse things.

Actual Behavior

There's currently no way to force a pipeline to not execute until all other pipelines for that repository + branch have completed without writing some kind of leader election step.

We're pondering writing a little leader election step as a workaround (which would be Jenkins X specific jenkins-x/jx#5471); but figure it would be nice to be able to add this kind of capability into the tekton controller.

If you squint its a little like the tekton controller being like the ReplicaSet controller; if replicas = 1 for a unique string (e.g. the git repository URL + branch name), only start a new Pod for a PipelineRun when no others are running for that string.

Steps to Reproduce the Problem

run lots of pipelines like the above example and watch the version number go up and down

Additional Info

If there was some kind of MaximumConcurrency for a specific source repository and branch we could modify the tekton controller to only create a new Pod when it knows there are no other running pods for a given source repository + branch.

The text was updated successfully, but these errors were encountered:

vdemeester · 2019-11-27T11:30:03Z

/kind feature
/kind question

I wonder if this should be a feature for pipeline (core) or some integration tooling (like jenkins-x)

bobcatfish · 2019-11-27T13:26:21Z

This sounds like it could be pretty cool!! I agree with @vdemeester in that I'm not 100% sure where it would best belong - but there have been some related ideas that have come up lately (but are slightly more complicated) such as:

Keeping track of how much $ a Pipeline is costing (how many resources it's consuming) and limiting how frequently it can run as a result

I think it could be pretty cool to think about what it would be like to apply generic policies to Pipeline execution, maybe in an admission controller, so it could be decoupled from the Tekton Pipelines codebase but could be ultra flexible.

assertion · 2020-01-14T06:17:45Z

Any updates about this issue? @bobcatfish @vdemeester
We are alse facing this situation when switching repo based CI/CD pipelines to tekton.

bobcatfish · 2020-01-28T15:13:59Z

Hey @assertion ! I don't think there has been any movement but if you (or anyone else in the community) wants to take this on I think we'd be happy to see it! Let me know if you want any pointers re. next steps to work on this.

Fabian-K · 2020-02-10T10:50:39Z

I looked into the approach using an admission controller. To ensure a pipeline has no concurrent executions, I registered a validating admission controller for creation of taskruns. As the incoming AdmissionReview contains the taskrun definition, I can query for other running pipelines and as a result return either true or false.

This works fine as the tekton controller retries creating the taskrun after it was rejected.

As the retry seems to follow some exponential backoff pattern, I´m a bit worried that if this is requested multiple times, a lot of time might be wasted. Any ideas about that? Do you know where I can find details about the backoff pattern?

tekton-robot · 2020-08-13T10:27:17Z

Stale issues rot after 30d of inactivity.
Mark the issue as fresh with /remove-lifecycle rotten.
Rotten issues close after an additional 30d of inactivity.
If this issue is safe to close now please do so with /close.

/lifecycle rotten

Send feedback to tektoncd/plumbing.

tekton-robot · 2020-08-13T10:27:17Z

Issues go stale after 90d of inactivity.
Mark the issue as fresh with /remove-lifecycle stale.
Stale issues rot after an additional 30d of inactivity and eventually close.
If this issue is safe to close now please do so with /close.

/lifecycle stale

Send feedback to tektoncd/plumbing.

tekton-robot · 2020-08-13T10:27:18Z

Rotten issues close after 30d of inactivity.
Reopen the issue with /reopen.
Mark the issue as fresh with /remove-lifecycle rotten.

/close

Send feedback to tektoncd/plumbing.

tekton-robot · 2020-08-13T10:27:20Z

@tekton-robot: Closing this issue.

In response to this:

Rotten issues close after 30d of inactivity.
Reopen the issue with /reopen.
Mark the issue as fresh with /remove-lifecycle rotten.

/close

Send feedback to tektoncd/plumbing.

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository.

bobcatfish · 2020-08-13T15:56:49Z

I'm gonna reopen this one, I wonder if we should add it to the roadmap also 🤔

/reopen

tekton-robot · 2020-08-13T15:56:51Z

@bobcatfish: Reopened this issue.

In response to this:

I'm gonna reopen this one, I wonder if we should add it to the roadmap also 🤔

/reopen

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository.

bobcatfish · 2020-08-24T15:26:37Z

We're adding this to our roadmap, I changed the name a bit to reflect that we feel like we'd want to address this in general but maybe not specifically for branch + repo only

imjasonh · 2020-08-24T18:19:41Z

Some thoughts on how this could possibly work, feel free to propose alternatives:

introduce a "concurrency bucket" CRD with a cap on task runs and/or resources, and/or something else
have PipelineRuns and TaskRuns state what bucket they're counting against; possibly using an annotation?
triggers could populate with some key based on repo+branch (or just repo, or org, etc.)
PipelineRun controller holds runs in a concurrency-limited state until there's room in the bucket's cap

Open Questions:

should items be unblocked FIFO? At random? Scheduled based on requests?
how should this interact with existing K8s features for limiting resource usage in a namespace? Can operators use these features effectively today as a stopgap?

bobcatfish · 2020-08-24T19:33:55Z

Related issue: #2591

afrittoli · 2021-02-09T17:58:59Z

@jstrachan work on this is happening in experimental tektoncd/experimental#699, please continue the investigation / discussion on this there. Closing this one for now.

jstrachan mentioned this issue Sep 13, 2019

provide an exclusive lock step in a pipeline to avoid concurrent builds jenkins-x/jx#5471

Closed

tekton-robot added kind/feature Categorizes issue or PR as related to a new feature. kind/question Issues or PRs that are questions around the project or a particular feature labels Nov 27, 2019

bobcatfish added this to Needs triage in Tekton Pipelines Feb 26, 2020

dibyom moved this from Needs triage to Backlog in Tekton Pipelines Mar 30, 2020

tekton-robot added lifecycle/rotten Denotes an issue or PR that has aged beyond stale and will be auto-closed. lifecycle/stale Denotes an issue or PR has remained open with no activity and has become stale. labels Aug 13, 2020

tekton-robot closed this as completed Aug 13, 2020

Tekton Pipelines automation moved this from Unsure what to do next to Closed Aug 13, 2020

tekton-robot reopened this Aug 13, 2020

Tekton Pipelines automation moved this from Closed to Needs triage Aug 13, 2020

bobcatfish added the priority/backlog Higher priority than priority/awaiting-more-evidence. label Aug 13, 2020

project-bot bot moved this from Needs triage to High priority in Tekton Pipelines Aug 24, 2020

bobcatfish changed the title ~~provide a pipeline concurrency limit for a given repository + branch~~ provide a pipeline concurrency limit Aug 24, 2020

bobcatfish mentioned this issue Sep 14, 2020

TEP-0015 - Add a pending setting to Tekton PipelineRun and TaskRuns tektoncd/community#203

Merged

jerop mentioned this issue Oct 5, 2020

Controlling max parallel jobs per pipeline[WIP] #3112

Closed

4 tasks

NikeNano mentioned this issue Oct 8, 2020

TEP-0013 for adding a limit to pipeline concurrency tektoncd/community#228

Closed

vdemeester mentioned this issue Nov 20, 2020

Towards v1 API #3548

Closed

17 tasks

ghost mentioned this issue Jan 26, 2021

Concurrency limiter controller tektoncd/experimental#699

Open

afrittoli closed this as completed Feb 9, 2021

Tekton Pipelines automation moved this from High priority to Closed Feb 9, 2021

dibyom mentioned this issue Jan 20, 2022

[Workflows] Explore Pipeline Concurrency Support in Workflows tektoncd/experimental#826

Open

amitjha780 mentioned this issue Apr 24, 2023

Controlling max parallel jobs per pipeline #2591

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

provide a pipeline concurrency limit #1305

provide a pipeline concurrency limit #1305

jstrachan commented Sep 13, 2019 •

edited

Loading

vdemeester commented Nov 27, 2019

bobcatfish commented Nov 27, 2019

assertion commented Jan 14, 2020

bobcatfish commented Jan 28, 2020

Fabian-K commented Feb 10, 2020

tekton-robot commented Aug 13, 2020

tekton-robot commented Aug 13, 2020

tekton-robot commented Aug 13, 2020

tekton-robot commented Aug 13, 2020

bobcatfish commented Aug 13, 2020

tekton-robot commented Aug 13, 2020

bobcatfish commented Aug 24, 2020

imjasonh commented Aug 24, 2020

bobcatfish commented Aug 24, 2020

afrittoli commented Feb 9, 2021

provide a pipeline concurrency limit #1305

provide a pipeline concurrency limit #1305

Comments

jstrachan commented Sep 13, 2019 • edited Loading

Expected Behavior

Actual Behavior

Steps to Reproduce the Problem

Additional Info

vdemeester commented Nov 27, 2019

bobcatfish commented Nov 27, 2019

assertion commented Jan 14, 2020

bobcatfish commented Jan 28, 2020

Fabian-K commented Feb 10, 2020

tekton-robot commented Aug 13, 2020

tekton-robot commented Aug 13, 2020

tekton-robot commented Aug 13, 2020

tekton-robot commented Aug 13, 2020

bobcatfish commented Aug 13, 2020

tekton-robot commented Aug 13, 2020

bobcatfish commented Aug 24, 2020

imjasonh commented Aug 24, 2020

bobcatfish commented Aug 24, 2020

afrittoli commented Feb 9, 2021

jstrachan commented Sep 13, 2019 •

edited

Loading