Ability to re-trigger failed build with the same input versions #413

davewalter · 2016-05-02T17:42:42Z

When using the new version: every configuration for a get task, it is possible to arrive at a state where you have multiple builds of the same job running at the same time. If an earlier build fails, there is no way to re-trigger it with the same set of inputs. We haven't been able to determine a useful workaround; setting serial: true doesn't really help in this scenario, because the next build will start as soon as the first one fails.

It would be helpful if there were a way to re-trigger the job with the same inputs as a particular build (failed or otherwise).

Let us know if you need more details on this scenario or our desired fix. Thanks!

@davewalter and @rmasand

The text was updated successfully, but these errors were encountered:

vito · 2016-07-15T21:40:46Z

We're thinking about splitting today's trigger build button (+). It is primarily used for three things today:

Impatience: I just pushed something or know that someone just published something, and I want the build to run now.
Retrying a build (this issue): I want to re-run the current build, either to see if it's flaky or to retry because something outside the build failed (e.g. github, a deployment, etc.).
Triggering a job that only ever manually runs, e.g. shipping a product after you've written release notes.

The flaw with case 1 is that there's a race condition. In the time between you loading the page and clicking the +, Concourse may have already found your stuff and queued a build. Now you have two, which is annoying.

The flaw with case 2 is you can only do it with the latest build, and also if you triggered a bunch, new versions may come in, potentially invalidating your flakiness trial. You could set version to a particular version in your pipeline, but that's annoying.

Case 3 pretty much works, but you don't know what versions it'll use until you run it. See #269

So, I think we should split + into two buttons. One that lives on the job, "sync", which will make sure everything's up-to-date and then queue up a build if it should (i.e. one's not queued already; same semantics as auto-triggering). The other button would be associated with a particular build of the job, and would re-trigger it with the same inputs. This covers cases 1 and 2.

The third case needs some more thinking since a "sync" button alone doesn't intuitively seem like enough given that the build only manually triggers.

endzyme · 2016-08-15T17:52:10Z

+1 this would be a great, and much needed, feature for Concourse CI

charlieoleary · 2016-11-15T19:09:32Z

+1 As well, this is a pretty critical feature. We've sort of circumvented it with empty commits (since we're using the PR resource), but it would be ideal to simply retry a failed job with the same inputs.

ahelal · 2016-11-15T20:24:51Z

Would love to see that. We are doing crazy stuff to try to trigger old commits. Is this open for external people to help ? and if so how ?

primalmotion · 2016-12-03T18:36:15Z

same thing here, I feel that concourse has everything to be able to do this fairly easily. It would work perfectly with the pull request resource.

tracker-common · 2016-12-14T17:26:12Z

+1000000 👍 👍 👍 👍 👍 👍 👍 👍 👍 👍 👍 👍 👍 👍 👍 👍 👍 👍 👍 👍 👍 👍 👍👍 👍👍👍 👍👍 👍 👍 👍 👍 👍 👍 👍 👍 👍 👍 👍 👍 👍 👍 👍 👍 👍 👍 👍 👍 👍 👍 👍👍 👍👍👍 👍👍 👍 👍 👍 👍 👍 👍 👍 👍 👍 👍 👍 👍 👍 👍 👍 👍 👍 👍 👍 👍 👍 👍 👍👍 👍👍👍 👍👍 👍 👍 👍 👍 👍 👍 👍 👍 👍 👍 👍 👍 👍 👍 👍 👍 👍 👍 👍 👍 👍 👍 👍👍 👍👍👍 👍👍

gabro · 2017-02-08T15:44:36Z

Same here, if you guys are busy can we help somehow?

miromode · 2017-03-08T01:25:23Z

+1

kei-yamazaki · 2017-04-10T00:46:01Z

👍

timrchavez · 2017-04-12T18:34:37Z

👍 This would go a long way to making Concourse a more viable choice for us.

ls-yann-david · 2017-04-27T19:05:01Z

Pleaseeeeee 👍

VanAxe · 2017-04-27T19:09:42Z

Retrigerring jobs would be a lot more elegant than empty commits! 😝 👍

olhtbr · 2017-04-27T19:37:12Z

👍

tsantero · 2017-05-09T16:03:56Z

is this issue currently being actively worked on internally? I need this asap, but also don't want to duplicate work. similarly, are there any contributor guidelines I should read?

vito · 2019-04-09T16:11:43Z

Notes from IPM:

Preserve inputs on re-triggered build (duh) - make sure the scheduler/build starter doesn't re-compute them or use next_build_inputs
We'll need a button in the UI for re-triggering (@Lindsayauchin), in addition to the existing trigger-build button
We'll need to clear out the build events for the old build and "re-set" the build back to pending state (and reset whatever other build data is appropriate, i.e. start/end time)
Also: clear out any outputs for the build, otherwise a re-trigger from green to red could result in "phantom outputs" satisfying passed constraints
Re-compute build plan based on current pipeline config
- If anyone is worried about this let us know - it's way easier to implement this way, and we figure there may be cases where a pipeline config change was made to fix the errant build anyway, in which case we'd want to pick up the new config.

We'll also spike on creating a new build instead of replacing it. They may actually be roughly the same difficulty. If we do this instead, we don't have to reset anything or clear out any outputs/etc.

StevenArmstrong · 2019-05-12T13:50:42Z

Could the trigger buttons be moved out from under the job and put on the main page somehow? It could have a trigger icon with a prompt or something? I say this as one of the main feedbacks our product lead gets of our concourse pipelines is around users calling the interface awkward and complaining they have to click through the green/red square of a job to trigger it. To mitigate this we have had to have a job called trigger for production deployments so users know how to trigger a production deployment. As a result of the clicking around we have even had requests to build a UI on top of concourse to make it easier and more dev friendly for roll forward and rollback triggers :(

vito · 2019-05-13T16:10:05Z

@StevenArmstrong So having a specific 'retrigger' button on failed builds in the pipeline? Sounds reasonable, though I would argue people should probably be clicking into the build first and understanding the failure rather than blindly re-triggering. 🤔 In any case we may want to discuss that as a separate issue that we can address after we take our first crack at this. :)

hstenzel · 2019-05-13T17:11:52Z

One question I have is how we can retrigger a build if we no longer have the log from the original?

vito · 2019-05-13T18:57:50Z

@hstenzel Build logs are purely cosmetic, the actual information regarding which versions/etc. are used is kept in the database and so re-triggering will still work.

hstenzel · 2019-05-13T21:53:59Z

Perhaps I'm missing something then. How would I retrigger a build for which I no longer have logs if the UI element is on the log screen?

…

On Mon, May 13, 2019, 2:58 PM Alex Suraci ***@***.***> wrote: @hstenzel <https://github.com/hstenzel> Build logs are purely cosmetic, the actual information regarding which versions/etc. are used is kept in the database and so re-triggering will still work. — You are receiving this because you were mentioned. Reply to this email directly, view it on GitHub <#413?email_source=notifications&email_token=ABOOX5Z275ZG2V2MKEHPXW3PVG23TA5CNFSM4CCWGJC2YY3PNVWWK3TUL52HS4DFVREXG43VMVBW63LNMVXHJKTDN5WW2ZLOORPWSZGODVJHUGA#issuecomment-491944472>, or mute the thread <https://github.com/notifications/unsubscribe-auth/ABOOX52QDUJWEQEVIGCHT3DPVG23TANCNFSM4CCWGJCQ> .

vito · 2019-05-13T22:36:49Z

The only thing removed when logs are reaper are the build logs below the build header. The rest of the ui is still there and shows the build status and such. On Mon, May 13, 2019, 5:54 PM Harley Stenzel <notifications@github.com> wrote:

…

Perhaps I'm missing something then. How would I retrigger a build for which I no longer have logs if the UI element is on the log screen? On Mon, May 13, 2019, 2:58 PM Alex Suraci ***@***.***> wrote: > @hstenzel <https://github.com/hstenzel> Build logs are purely cosmetic, > the actual information regarding which versions/etc. are used is kept in > the database and so re-triggering will still work. > > — > You are receiving this because you were mentioned. > Reply to this email directly, view it on GitHub > < #413?email_source=notifications&email_token=ABOOX5Z275ZG2V2MKEHPXW3PVG23TA5CNFSM4CCWGJC2YY3PNVWWK3TUL52HS4DFVREXG43VMVBW63LNMVXHJKTDN5WW2ZLOORPWSZGODVJHUGA#issuecomment-491944472 >, > or mute the thread > < https://github.com/notifications/unsubscribe-auth/ABOOX52QDUJWEQEVIGCHT3DPVG23TANCNFSM4CCWGJCQ > > . > — You are receiving this because you were mentioned. Reply to this email directly, view it on GitHub <#413?email_source=notifications&email_token=AAAAOWBU5MKVXKG2O76VNB3PVHPPRA5CNFSM4CCWGJC2YY3PNVWWK3TUL52HS4DFVREXG43VMVBW63LNMVXHJKTDN5WW2ZLOORPWSZGODVJVPLA#issuecomment-492001196>, or mute the thread <https://github.com/notifications/unsubscribe-auth/AAAAOWCKKMZ2KKD2YDHMA7TPVHPPRANCNFSM4CCWGJCQ> .

StevenArmstrong · 2019-05-14T09:35:43Z

@vito it wasn't really just for retrigger. It was more if you are doing a redesign on what the + buttons or other buttons do it would be really good if they weren't nested under the job as users have said they find it confusing having to click through to manually initiate a new build when deploying to production. Instead I was suggesting having them a layer up as an icon on the pipeline page beside each job. You could then have a confirmation pop up if someone clicks it by mistake to confirm to build with latest or retrigger or cancel. This way you wouldn't need multiple buttons simply 1 button to build with multiple sub options without having to click through to the log to trigger anything. It's something that is frequently fed back about the UI from our users.

Lindsayauchin · 2019-05-14T13:44:43Z

@StevenArmstrong interesting idea. I think that from pipeline we have observed, like the (rabbit MQ team at Pivotal below) an action button to trigger a job on the pipeline page is just not scalable.

We are thinking about the user pains around triggering a build with the work being done on the resource version. You can follow a related issue #3403 to see our progress on the UX changes.

hstenzel · 2019-05-14T13:59:15Z

If I understand correctly, to retrigger a specific job I'd scroll left/right on the build page to find the correct job?

Also, I'd potentially want to retrigger a successful job too, thinking about the case of rebuilding an artifact that was accidentally lost.

Perhaps these questions are really more about the UI related to the feature.

vito · 2019-05-14T15:56:47Z

@hstenzel Yep - to re-trigger a build you would do so from the build's page. There's no such thing as re-triggering a job - you can trigger a new build of a job, but the re- part of re-trigger means you're running an already-existing build for a second time with the same inputs.

You will be able to re-trigger a build regardless of whether it succeeded or failed; they both have their use case: re-trigger a succeeded build to detect flakes, re-trigger failed build to allow artifacts to continue along the pipeline.

StevenArmstrong · 2019-05-14T18:11:37Z

@Lindsayauchin most people use pipeline groups on concourse to visualise big pipelines with many jobs. So I still think in combination with pipeline groups it could still be viable to have the trigger icons on the pipeline page.

Lindsayauchin · 2019-08-06T15:31:02Z

This has evolved into the Build re-triggering track of work. Iterative designs have been moved to smaller sliced stories and can be found in the Build re-triggering project here: https://github.com/concourse/concourse/projects/24

vito · 2019-10-27T16:16:11Z

For those following along: this has been implemented and will be in v6.0! I don't have an ETA yet since v6.0 includes very substantial internal changes that we're doing due diligence to test out. We're considering shipping a beta release first.

vito · 2020-01-07T21:41:51Z

Closing this out!

As implemented in v6.0, re-running a build will create a new build named e.g. "123.1", which will apprear adjacent to the original build in the build history. This placement in the history reflects the scheduler's iteration order for passed constraints - i.e. if you re-run a very old build it won't suddenly propagate the older versions downstream if there are "newer" successful builds after the re-run.

The new build will run with a newly constructed build plan based on the pipeline's current configuration, using the versions of each input from the original build. This should work in the common case of re-triggering recent flakes, but it can fail if the configuration has changed such that new get steps have been added, or the old version is no longer available because the resource changed. Hope that's good enough for MVP!

In the future we plan to fix this by having re-runs run with the the exact build plan that the original ran with, rather than constructing a new plan based on the current configuration. This is going to be tracked in a separate "epic", Build Lifecycle.

concourse-bot added unscheduled and removed unscheduled labels May 2, 2016

This was referenced Jul 8, 2016

Add ability to rerun a failed build with the same resource versions #129

Closed

Support for building arbitrary git branches based off regex for branch name (without requiring a GitHub pull request) #239

Closed

vito changed the title ~~Ability to ReTrigger Failed Job With Same Version of Resources~~ Ability to re-trigger failed build with the same input versions Jul 15, 2016

vito mentioned this issue Jul 15, 2016

Show versions of inputs that would be used if a manual job would be triggered *now* #269

Open

concourse-bot added unscheduled and removed scheduled labels Jul 22, 2016

vito added the help wanted label Oct 19, 2016

jtarchie mentioned this issue Dec 3, 2016

be able to re-trigger PRs through the pipeline without an empty commit jtarchie/github-pullrequest-resource#46

Closed

vito removed the unscheduled label May 8, 2017

vito removed the help wanted label May 9, 2017

vito added this to the Staging milestone May 17, 2017

vito added the workflows label May 17, 2017

vito added this to the v6.0.0 milestone May 10, 2019

vito added enhancement and removed needs-validation labels Jul 29, 2019

vito added this to To do in Algorithm v3 Jul 30, 2019

vito moved this from To do to In progress in Algorithm v3 Jul 30, 2019

vito moved this from In progress to To do in Algorithm v3 Jul 30, 2019

vito added this to To do in Build re-running Jul 30, 2019

vito removed this from To do in Algorithm v3 Jul 30, 2019

vito moved this from To do to End Goals in Build re-running Aug 6, 2019

vito mentioned this issue Aug 12, 2019

Core: add an endpoint to re-trigger a build with the same inputs #4268

Closed

clarafu mentioned this issue Nov 6, 2019

Beta release 6.0.x #4721

Merged

10 tasks

vito mentioned this issue Dec 20, 2019

bug(lidar): unexpected serial resource check #4940

Closed

vito closed this as completed Jan 7, 2020

phantas mentioned this issue Jan 15, 2020

Revert "BAU - allow manual trigger of PR jobs" alphagov/govwifi-safe-restarter#104

Closed

clarafu added the release/documented Documentation and release notes have been updated. label Feb 12, 2020

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Ability to re-trigger failed build with the same input versions #413

Ability to re-trigger failed build with the same input versions #413

davewalter commented May 2, 2016

vito commented Jul 15, 2016

endzyme commented Aug 15, 2016

charlieoleary commented Nov 15, 2016

ahelal commented Nov 15, 2016

primalmotion commented Dec 3, 2016

tracker-common commented Dec 14, 2016

gabro commented Feb 8, 2017

miromode commented Mar 8, 2017

kei-yamazaki commented Apr 10, 2017

timrchavez commented Apr 12, 2017

ls-yann-david commented Apr 27, 2017

VanAxe commented Apr 27, 2017

olhtbr commented Apr 27, 2017

tsantero commented May 9, 2017

vito commented Apr 9, 2019

StevenArmstrong commented May 12, 2019

vito commented May 13, 2019

hstenzel commented May 13, 2019

vito commented May 13, 2019

hstenzel commented May 13, 2019 via email

vito commented May 13, 2019 via email

StevenArmstrong commented May 14, 2019

Lindsayauchin commented May 14, 2019 •

edited

hstenzel commented May 14, 2019

vito commented May 14, 2019

StevenArmstrong commented May 14, 2019

Lindsayauchin commented Aug 6, 2019 •

edited

vito commented Oct 27, 2019

vito commented Jan 7, 2020

Ability to re-trigger failed build with the same input versions #413

Ability to re-trigger failed build with the same input versions #413

Comments

davewalter commented May 2, 2016

vito commented Jul 15, 2016

endzyme commented Aug 15, 2016

charlieoleary commented Nov 15, 2016

ahelal commented Nov 15, 2016

primalmotion commented Dec 3, 2016

tracker-common commented Dec 14, 2016

gabro commented Feb 8, 2017

miromode commented Mar 8, 2017

kei-yamazaki commented Apr 10, 2017

timrchavez commented Apr 12, 2017

ls-yann-david commented Apr 27, 2017

VanAxe commented Apr 27, 2017

olhtbr commented Apr 27, 2017

tsantero commented May 9, 2017

vito commented Apr 9, 2019

StevenArmstrong commented May 12, 2019

vito commented May 13, 2019

hstenzel commented May 13, 2019

vito commented May 13, 2019

hstenzel commented May 13, 2019 via email

vito commented May 13, 2019 via email

StevenArmstrong commented May 14, 2019

Lindsayauchin commented May 14, 2019 • edited

hstenzel commented May 14, 2019

vito commented May 14, 2019

StevenArmstrong commented May 14, 2019

Lindsayauchin commented Aug 6, 2019 • edited

vito commented Oct 27, 2019

vito commented Jan 7, 2020

Lindsayauchin commented May 14, 2019 •

edited

Lindsayauchin commented Aug 6, 2019 •

edited