Retry `Kitchen::Provisioner#run_command` after allowed exit codes #1055

smurawski · 2016-06-13T22:14:10Z

Allow user to define exit codes to retry `Kitchen::Provisioner#run_command`
Allow user to define wait time between attempts
Allow user to define exit codes to retry
Allow user to define maximum number of retry attempts

Resolves #1016

smurawski · 2016-06-15T19:59:36Z

I've tested this against Windows, CentOS, and Ubuntu guests. The Windows guests worked the most consistently. The CentOS and Ubuntu guests were more of a race to see if Chef could finish before the system shut down (see chef/chef#5026).

smurawski · 2016-06-15T20:03:51Z

With this PR, all provisioners support three new configuration settings:

- `retry_on_exit_code`: an array of exit codes that tell Kitchen to retry the converge command. Defaults to an empty array.
- `max_retries`: the number of times to retry the converge before passing along the failed status. Defaults to 1.
- `wait_for_retry`: the number of seconds to wait between converge attempts. Defaults to 30.
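To illustrate how the three settings interact, here is a minimal standalone sketch of the retry behavior described above. It is not the code from this PR: `CommandFailed`, `run_with_retry`, and the `sleeper` parameter are hypothetical names made up for the example, and the exit code 35 in the usage note is only an illustrative value.

```ruby
# Hypothetical error type that carries the exit code of a failed command.
# (An assumption for this sketch; not a Test Kitchen class.)
class CommandFailed < StandardError
  attr_reader :exit_code

  def initialize(message, exit_code)
    super(message)
    @exit_code = exit_code
  end
end

# Runs the given block and retries it only when it fails with one of the
# allowed exit codes. The defaults mirror the ones described in the comment
# above. `sleeper` is injectable so the wait can be stubbed out in tests.
def run_with_retry(retry_on_exit_code: [], max_retries: 1, wait_for_retry: 30,
                   sleeper: ->(seconds) { sleep(seconds) })
  attempts = 0
  begin
    attempts += 1
    yield attempts
  rescue CommandFailed => e
    # Pass the failure along if the exit code is not retryable or the
    # retry budget has been used up.
    raise unless retry_on_exit_code.include?(e.exit_code) && attempts <= max_retries

    sleeper.call(wait_for_retry)
    retry
  end
end
```

With `max_retries: 1`, a block that fails once with an allowed code (say, `retry_on_exit_code: [35]`) runs a second time after the wait; a failure with any other exit code is raised immediately without waiting.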

smurawski · 2016-06-15T20:08:30Z

The AppVeyor build failure seems specific to the build box. More "symlink is unimplemented" errors, which seem to be common among AppVeyor builds recently.

Rebased on master after merging #1057 to fix the AppVeyor tests.

cheeseplus · 2016-06-16T20:21:24Z

+1


adamleff · 2016-06-16T22:22:28Z

lib/kitchen/command.rb

@@ -170,17 +170,25 @@ def run_action(action, instances, *args)
        concurrency.times do
          threads << Thread.new do
            while instance = queue.pop
+              puts "running #{instance.name}"


Is this debug output? If not, we should probably use the logger methods instead.

mwrock · 2016-06-16

I had the same question, and then noticed `puts` being used for these in several places in TK. A bit weird, but separate from this PR.

smurawski · 2016-06-16

Oops, that should be pulled. I was troubleshooting the difference between actionfailed and instancefailed. Thanks.

adamleff · 2016-06-16T22:32:52Z

Some minor things, all non-blockers. 👍

mwrock · 2016-06-16T23:03:43Z

👍

lamont-granquist · 2016-06-16T23:27:34Z

👍

carpnick · 2016-07-19T01:28:17Z

Thanks for doing this @smurawski. This one was a long time coming to TK. Thanks for putting in the effort on chef and TK to get this done.

smurawski added the Developing label Jun 13, 2016

smurawski changed the title ~~Retry Kitchen::Provisioner#run_command after allowed exit codes~~ WIP - Retry Kitchen::Provisioner#run_command after allowed exit codes Jun 13, 2016

smurawski self-assigned this Jun 13, 2016

smurawski added Accepted Improvement labels Jun 13, 2016

smurawski changed the title ~~WIP - Retry Kitchen::Provisioner#run_command after allowed exit codes~~ Retry Kitchen::Provisioner#run_command after allowed exit codes Jun 15, 2016

smurawski mentioned this pull request Jun 16, 2016

Release 1.10.0 #1058

Merged

smurawski added 9 commits June 16, 2016 15:29

commands that exit non-zero pass exit code back in a TransportFailed error

6ea0cdf

add execute_with_retry

cd838e5

Add allowed retries to base provisioner

e03169b

add a wait time option

182c877

capture instance failures and better combined error message

45b692c

Get the real last exit code from PowerShell.

769fe21

change the default value for retry_on_exit_code to an empty array

b6f8850

default one retry. force close the session to require a new session for the next try.

919839f

fix quality tests

e4878ed

smurawski force-pushed the smurawski/reboot_support branch from dc43531 to e4878ed June 16, 2016 20:35

adamleff reviewed Jun 16, 2016

address code review feedback

f87b3aa

smurawski force-pushed the smurawski/reboot_support branch from 0cf416e to f87b3aa June 16, 2016 22:55

smurawski merged commit acc9ccb into master Jun 17, 2016

smurawski removed Accepted Developing labels Jun 17, 2016

smurawski deleted the smurawski/reboot_support branch June 17, 2016 01:15

jonathanmorley mentioned this pull request Jun 20, 2016

Reboot resource with new 'reboot and try again' feature #1062

Closed

nicolasvan mentioned this pull request Oct 31, 2016

Idempotence check? neillturner/kitchen-puppet#141

Closed

test-kitchen locked and limited conversation to collaborators Nov 16, 2017
