[kbn/optimizer] Force worker exit, extend parent ping timeout #67235

spalger · 2020-05-22T01:28:34Z

We're still seeing failures on CI caused by workers who are exiting early (probably because the parent process doesn't response to the ping quickly enough)

As well as workers which don't gracefully close for some reason

We don't know exactly why this is happening, but it's clearly related to the pings we implemented yesterday, hoping that forcefully closing the worker internally, and extending the ping timeout for the parent will be sufficient to avoid this level of failure on CI.

elasticmachine · 2020-05-22T01:28:36Z

Pinging @elastic/kibana-operations (Team:Operations)

mistic

LGTM

tylersmalley · 2020-05-22T02:47:44Z

packages/kbn-optimizer/src/worker/run_worker.ts

-  setTimeout(() => {
-    send(
-      workerMsgs.error(
-        new Error('process did not automatically exit within 5 seconds, forcing exit')


We probably still want to log an error.

What error? If we call process.exit() the process is going to exit immediately and we won't be able to set a timer or anything.

tylersmalley

LGTM - just one minor comment.

kibanamachine · 2020-05-22T03:22:26Z

💔 Build Failed

continuous-integration/kibana-ci/pull-request
Commit: 3be2740
Pipeline Steps (look for red circles / failed steps)
Interpreting CI Failures

Failed CI Steps

Execute kibana-intake

Test Failures

Kibana Pipeline / kibana-intake-agent / Jest Integration Tests.src/dev/code_coverage/ingest_coverage/integration_tests.Ingesting coverage to the coverage index should result in every posted item having a site url that meets all regex assertions

Link to Jenkins

Standard Out

Failed Tests Reporter:
  - Test has failed 1 times on tracked branches: https://github.com/elastic/kibana/issues/67075

Stack Trace

Error: Failed: 1
    at Env.fail (/var/lib/jenkins/workspace/elastic+kibana+pipeline-pull-request/kibana/node_modules/jest-jasmine2/build/jasmine/Env.js:778:61)
    at ChildProcess.next (/var/lib/jenkins/workspace/elastic+kibana+pipeline-pull-request/kibana/node_modules/jest-jasmine2/build/queueRunner.js:31:24)
    at ChildProcess.emit (events.js:198:13)
    at maybeClose (internal/child_process.js:982:16)
    at Socket.stream.socket.on (internal/child_process.js:389:11)
    at Socket.emit (events.js:198:13)
    at Pipe._handle.close (net.js:607:12)

To update your PR or re-run it, just comment with:
@elasticmachine merge upstream

spalger · 2020-05-22T03:47:49Z

I'm just going to revert the changes I've made to the optimizer recently, there is clearly something wrong with the strategy here and I'm really unsure that this is going to make things better.

spalger added 2 commits May 21, 2020 18:19

[kbn/optimizer] avoid early exit, give parent plenty of time to respond

147e5f2

stop trying to gracefully exit

3be2740

spalger added Team:Operations Team label for Operations Team v8.0.0 release_note:skip Skip the PR/issue when compiling release notes v7.9.0 v7.8.1 labels May 22, 2020

spalger requested a review from a team as a code owner May 22, 2020 01:28

mistic approved these changes May 22, 2020

View reviewed changes

tylersmalley reviewed May 22, 2020

View reviewed changes

tylersmalley approved these changes May 22, 2020

View reviewed changes

spalger closed this May 22, 2020

spalger deleted the extend-parent-ping-timeout branch August 18, 2020 18:01

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[kbn/optimizer] Force worker exit, extend parent ping timeout #67235

[kbn/optimizer] Force worker exit, extend parent ping timeout #67235

spalger commented May 22, 2020

elasticmachine commented May 22, 2020

mistic left a comment

tylersmalley May 22, 2020 •

edited

Loading

spalger May 22, 2020

tylersmalley left a comment

kibanamachine commented May 22, 2020

Standard Out

Stack Trace

spalger commented May 22, 2020

[kbn/optimizer] Force worker exit, extend parent ping timeout #67235

[kbn/optimizer] Force worker exit, extend parent ping timeout #67235

Conversation

spalger commented May 22, 2020

elasticmachine commented May 22, 2020

mistic left a comment

Choose a reason for hiding this comment

tylersmalley May 22, 2020 • edited Loading

Choose a reason for hiding this comment

spalger May 22, 2020

Choose a reason for hiding this comment

tylersmalley left a comment

Choose a reason for hiding this comment

kibanamachine commented May 22, 2020

💔 Build Failed

Failed CI Steps

Test Failures

Standard Out

Stack Trace

spalger commented May 22, 2020

tylersmalley May 22, 2020 •

edited

Loading