Extend script execution timeout by mna · Pull Request #15779 · fleetdm/fleet

mna · 2023-12-20T20:42:28Z

#15196

This is the work of @ghernandez345, except for the Go `http.ResponseController` change that overrides the server's write timeout for the synchronous run-script endpoint, so that calls don't time out while waiting for a script response (the default HTTP server timeout for our server was 90s).

Checklist for submitter

If some of the following don't apply, delete the relevant line.

- Changes file added for user-visible changes in changes/ or orbit/changes/. See Changes files for more information.
- Input data is properly validated, SELECT * is avoided, SQL injection is prevented (using placeholders for values in statements)
- Added/updated tests
- Manual QA for all new/changed functionality
  - For Orbit and Fleet Desktop changes:
    - Manual QA must be performed in the three main OSs, macOS, Windows and Linux.
    - Auto-update manual QA, from released version of component to new version (see tools/tuf/test).

codecov · 2023-12-20T20:45:48Z

Codecov Report

Attention: 8 lines in your changes are missing coverage. Please review.

Comparison is base (f3d400d) 66.04% compared to head (a3c3dcc) 66.03%.
Report is 1 commits behind head on main.

Files	Patch %	Lines
cmd/fleet/serve.go	0.00%	7 Missing ⚠️
server/fleet/scripts.go	50.00%	0 Missing and 1 partial ⚠️

Additional details and impacted files

@@            Coverage Diff             @@
##             main   #15779      +/-   ##
==========================================
- Coverage   66.04%   66.03%   -0.01%     
==========================================
  Files        1067     1067              
  Lines       93546    93554       +8     
  Branches     2337     2337              
==========================================
+ Hits        61781    61782       +1     
- Misses      27141    27148       +7     
  Partials     4624     4624

Flag	Coverage Δ
backend	`67.22% <33.33%> (-0.01%)`	⬇️
frontend	`52.04% <ø> (ø)`

mna · 2023-12-20T20:44:18Z

-			expectErrMsg: fleet.RunScriptHostTimeoutErrMsg,
-		},
+		// TODO: this would take 5 minutes to run, we don't want that kind of slowdown in our test suite
+		// but can be useful to have around for manual testing.


Ideally we'd have a simple way to reduce the timeout duration, but for now that is only possible within the server/service package. The response with shorter timeouts is well tested in that package (in the integration tests), so I just commented this case out.

mna · 2023-12-20T21:01:07Z

+					}
+				}
+				apiHandler.ServeHTTP(rw, req)
+			})


This is ugly, but the issue is in the third-party promhttp package: it wraps the raw http.ResponseWriter but doesn't provide the Unwrap method needed to get back to the original value, and ResponseController requires that to work properly.

In the meantime, and pressed for time, I went with this ugly approach. I'm sure the Prometheus folks would be open to integrating the change, as this has been the right way to do it since Go 1.20. We could then have a cleaner approach and do this timeout extension where the route is defined, e.g. with a ue.WithHTTPMiddleware(...).POST("/...") (which I attempted at first, before noticing the Prometheus issue).

gillespi314

Looks good! Happy to review again if there are further changes.

roperzh

Pulled and tested and works perfectly

roperzh · 2024-01-03T19:39:48Z

@georgekarrv @sabrinabuckets per @lukeheath's request I'm including this in the release.

For #15196. The main problem was that we have two timeouts:

1. The timeout used by the host to kill the script execution.
2. The timeout used by the server to wait for the script results.

Before the changes in #15779, the server timeout was longer than the host timeout, but we inadvertently set both values to 5 minutes, which breaks the logic we have to handle both kinds of timeouts.

ghernandez345 and others added 3 commits December 20, 2023 09:55

change script execution timeout firstpass

dcf3c85

add line that script is running

087c7fc

Override server timeout for the sync run script endpoint

389ed5b

mna temporarily deployed to Docker Hub December 20, 2023 20:42 — with GitHub Actions Inactive

Move setting the override write timeout to fleet serve

ccd839a

mna had a problem deploying to Docker Hub December 20, 2023 20:57 — with GitHub Actions Error

Fix return value

2ec166a

mna temporarily deployed to Docker Hub December 20, 2023 20:58 — with GitHub Actions Inactive

mna commented Dec 20, 2023

gillespi314 previously approved these changes Dec 20, 2023

Comment thread cmd/fleetctl/scripts_test.go

Merge branch 'main' into feat-extend-script-timeout

a237d9d

roperzh had a problem deploying to Docker Hub January 3, 2024 19:09 — with GitHub Actions Error

add changes files

a3c3dcc

roperzh dismissed gillespi314’s stale review via a3c3dcc January 3, 2024 19:13

roperzh temporarily deployed to Docker Hub January 3, 2024 19:13 — with GitHub Actions Inactive

roperzh marked this pull request as ready for review January 3, 2024 19:37

roperzh requested review from a team as code owners January 3, 2024 19:37

roperzh approved these changes Jan 3, 2024

roperzh merged commit d943fbb into main Jan 3, 2024

roperzh deleted the feat-extend-script-timeout branch January 3, 2024 19:39

roperzh mentioned this pull request Jan 3, 2024

fix unreleased bugs for the increased script timeout #15897

Merged

1 task

roperzh merged 7 commits into main from feat-extend-script-timeout

4 participants
