Stampeding API requests causes stalled workers #4372

clarafu · 2019-09-04T18:41:32Z

Within our lidar-drills environment (which is running with Algorithm V3 and Lidar), we are putting the concourse deployment under a lot of stress (to put it in perspective we have about double the amount of jobs as wings). When we open the web UI to view the pipelines and click around, it causes a stampede of API requests to various endpoints (for example, GetJobs for the pipeline view or GetBuild from the build page).

These requests build up because there is a new one sent every tick if we haven't received a response back yet as they are all stuck pending. This is becoming a problem now because we have set a dedicated connection pool for the api connections (default to 10 right now) and once it hits that max 10 connections, workers will become stalled because they arn't able to send their heartbeating requests.

Maybe instead of sending a new request every tick, we can just wait for a response after the first request is sent?

Thanks!

jamieklassen · 2019-09-05T21:48:52Z

I'm confident we can ensure that single client doesn't have multiple in-flight requests to the same endpoint. This is a good web-ui issue.

clarafu · 2019-09-06T19:04:10Z

That's great! I'm going to add this to the 6.0 milestone since we probably want get this in with the new algorithm.

jamieklassen · 2019-10-07T15:06:18Z

related: #2776

zoetian · 2019-11-04T20:45:30Z

~~Let's not forget to ensure that there is a FetchUser call every 5 seconds on the dashboard.~~ nothing that comes from the /userinfo endpoint will change in any 5 second interval.

#4372 Signed-off-by: Zoe Tian <ztian@pivotal.io> Co-authored-by: Jamie Klassen <cklassen@pivotal.io>

and remove .userState from Dashboard model. #4372 Signed-off-by: Jamie Klassen <cklassen@pivotal.io> Co-authored-by: Zoe Tian <ztian@pivotal.io>

#4372 Signed-off-by: Zoe Tian <ztian@pivotal.io> Co-authored-by: Jamie Klassen <cklassen@pivotal.io>

#4372 Signed-off-by: Jamie Klassen <cklassen@pivotal.io> Co-authored-by: Zoe Tian <ztian@pivotal.io>

The display of the orange triangle on pipeline cards depended on the order in which the APIDataFetched and AllResourcesFetched callbacks were handled. Accordingly, we reversed the order in which the existing tests for this feature received those callbacks. Technically we have no coverage for the scenario where the callbacks are received in the other order now, but the covered case was the more difficult of the two anyway. #4372 Signed-off-by: Jamie Klassen <cklassen@pivotal.io> Co-authored-by: Zoe Tian <ztian@pivotal.io>

#4372 Signed-off-by: Jamie Klassen <cklassen@pivotal.io> Co-authored-by: Zoe Tian <ztian@pivotal.io>

…n getting team names from groups #4372 Signed-off-by: Jamie Klassen <cklassen@pivotal.io> Co-authored-by: Zoe Tian <ztian@pivotal.io>

vito · 2019-12-11T20:13:47Z

@pivotal-jamie-klassen I've extracted #4885 to cover the web UI aspect, but there's still a general concern over API requests starving the worker registration API requests of database connections which results in workers stalling.

@concourse/runtime Added this to the Runtime side-road - it's technically API related but I figured it has more to do with worker registration, and putting it in Runtime instead of Core could give us an opportunity to rethink how registration works if we want. (I have no strong suggestion.)

odormond · 2019-12-18T10:51:18Z

@vito, I noticed that you removed it from the v6.0.0 milestone. As I'm not familiar with your development workflow, I'm wondering if that means it's getting postponed for later or not?

vito · 2020-01-08T17:18:54Z

@odormond It just means it's not required for v6.0. It doesn't affect the priority; this is still high up in the runtime backlog.

odormond · 2020-01-09T08:01:02Z

Thanks @vito for the update.
Happy New Year and congratulation for the 5.8.0 release! We're excited by the containerd work and looking forward for a working implementation.

clarafu added the bug label Sep 4, 2019

jamieklassen added the web-ui label Sep 5, 2019

clarafu added this to the v6.0.0 milestone Sep 6, 2019

jamieklassen mentioned this issue Sep 23, 2019

Refresh Frequency of Team/Pipeline View #4482

Closed

jamieklassen mentioned this issue Oct 25, 2019

Main Page Loads Slowly when logging in as Admin #4646

Closed

jamieklassen assigned jomsie and jamieklassen Oct 31, 2019

jamieklassen pushed a commit that referenced this issue Nov 4, 2019

web: use Session's userState instead of Dashboard

24d663b

#4372 Signed-off-by: Zoe Tian <ztian@pivotal.io> Co-authored-by: Jamie Klassen <cklassen@pivotal.io>

jamieklassen pushed a commit that referenced this issue Nov 4, 2019

web: remove .user from APIData

1fc77ae

and remove .userState from Dashboard model. #4372 Signed-off-by: Jamie Klassen <cklassen@pivotal.io> Co-authored-by: Zoe Tian <ztian@pivotal.io>

jamieklassen pushed a commit that referenced this issue Nov 4, 2019

web: remove .version from APIData

0213366

#4372 Signed-off-by: Zoe Tian <ztian@pivotal.io> Co-authored-by: Jamie Klassen <cklassen@pivotal.io>

jamieklassen pushed a commit that referenced this issue Nov 4, 2019

web: move .version from Dashboard to Session

ab238e0

#4372 Signed-off-by: Jamie Klassen <cklassen@pivotal.io> Co-authored-by: Zoe Tian <ztian@pivotal.io>

jamieklassen pushed a commit that referenced this issue Nov 4, 2019

web: remove concourseVersion from Pipeline model

a068e38

#4372 Signed-off-by: Jamie Klassen <cklassen@pivotal.io> Co-authored-by: Zoe Tian <ztian@pivotal.io>

jomsie assigned zoetian and unassigned jomsie Nov 5, 2019

zoetian pushed a commit that referenced this issue Nov 5, 2019

web: separate resource fetching from APIData

781cbfa

#4372 Signed-off-by: Jamie Klassen <cklassen@pivotal.io> Co-authored-by: Zoe Tian <ztian@pivotal.io>

zoetian added the paused label Nov 6, 2019

odormond mentioned this issue Nov 19, 2019

Dashboard fetch pipelines once #4785

Merged

9 tasks

zoetian removed the paused label Nov 19, 2019

zoetian pushed a commit that referenced this issue Nov 19, 2019

web: seperate data fetching from APIData

17e001a

#4372 Signed-off-by: Jamie Klassen <cklassen@pivotal.io> Co-authored-by: Zoe Tian <ztian@pivotal.io>

zoetian pushed a commit that referenced this issue Nov 20, 2019

web: fetch all jobs every 5 seconds

7b2f4a7

#4372 Signed-off-by: Jamie Klassen <cklassen@pivotal.io> Co-authored-by: Zoe Tian <ztian@pivotal.io>

zoetian pushed a commit that referenced this issue Nov 21, 2019

web: refresh fetch all pipelines every 5 seconds

e2a982a

#4372 Signed-off-by: Jamie Klassen <cklassen@pivotal.io> Co-authored-by: Zoe Tian <ztian@pivotal.io>

zoetian pushed a commit that referenced this issue Nov 25, 2019

web: display turbulence view when fetch all pipelines errored

967ff8f

#4372 Signed-off-by: Jamie Klassen <cklassen@pivotal.io> Co-authored-by: Zoe Tian <ztian@pivotal.io>

zoetian pushed a commit that referenced this issue Nov 27, 2019

web: pipeline card should be continuously displayed

8f6e5ea

#4372 Signed-off-by: Jamie Klassen <cklassen@pivotal.io> Co-authored-by: Zoe Tian <ztian@pivotal.io>

jamieklassen pushed a commit that referenced this issue Nov 29, 2019

web: remove APIDataFetched and reduce the depencies for pipelines whe…

3b1ca0f

…n getting team names from groups #4372 Signed-off-by: Jamie Klassen <cklassen@pivotal.io> Co-authored-by: Zoe Tian <ztian@pivotal.io>

vito added this to To do in Runtime side-road Dec 11, 2019

vito removed the web-ui label Dec 11, 2019

vito removed this from the v6.0.0 milestone Dec 11, 2019

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Stampeding API requests causes stalled workers #4372

Stampeding API requests causes stalled workers #4372

clarafu commented Sep 4, 2019

jamieklassen commented Sep 5, 2019

clarafu commented Sep 6, 2019

jamieklassen commented Oct 7, 2019

zoetian commented Nov 4, 2019 •

edited

vito commented Dec 11, 2019

odormond commented Dec 18, 2019

vito commented Jan 8, 2020

odormond commented Jan 9, 2020

Stampeding API requests causes stalled workers #4372

Stampeding API requests causes stalled workers #4372

Comments

clarafu commented Sep 4, 2019

jamieklassen commented Sep 5, 2019

clarafu commented Sep 6, 2019

jamieklassen commented Oct 7, 2019

zoetian commented Nov 4, 2019 • edited

vito commented Dec 11, 2019

odormond commented Dec 18, 2019

vito commented Jan 8, 2020

odormond commented Jan 9, 2020

zoetian commented Nov 4, 2019 •

edited