Reflector: always succeed when listing a collection that is too large. #98541

lavalamp · 2021-01-28T20:10:47Z

Today, if a collection is too large, a LIST might not be able to finish within the 60s timeout, and then it is impossible for a controller to start.

If the controller uses multiple paginated LIST requests, that is better, but it must be able to get through the entire collection before the next compaction event (every 2.5 minutes).

Fortunately this can be fixed client-side without any server changes.

1. List the first page. Immediately start a watch at the given RV. Enqueue the watch events for later processing.
2. List subsequent pages.
3. When you get out of the history window, restart your list at the same lexical place by using the supplied continue token.
4. Track the new RV acquired with each history window reset (i.e. we must be able to determine lexically from an object's name/namespace which list RV it came from).
5. When you get all the way through the collection, apply the enqueued watch events to the locally stored objects. We can decide whether to apply a watch event or throw it away by comparing its RV with the list RV tracked in step 4 for that object's lexical position.
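The final merge step can be sketched as follows. This is an illustration under stated assumptions, not the reflector's actual code: `segment`, `event`, `listRV`, and `applyEvents` are hypothetical names, objects are reduced to string values keyed by `namespace/name`, and resource versions are assumed to be comparable integers (something clients are normally asked not to rely on).

```go
package main

import (
	"fmt"
	"sort"
)

// segment records that keys >= startKey (up to the next segment's startKey)
// were listed at resource version rv. Hypothetical type, not a client-go API.
type segment struct {
	startKey string
	rv       uint64
}

// event is a buffered watch event, reduced to what the merge step needs.
type event struct {
	key     string
	rv      uint64
	deleted bool
	value   string
}

// listRV returns the RV of the list snapshot that covered key.
// segs must be sorted by startKey and begin with startKey == "".
func listRV(segs []segment, key string) uint64 {
	// Index of the first segment that starts after key.
	i := sort.Search(len(segs), func(i int) bool { return segs[i].startKey > key })
	if i == 0 {
		return 0
	}
	return segs[i-1].rv
}

// applyEvents replays buffered events in order, skipping those that the
// list snapshot for that key already reflects.
func applyEvents(store map[string]string, segs []segment, events []event) {
	for _, ev := range events {
		if ev.rv <= listRV(segs, ev.key) {
			continue // the list snapshot for this key is at least as new
		}
		if ev.deleted {
			delete(store, ev.key)
		} else {
			store[ev.key] = ev.value
		}
	}
}

func main() {
	// First page listed at RV 100; the list restarted at key "ns2/m" with RV 250.
	segs := []segment{{"", 100}, {"ns2/m", 250}}
	store := map[string]string{"ns1/a": "v1", "ns2/z": "v3"}
	events := []event{
		{key: "ns1/a", rv: 120, value: "v2"},    // newer than RV 100: applied
		{key: "ns2/z", rv: 200, value: "stale"}, // older than RV 250: dropped
		{key: "ns1/b", rv: 260, value: "new"},   // created after its list: applied
	}
	applyEvents(store, segs, events)
	fmt.Println(store) // map[ns1/a:v2 ns1/b:new ns2/z:v3]
}
```

Keeping the segments sorted by start key makes the lookup a binary search, so the replay cost is dominated by the number of buffered events rather than the number of list restarts.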

This is a version of #90339 that is less efficient but doesn't require any complicated server changes. You could also consider this issue to be the client-side version of #90179.

lavalamp · 2021-01-28T20:13:30Z

/sig api-machinery

lavalamp · 2021-01-28T20:59:55Z

There are optimizations possible, e.g. step 5 can be done simultaneously. Also if we don't want the client to compare RV numbers, the client has to start a separate watch that corresponds with each new list start.
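A rough sketch of the second variant, with one watch per list restart so that no RV comparison is needed. All names here (`segment`, `event`, `replay`) are hypothetical and the objects are reduced to strings; it assumes each segment's watch was started at the RV its list returned, so every event it buffered is newer than the objects listed in its own key range.

```go
package main

import "fmt"

// event is a buffered watch event; note that it carries no RV.
type event struct {
	key     string
	deleted bool
	value   string
}

// segment is one list restart. It covers keys in [startKey, endKey) and
// owns the events buffered by the watch started alongside it.
type segment struct {
	startKey string
	endKey   string // "" means "to the end of the collection"
	events   []event
}

// covers reports whether key falls in this segment's lexical range.
func (s segment) covers(key string) bool {
	return key >= s.startKey && (s.endKey == "" || key < s.endKey)
}

// replay applies each segment's events only to keys that segment listed.
// Events for other ranges are left to the watch that owns those ranges.
func replay(store map[string]string, segs []segment) {
	for _, s := range segs {
		for _, ev := range s.events {
			if !s.covers(ev.key) {
				continue
			}
			if ev.deleted {
				delete(store, ev.key)
			} else {
				store[ev.key] = ev.value
			}
		}
	}
}

func main() {
	store := map[string]string{"a": "v1", "m": "v1"}
	segs := []segment{
		// Watch started with the first page; it sees everything afterwards.
		{startKey: "", endKey: "k", events: []event{
			{key: "a", value: "v2"},
			{key: "m", value: "stale"}, // already reflected by the second list
			{key: "m", deleted: true},
			{key: "a", value: "v3"},
		}},
		// Watch started when the list restarted at key "k".
		{startKey: "k", events: []event{
			{key: "m", deleted: true},
			{key: "a", value: "v3"},
		}},
	}
	replay(store, segs)
	fmt.Println(store) // map[a:v3]
}
```

Because the key ranges are disjoint, each key is driven by exactly one watch, at the cost of holding several watches open until the list completes. Collapsing back to a single watch afterwards is not shown here.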

fedebongio · 2021-02-04T21:19:36Z

/triage accepted

Jeffwan · 2021-03-05T01:31:39Z

> Today, if a collection is too large, a LIST might not be able to finish in the 60s time out,

Do you know what the bottleneck is for this slow query? I'm trying to see whether there is anything we can do to optimize the list query.

lavalamp · 2021-03-05T21:46:18Z

> what's the bottle neck

Most obviously, the server can't transmit all the bytes from every object in the collection over the network within the 60s timeout. Many aspects of the problem could be improved, but fundamentally there are more bytes to transmit than is possible in a reasonable amount of time over a reasonable network connection.

Jeffwan · 2021-03-09T19:26:16Z

Is anyone working on this issue? If not, I want to give it a try. It looks like all the changes can be done on the reflector side in client-go.

lavalamp · 2021-03-09T19:31:53Z

Feel free to give it a try--this issue is much more easily described than implemented :)

Jeffwan · 2021-03-09T19:33:25Z

/assign @Jeffwan

fejta-bot · 2021-06-07T19:36:09Z

Issues go stale after 90d of inactivity.
Mark the issue as fresh with /remove-lifecycle stale.
Stale issues rot after an additional 30d of inactivity and eventually close.

If this issue is safe to close now please do so with /close.

Send feedback to sig-contributor-experience at kubernetes/community.
/lifecycle stale

fejta-bot · 2021-07-07T20:16:21Z

Stale issues rot after 30d of inactivity.
Mark the issue as fresh with /remove-lifecycle rotten.
Rotten issues close after an additional 30d of inactivity.

If this issue is safe to close now please do so with /close.

Send feedback to sig-contributor-experience at kubernetes/community.
/lifecycle rotten

k8s-triage-robot · 2021-08-06T20:17:21Z

The Kubernetes project currently lacks enough active contributors to adequately respond to all issues and PRs.

This bot triages issues and PRs according to the following rules:

After 90d of inactivity, lifecycle/stale is applied
After 30d of inactivity since lifecycle/stale was applied, lifecycle/rotten is applied
After 30d of inactivity since lifecycle/rotten was applied, the issue is closed

You can:

Reopen this issue or PR with /reopen
Mark this issue or PR as fresh with /remove-lifecycle rotten
Offer to help out with Issue Triage

Please send feedback to sig-contributor-experience at kubernetes/community.

/close

k8s-ci-robot · 2021-08-06T20:17:27Z

@k8s-triage-robot: Closing this issue.


lavalamp · 2021-08-06T20:46:39Z

/reopen
/lifecycle frozen

k8s-ci-robot · 2021-08-06T20:46:44Z

@lavalamp: Reopened this issue.


k8s-triage-robot · 2023-02-08T05:24:35Z

This issue has not been updated in over 1 year, and should be re-triaged.

You can:

Confirm that this issue is still relevant with /triage accepted (org members only)
Close this issue with /close

For more details on the triage process, see https://www.kubernetes.dev/docs/guide/issue-triage/

/remove-triage accepted

cici37 · 2023-02-09T18:08:02Z

/triage accepted

k8s-triage-robot · 2024-02-09T18:32:49Z

This issue has not been updated in over 1 year, and should be re-triaged.

You can:

Confirm that this issue is still relevant with /triage accepted (org members only)
Close this issue with /close

For more details on the triage process, see https://www.kubernetes.dev/docs/guide/issue-triage/

/remove-triage accepted

seans3 · 2024-02-09T18:46:21Z

/triage accepted

lavalamp added the kind/feature Categorizes issue or PR as related to a new feature. label Jan 28, 2021

k8s-ci-robot added needs-sig Indicates an issue or PR lacks a `sig/foo` label and requires one. needs-triage Indicates an issue or PR lacks a `triage/foo` label and requires one. labels Jan 28, 2021

lavalamp mentioned this issue Jan 28, 2021

idea: More memory efficient watch cache #90179

Open

k8s-ci-robot added sig/api-machinery Categorizes an issue or PR as relevant to SIG API Machinery. and removed needs-sig Indicates an issue or PR lacks a `sig/foo` label and requires one. labels Jan 28, 2021

k8s-ci-robot added triage/accepted Indicates an issue or PR is ready to be actively worked on. and removed needs-triage Indicates an issue or PR lacks a `triage/foo` label and requires one. labels Feb 4, 2021

k8s-ci-robot assigned Jeffwan Mar 9, 2021

k8s-ci-robot added the lifecycle/stale Denotes an issue or PR has remained open with no activity and has become stale. label Jun 7, 2021

k8s-ci-robot added lifecycle/rotten Denotes an issue or PR that has aged beyond stale and will be auto-closed. and removed lifecycle/stale Denotes an issue or PR has remained open with no activity and has become stale. labels Jul 7, 2021

k8s-ci-robot closed this as completed Aug 6, 2021

k8s-ci-robot reopened this Aug 6, 2021

k8s-ci-robot added lifecycle/frozen Indicates that an issue or PR should not be auto-closed due to staleness. and removed lifecycle/rotten Denotes an issue or PR that has aged beyond stale and will be auto-closed. labels Aug 6, 2021

lavalamp mentioned this issue Jan 26, 2022

KEP-3157: allow informers for getting a stream of data instead of chunking kubernetes/enhancements#3142

Merged

lavalamp mentioned this issue Apr 29, 2022

Sharding list requests #109667

Closed

k8s-ci-robot added needs-triage Indicates an issue or PR lacks a `triage/foo` label and requires one. and removed triage/accepted Indicates an issue or PR is ready to be actively worked on. labels Feb 8, 2023

k8s-ci-robot added triage/accepted Indicates an issue or PR is ready to be actively worked on. and removed needs-triage Indicates an issue or PR lacks a `triage/foo` label and requires one. labels Feb 9, 2023

k8s-ci-robot added needs-triage Indicates an issue or PR lacks a `triage/foo` label and requires one. and removed triage/accepted Indicates an issue or PR is ready to be actively worked on. labels Feb 9, 2024

k8s-ci-robot added triage/accepted Indicates an issue or PR is ready to be actively worked on. and removed needs-triage Indicates an issue or PR lacks a `triage/foo` label and requires one. labels Feb 9, 2024
