[C++] A more RAM-efficient top-k sink node #29786

Open
asfimport opened this issue Oct 1, 2021 · 1 comment

Comments

@asfimport
Collaborator

asfimport commented Oct 1, 2021

Mentioned here:

#11274 (review)

For example, a top-k implementation could periodically (whenever batches_ reaches some configurable number of rows) run through the buffered data and discard rows that can no longer be part of the top k. The way the sink node is written now, it would still require buffering the entire dataset in memory (and/or spilling over).
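
A minimal sketch of the periodic-discard idea, under stated assumptions: it uses plain standard-library containers rather than Arrow's exec node or batch types, and the names BoundedTopK, Consume, Finish, and prune_threshold are hypothetical, chosen only for illustration. It keeps the k largest values and prunes the buffer with std::nth_element whenever the buffered row count reaches the threshold.

```cpp
#include <algorithm>
#include <cassert>
#include <cstddef>
#include <functional>
#include <vector>

// Hypothetical stand-in for a top-k sink (not an Arrow class): keeps the k
// largest values seen so far and prunes whenever the buffer reaches
// `prune_threshold` rows.
class BoundedTopK {
 public:
  BoundedTopK(std::size_t k, std::size_t prune_threshold)
      : k_(k), prune_threshold_(std::max(prune_threshold, k + 1)) {
    assert(k > 0);
  }

  // Append one incoming batch, then prune if the buffer reached the threshold.
  void Consume(const std::vector<long>& batch) {
    rows_.insert(rows_.end(), batch.begin(), batch.end());
    if (rows_.size() >= prune_threshold_) Prune();
  }

  // Return the final top-k values in descending order.
  std::vector<long> Finish() {
    Prune();
    std::sort(rows_.begin(), rows_.end(), std::greater<long>());
    return rows_;
  }

  // Number of rows currently held in memory.
  std::size_t buffered_rows() const { return rows_.size(); }

 private:
  // Partition so the k largest values come first, then drop everything else.
  void Prune() {
    if (rows_.size() <= k_) return;
    std::nth_element(rows_.begin(), rows_.begin() + k_, rows_.end(),
                     std::greater<long>());
    rows_.resize(k_);
  }

  std::size_t k_;
  std::size_t prune_threshold_;
  std::vector<long> rows_;
};
```

With this approach the buffer holds roughly prune_threshold rows plus one incoming batch at its peak, instead of the whole dataset. A larger threshold means fewer pruning passes at the cost of more memory.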


Reporter: Alexander Ocsa / @aocsa

Related issues:

Note: This issue was originally created as ARROW-14202. Please see the migration documentation for further details.

@asfimport
Collaborator Author

Todd Farmer / @toddfarmer:
This issue was last updated over 90 days ago, which may be an indication it is no longer being actively worked. To better reflect the current state, the issue is being unassigned per project policy. Please feel free to re-take assignment of the issue if it is being actively worked, or if you plan to start that work soon.
