RapidsMPF streaming-engine translation #20161
Merged
150 commits
1f030b4
initial translation sketch
rjzamora 1ff02fc
add basic dataframe_scan support
rjzamora 2821413
update names
rjzamora 704b2e6
simplify and update test
rjzamora bc691d2
linting
rjzamora b011083
refactor
rjzamora 1f1fba6
more refactoring
rjzamora 34de625
begin larger refactor
rjzamora ce7cfdb
adjust fallback
rjzamora 4107d59
Merge remote-tracking branch 'upstream/branch-25.12' into rapidsmpf-t…
rjzamora 7788abb
scan testing
rjzamora 2fdccae
create new test directory
rjzamora ce9658b
update comments
rjzamora 2c4b54c
adjust fallback
rjzamora 3785cc9
futher notes
rjzamora ccaa0e1
Merge branch 'branch-25.12' into rapidsmpf-translation
rjzamora cc017c3
generalize ScanPartitionPlan to IOPartitionPlan
rjzamora 6206dce
Merge remote-tracking branch 'upstream/branch-25.12' into rapidsmpf-t…
rjzamora 711708c
Merge branch 'rapidsmpf-translation' of github.com:rjzamora/cudf into…
rjzamora 7b05a34
update Rechunk class
rjzamora ccfd7c2
split union into distinct file
rjzamora 53a8cb2
avoid using task-engine lowering
rjzamora 4b666eb
add note
rjzamora 88613b5
track whether data is broadcasted
rjzamora 70495cc
linting
rjzamora 854306e
simplify
rjzamora 9d2767e
reusing task-engine lowering in most places - still debugging
rjzamora 7f03abe
debugging deadlock
rjzamora e1ac8dc
Merge remote-tracking branch 'upstream/branch-25.12' into rapidsmpf-t…
rjzamora ce15be1
small revisions - code still broken
rjzamora e206277
minor cleanup (still a mess)
rjzamora 2bb97df
still debugging - cursor probably changes some things incorrectly
rjzamora 3c64329
bug fixes
rjzamora 087e5b9
cleanup
rjzamora 0ae9b24
add throttling - though it probably isn't working
rjzamora 0097266
Merge remote-tracking branch 'upstream/branch-25.12' into rapidsmpf-t…
rjzamora a2805c6
fix subtle bug
rjzamora b094aa5
standardize argument order
rjzamora 5046925
add --rapidsmpf-engine option to pdsh
rjzamora 9b8b6e2
minor cleanup
rjzamora 2315621
removing print statements
rjzamora 1ef3907
Merge remote-tracking branch 'upstream/branch-25.12' into rapidsmpf-t…
rjzamora bbb4be1
very-basic join support
rjzamora 95bb092
Merge remote-tracking branch 'upstream/branch-25.12' into rapidsmpf-t…
rjzamora d12011d
remove final concatenation from lowering
rjzamora 66c208f
Merge remote-tracking branch 'upstream/branch-25.12' into rapidsmpf-t…
rjzamora c480075
cleanup
rjzamora 7848c8e
Merge remote-tracking branch 'upstream/branch-25.12' into rapidsmpf-t…
rjzamora c89505a
avoid concurrent sends - too much memory pressure
rjzamora 56a074c
use default UVM resource for now
rjzamora eebfeb2
improve bcast join a bit (for inner joins)
rjzamora 9f6e46c
simplify max_io_threads
rjzamora f01899a
Merge remote-tracking branch 'upstream/branch-25.12' into rapidsmpf-t…
rjzamora d409644
fix scan bug
rjzamora 147a5d3
make io-throttling local to IO node
rjzamora 54ad773
Merge remote-tracking branch 'upstream/branch-25.12' into rapidsmpf-t…
rjzamora a31450b
address partial code review
rjzamora 92b9cbe
set lower bound on estimate for string column size
rjzamora 563eb42
fix bug
rjzamora 2d8ca4e
incorperate lower-limit on storage_size (for now)
rjzamora 2e63d4c
pull out unnecessary changes
rjzamora 72152fa
Merge branch 'branch-25.12' into rapidsmpf-translation
rjzamora 12730dc
Merge remote-tracking branch 'upstream/branch-25.12' into rapidsmpf-t…
rjzamora 46970de
Merge remote-tracking branch 'upstream/branch-25.12' into rapidsmpf-t…
rjzamora 93fd831
Merge remote-tracking branch 'upstream/branch-25.12' into rapidsmpf-t…
rjzamora f078e2d
update config name to runtime
rjzamora a113047
add DEFAULT_RUNTIME - still need to debug
rjzamora 5a93792
updated experimental tests to work with rapidsmpf runtime
rjzamora f2e98fc
remove redundant tests
rjzamora f7574d3
Merge remote-tracking branch 'upstream/branch-25.12' into rapidsmpf-t…
rjzamora 653190a
revise multicast behavior - still pretty ugly
rjzamora a509492
Merge remote-tracking branch 'upstream/branch-25.12' into rapidsmpf-t…
rjzamora a7a3004
account for recent DataFrame.from_table change
rjzamora adc46b5
Merge remote-tracking branch 'upstream/branch-25.12' into rapidsmpf-t…
rjzamora a5b3b35
use channel-pair concept
rjzamora cfa0113
Merge remote-tracking branch 'upstream/branch-25.12' into rapidsmpf-t…
rjzamora 7b003ad
update comment
rjzamora 5403cb4
tweak code-coverage
rjzamora 53382ba
revise problematic union_dependancy attribute
rjzamora 4f18016
Merge remote-tracking branch 'upstream/branch-25.12' into rapidsmpf-t…
rjzamora 78a8dcd
revise multicast
rjzamora ae05e00
adjust test coverage
rjzamora 720d023
adopt ChannelManager class
rjzamora 80ee31d
multicast revisions
rjzamora 7ef2f81
Rename multicast to fanout in rapidsmpf module
rjzamora 4f3b138
Move shutdown_on_error to utils module
rjzamora 85deeb8
partially address code review (simplify join logic for now)
rjzamora a9dad1e
Merge branch 'main' into rapidsmpf-translation
rjzamora c1c58d3
rename
rjzamora 9364d0f
Merge branch 'rapidsmpf-translation' of github.com:rjzamora/cudf into…
rjzamora 940b1a1
Replace 'streaming engine' with 'streaming runtime' terminology
rjzamora f3e22c7
Use operator.add instead of lambda in process_children
rjzamora 15e68d2
update comments
rjzamora 47d166a
Merge remote-tracking branch 'upstream/main' into rapidsmpf-translation
rjzamora f5c34b3
minor simplification
rjzamora 217d347
Merge remote-tracking branch 'upstream/main' into rapidsmpf-translation
rjzamora 02175de
avoid using DEFAULT_STREAM in most places
rjzamora a2f4635
Merge remote-tracking branch 'upstream/main' into rapidsmpf-translation
rjzamora 495da51
remove DEFAULT_STREAM usage from io.py
rjzamora a7739de
Merge remote-tracking branch 'upstream/main' into rapidsmpf-translation
rjzamora db55a75
Merge remote-tracking branch 'upstream/main' into rapidsmpf-translation
rjzamora 8a3aebd
update Message import
rjzamora d38f58f
Merge remote-tracking branch 'upstream/main' into rapidsmpf-translation
rjzamora 2d5dd16
Merge remote-tracking branch 'upstream/main' into rapidsmpf-translation
rjzamora ad537a8
use pool
rjzamora 1128365
add tests; add --stream-policy pdsh arg
rjzamora 74afb07
Merge remote-tracking branch 'upstream/main' into rapidsmpf-translation
rjzamora 424617c
Merge remote-tracking branch 'upstream/main' into rapidsmpf-translation
rjzamora ac22f6e
make sure at least two IO nodes are always active
rjzamora 3362f0a
Merge remote-tracking branch 'upstream/main' into rapidsmpf-translation
rjzamora ff2b50e
adjust for breaking rapidsmpf change
rjzamora 687fe5b
Merge remote-tracking branch 'upstream/main' into rapidsmpf-translation
rjzamora 15dc89e
use --rmm-async for single cluster as well
rjzamora 42c3f92
Merge remote-tracking branch 'upstream/main' into rapidsmpf-translation
rjzamora fc220ae
enable spilling in local shuffle (via env variable for now)
rjzamora e1713d7
Merge remote-tracking branch 'upstream/main' into rapidsmpf-translation
rjzamora 455bd4f
update rapidsmpf tests to use the rapidsmpf runtime
rjzamora debbed0
use Lineariser
rjzamora 024dc49
add Lineariser
rjzamora d58fc11
Merge remote-tracking branch 'upstream/main' into rapidsmpf-translati…
rjzamora 01dc8bf
Merge remote-tracking branch 'upstream/main' into rapidsmpf-translation
rjzamora 028ad18
Reduce memory usage
rjzamora db775bc
reduce memory usage further
rjzamora 037e3f1
Merge remote-tracking branch 'upstream/main' into rapidsmpf-translati…
rjzamora 97e2dbf
rename Lineariser
rjzamora 0941cf9
Merge remote-tracking branch 'upstream/main' into rapidsmpf-translati…
rjzamora bd8ae2f
Merge branch 'rapidsmpf-translation-linearise' into rapidsmpf-transla…
rjzamora a805be7
avoid possible stream-race conditions
rjzamora 961898d
Merge remote-tracking branch 'upstream/main' into rapidsmpf-translation
rjzamora f32ae12
revise type check
rjzamora 36277b6
Merge remote-tracking branch 'upstream/main' into rapidsmpf-translation
rjzamora 55eb6ed
Merge remote-tracking branch 'upstream/main' into rapidsmpf-translation
rjzamora 87d849c
use rapidsmpf Lineariser
rjzamora 546540b
change default CUDAStreamPolicy (for now)
rjzamora 16ad915
Merge remote-tracking branch 'upstream/main' into rapidsmpf-translation
rjzamora f066a7f
Use default stream policy in pdsh for now
rjzamora 4cbb36c
fix lineariser logic
rjzamora 7aa7dc8
Merge remote-tracking branch 'upstream/main' into rapidsmpf-translation
rjzamora 6f13d5f
Merge remote-tracking branch 'upstream/main' into rapidsmpf-translation
rjzamora 71c470e
adjust defaults and configs
rjzamora 8b463fc
remove comment
rjzamora 81aef3f
Merge remote-tracking branch 'upstream/main' into rapidsmpf-translation
rjzamora 4e784de
add test
rjzamora 60edcc2
Merge remote-tracking branch 'upstream/main' into rapidsmpf-translation
rjzamora f91fb11
roll back dramatic default change (for now)
rjzamora 0f7936b
partial code review
rjzamora 9bfa355
update comments
rjzamora 7d7a007
Merge remote-tracking branch 'upstream/main' into rapidsmpf-translation
rjzamora 5a52ef5
adjust LocalShuffle
rjzamora 87bf05b
more comments
rjzamora
8 changes: 8 additions & 0 deletions
python/cudf_polars/cudf_polars/experimental/rapidsmpf/__init__.py
| Original file line number | Diff line number | Diff line change |
|---|---|---|
| | | `@@ -0,0 +1,8 @@` |
| | | `# SPDX-FileCopyrightText: Copyright (c) 2025 NVIDIA CORPORATION & AFFILIATES.` |
| | | `# SPDX-License-Identifier: Apache-2.0` |
| | | |
| | | `"""RapidsMPF streaming-engine support."""` |
| | | |
| | | `from __future__ import annotations` |
| | | |
| | | `__all__: list[str] = []` |
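For reviewers wondering what the empty `__all__` in this new `__init__.py` does in practice, here is a minimal sketch using plain Python semantics. The module name `demo_pkg` and the `helper` attribute are hypothetical stand-ins for illustration only; they are not part of this PR or of cudf_polars.

```python
import sys
import types

# Hypothetical stand-in for a package whose __init__.py declares an empty
# __all__, as the file added in this diff does.
demo = types.ModuleType("demo_pkg", "RapidsMPF streaming-engine support.")
demo.__all__ = []
# Hypothetical attribute, added only to show it is not star-exported.
demo.helper = lambda: "reachable by explicit attribute access"
sys.modules["demo_pkg"] = demo

# A star-import consults __all__, so nothing is copied into the namespace.
namespace: dict[str, object] = {}
exec("from demo_pkg import *", namespace)

# exec() injects __builtins__; ignore dunder names when listing exports.
exported = sorted(name for name in namespace if not name.startswith("__"))
print(exported)

# Explicit access is unaffected by __all__.
print(demo.helper())
```

In other words, an empty `__all__` only limits what `from package import *` pulls in; submodules and attributes remain importable by name.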