Register default allocator for host memory #9908

Merged
merged 5 commits into from
Dec 4, 2023

@revans2 revans2 commented Nov 30, 2023

This gets us to a point where CUDF Java can allocate host memory through HostAlloc, but it is not 100% perfect yet. Tests pass, but the integration tests appear to have a livelock when running with limited memory, which I still need to track down. It should still be good enough for us to get started on the other HostMemoryLimit issues.

Signed-off-by: Robert (Bobby) Evans <bobby@apache.org>
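
For readers unfamiliar with the pattern, the idea of registering a default host allocator can be sketched roughly as follows. This is a minimal illustration only, not the cuDF or spark-rapids API: `HostAllocator`, `HeapAllocator`, `LimitedAllocator`, and `DefaultHostAllocator` are hypothetical names, and the byte-budget check is an assumed stand-in for whatever limit `HostAlloc` actually enforces.

```scala
// Hypothetical names throughout; this is not the cuDF or spark-rapids API.
trait HostAllocator {
  def allocate(bytes: Long): Array[Byte]
}

// Fallback allocator with no limit, used until something else is registered.
object HeapAllocator extends HostAllocator {
  def allocate(bytes: Long): Array[Byte] = new Array[Byte](bytes.toInt)
}

// Allocator that checks a byte budget before delegating (assumed behaviour).
final class LimitedAllocator(limit: Long, delegate: HostAllocator) extends HostAllocator {
  private var used = 0L

  def allocate(bytes: Long): Array[Byte] = synchronized {
    if (used + bytes > limit) {
      throw new IllegalStateException(s"host memory limit of $limit bytes exceeded")
    }
    used += bytes
    delegate.allocate(bytes)
  }

  def usedBytes: Long = synchronized(used)
}

// Process-wide registry: library code asks for the default allocator
// instead of allocating directly, so one registration redirects all callers.
object DefaultHostAllocator {
  @volatile private var current: HostAllocator = HeapAllocator
  def set(allocator: HostAllocator): Unit = { current = allocator }
  def get: HostAllocator = current
}
```

Under these assumptions, a single call such as `DefaultHostAllocator.set(new LimitedAllocator(limit, HeapAllocator))` at startup is enough for every later `DefaultHostAllocator.get.allocate(n)` to go through the limited allocator.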

revans2 commented Nov 30, 2023

build


revans2 commented Nov 30, 2023

build


revans2 commented Nov 30, 2023

build

jbrennan333
jbrennan333 previously approved these changes Dec 1, 2023
@jbrennan333 jbrennan333 left a comment

lgtm


revans2 commented Dec 1, 2023

build

jbrennan333
jbrennan333 previously approved these changes Dec 1, 2023

revans2 commented Dec 4, 2023

build

@@ -163,6 +163,9 @@ private class HostAlloc(nonPinnedLimit: Long) extends HostMemoryAllocator with L
     var allocAttemptFinishedWithoutException = false
     try {
       do {
         if (retryCount > 0) {
           logWarning(s"RETRY HOST ALLOC $amount $preferPinned $blocking $retryCount")
Collaborator

did you want to leave this as warn?

Collaborator Author

oops
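
The concern in this thread can be illustrated with a small sketch: a retry loop that logs each retry at debug level rather than warn, so routine retries do not flood the logs. This assumes the intent was to lower the log level, which the thread does not state explicitly. `RecordingLog`, `RetrySketch`, `allocWithRetry`, and `tryAlloc` are hypothetical names, and the loop is simplified compared with the one in the diff.

```scala
import scala.collection.mutable.ArrayBuffer

// Hypothetical stand-in for a logger that records messages per level.
final class RecordingLog {
  val debug = ArrayBuffer.empty[String]
  val warn = ArrayBuffer.empty[String]
  def logDebug(msg: => String): Unit = { debug += msg }
  def logWarning(msg: => String): Unit = { warn += msg }
}

object RetrySketch {
  // Calls tryAlloc until it succeeds or maxRetries retries have been made.
  // Each retry is logged at debug level (an assumption, see above).
  def allocWithRetry(amount: Long, maxRetries: Int, log: RecordingLog)(
      tryAlloc: Long => Option[Array[Byte]]): Option[Array[Byte]] = {
    var retryCount = 0
    var result = tryAlloc(amount)
    while (result.isEmpty && retryCount < maxRetries) {
      retryCount += 1
      log.logDebug(s"RETRY HOST ALLOC $amount $retryCount")
      result = tryAlloc(amount)
    }
    result
  }
}
```

With this shape, an allocation that succeeds on the first attempt logs nothing, and repeated retries stay out of warn-level output.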


revans2 commented Dec 4, 2023

build

@jbrennan333 jbrennan333 left a comment

Looks good.

@revans2 revans2 merged commit 410647c into NVIDIA:branch-24.02 Dec 4, 2023
38 checks passed
@revans2 revans2 deleted the set_cudf_alloc_for_cpu branch December 4, 2023 19:22
@sameerz sameerz added the reliability Features to improve reliability or bugs that severely impact the reliability of the plugin label Dec 5, 2023
4 participants