Fixes bug in temporary decompression space estimation before calling nvcomp #11879

Merged

Conversation

@abellina (Contributor) commented on Oct 7, 2022

Closes #11878

This PR fixes an issue we noticed while trying to read a zstd Parquet file, where the cuDF code requested a very large allocation (far larger than GPU memory, on the order of 50 or 60 GB).

We bisected the issue to this PR: #11652.

The fix has been verified with the original file and Spark.

Thanks to @nvdbaranec, @jbrennan333, @mythrocks and @vuule for help looking into this!

@abellina changed the base branch from branch-22.12 to branch-22.10 on October 7, 2022 at 21:31
The github-actions bot added the libcudf (Affects libcudf C++/CUDA code) label on Oct 7, 2022
@abellina added the bug (Something isn't working) and non-breaking (Non-breaking change) labels on Oct 7, 2022
@abellina (Contributor, Author) commented on Oct 7, 2022

@upsj FYI

Review comments on this snippet from the diff:

batched_decompress_get_temp_size_ex(
  compression, num_chunks, max_uncomp_chunk_size, &temp_size, max_total_uncomp_size)
  .value_or(batched_decompress_get_temp_size(
Contributor:

Are we inadvertently changing the semantics here?

The intention of the original code seemed to be to evaluate the "else" path only on std::nullopt; i.e. if batched_decompress_get_temp_size_ex() returned nvcompErrorInternal, batched_decompress_get_temp_size() was not to be called.

In the new version, batched_decompress_get_temp_size() is called in both cases. @abellina, @vuule, is that ok?

Contributor:

It seems like what we should be checking is whether nvcomp_status is simply nullopt, and only then make the second call.

Collaborator:

Yeah - I'm not sure how much value there is in calling the second if the first failed in the library. It will likely also fail.

Contributor:

I don't see a reason against trying the old API if the new one failed. Agreed that it's unlikely to help.
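
As context for the thread above, here is a minimal, self-contained C++ sketch (the function names are hypothetical stand-ins, not the cuDF or nvcomp API) of the point being discussed: std::optional::value_or always evaluates its argument, so the fallback estimator runs even when the first call already produced a value, whereas an explicit check on the optional only invokes the fallback on std::nullopt.

#include <cstddef>
#include <iostream>
#include <optional>

// Stand-in for the newer API: returns a value on success, std::nullopt on failure.
std::optional<std::size_t> temp_size_ex()
{
  std::cout << "new API called\n";
  return 1024;
}

// Stand-in for the older fallback API.
std::size_t temp_size_fallback()
{
  std::cout << "fallback called\n";
  return 2048;
}

int main()
{
  // Eager: value_or evaluates its argument up front, so the fallback
  // runs even though the new API succeeded.
  auto eager = temp_size_ex().value_or(temp_size_fallback());

  // Lazy: the fallback is called only when the optional is empty.
  auto maybe = temp_size_ex();
  auto lazy  = maybe ? *maybe : temp_size_fallback();

  std::cout << eager << ' ' << lazy << '\n';  // both are 1024 here
  return 0;
}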

@vuule (Contributor) left a comment:

Looks good.

@abellina (Contributor, Author) commented on Oct 7, 2022

@mythrocks sent me a patch (61e2499) that fixed my code styling, many thanks! I am having conda issues.

@mythrocks (Contributor) left a comment:

LGTM!

@abellina (Contributor, Author) commented on Oct 7, 2022

I am running some end-to-end tests with this patch and will post an update later tonight, as I am having some unrelated local issues. I'll leave this in draft until that test completes, but I don't foresee it going beyond today.

The codecov bot commented on Oct 8, 2022

Codecov Report

Base: 87.51% // Head: 87.51% // No change to project coverage 👍

Coverage data is based on head (61e2499) compared to base (4c4bce9).
Patch has no changes to coverable lines.

Additional details and impacted files
@@              Coverage Diff              @@
##           branch-22.10   #11879   +/-   ##
=============================================
  Coverage         87.51%   87.51%           
=============================================
  Files               133      133           
  Lines             21826    21826           
=============================================
  Hits              19100    19100           
  Misses             2726     2726           


@jbrennan333 (Collaborator) left a comment:

+1 lgtm

@abellina marked this pull request as ready for review on October 8, 2022 at 00:58
@abellina requested a review from a team as a code owner on October 8, 2022 at 00:58
@abellina requested reviews from cwharris and davidwendt and removed the request for a team on October 8, 2022 at 00:58
@abellina (Contributor, Author) commented on Oct 8, 2022

Thanks all for the reviews. I am taking this out of draft, as it passes my test locally (the Parquet file I was trying to read no longer OOMs).

@vuule (Contributor) commented on Oct 8, 2022

CC @GregoryKimball: this is another late fix for 22.10.

@jolorunyomi merged commit 17868b7 into rapidsai:branch-22.10 on Oct 8, 2022.
Labels
bug (Something isn't working), libcudf (Affects libcudf C++/CUDA code), non-breaking (Non-breaking change)
Development

Successfully merging this pull request may close these issues.

[BUG] OOM in readParquet due to too large temp memory estimation in cuDF
7 participants