Remove `cuda::proclaim_return_type` from nested lambda #14607

ttnghia · 2023-12-10T17:34:42Z

This removes cuda::proclaim_return_type from a device lambda because that lambda is going to be nested inside another device lambda, which is in turn enclosed by cuda::proclaim_return_type.

This PR is to fix a compile issue that we encountered:

/usr/local/cuda/include/cuda/std/detail/libcxx/include/__functional/invoke.h(402): error: 
calling a __device__ function("cudf::tdigest::detail::_NV_ANON_NAMESPACE::build_output_column(int,   
  ::std::unique_ptr<   ::cudf::column,     ::std::default_delete<   ::cudf::column> >  &&,     ::std::unique_ptr<   ::cudf::column,     ::std::default_delete<   ::cudf::column> >  &&,     ::std::unique_ptr<   ::cudf::column,     ::std::default_delete<   ::cudf::column> >  &&,     ::std::unique_ptr<   ::cudf::column,     ::std::default_delete<   ::cudf::column> >  &&,     ::std::unique_ptr<   ::cudf::column,     ::std::default_delete<   ::cudf::column> >  &&, bool,  ::rmm::cuda_stream_view,  ::rmm::mr::device_memory_resource *)
::[lambda(int) (instance 2)]::operator ()(int) const") from a __host__ __device__ function("__invoke") is not allowed

Note: The issue is reproducible only in our build environment: ARM architecture, cuda 12 + rockylinux8.

Closes #14610.

Signed-off-by: Nghia Truong <nghiat@nvidia.com>

cpp/src/quantiles/tdigest/tdigest_aggregation.cu

davidwendt · 2023-12-11T15:33:12Z

Perhaps @bdice should verify this does not break his CCCL PR. #14576

bdice · 2023-12-11T15:45:38Z

Perhaps @bdice should verify this does not break his CCCL PR. #14576

I’ll verify this as soon as possible.

Signed-off-by: Nghia Truong <nghiat@nvidia.com>

bdice

I confirmed that #14576 builds with this change. I noted in #14610 that we should file an issue with CCCL that documents the compiler versions / etc. used to trigger this error, with a minimal reproduction (especially since RAPIDS CI doesn't complain). The proclaim_return_type wrapper should not break normal device lambda compilation.

ttnghia · 2023-12-11T18:12:26Z

/merge

This removes `cuda::proclaim_return_type` from a device lambda because that lambda is going to be nested inside another device lambda, which is in turn enclosed by `cuda::proclaim_return_type`. This PR is to fix a compile issue that we encountered: ``` /usr/local/cuda/include/cuda/std/detail/libcxx/include/__functional/invoke.h(402): error: calling a __device__ function("cudf::tdigest::detail::_NV_ANON_NAMESPACE::build_output_column(int, ::std::unique_ptr< ::cudf::column, ::std::default_delete< ::cudf::column> > &&, ::std::unique_ptr< ::cudf::column, ::std::default_delete< ::cudf::column> > &&, ::std::unique_ptr< ::cudf::column, ::std::default_delete< ::cudf::column> > &&, ::std::unique_ptr< ::cudf::column, ::std::default_delete< ::cudf::column> > &&, ::std::unique_ptr< ::cudf::column, ::std::default_delete< ::cudf::column> > &&, bool, ::rmm::cuda_stream_view, ::rmm::mr::device_memory_resource *) ::[lambda(int) (instance 2)]::operator ()(int) const") from a __host__ __device__ function("__invoke") is not allowed ``` Note: The issue is reproducible only in our build environment: ARM architecture, cuda 12 + rockylinux8. Closes rapidsai#14610. Authors: - Nghia Truong (https://github.com/ttnghia) Approvers: - Michael Schellenberger Costa (https://github.com/miscco) - Karthikeyan (https://github.com/karthikeyann) - https://github.com/nvdbaranec - Bradley Dice (https://github.com/bdice) URL: rapidsai#14607

Remove proclaim_return_type from lambda

fb844a8

Signed-off-by: Nghia Truong <nghiat@nvidia.com>

ttnghia added bug Something isn't working 3 - Ready for Review Ready for review by team libcudf Affects libcudf (C++/CUDA) code. Spark Functionality that helps Spark RAPIDS labels Dec 10, 2023

ttnghia requested a review from bdice December 10, 2023 17:34

ttnghia self-assigned this Dec 10, 2023

ttnghia requested a review from a team as a code owner December 10, 2023 17:34

ttnghia requested a review from vuule December 10, 2023 17:34

ttnghia added the non-breaking Non-breaking change label Dec 10, 2023

ttnghia commented Dec 10, 2023

View reviewed changes

cpp/src/quantiles/tdigest/tdigest_aggregation.cu Show resolved Hide resolved

ttnghia added the libcudf blocker label Dec 11, 2023

miscco approved these changes Dec 11, 2023

View reviewed changes

cpp/src/quantiles/tdigest/tdigest_aggregation.cu Show resolved Hide resolved

Add comment

d9b7cbe

Signed-off-by: Nghia Truong <nghiat@nvidia.com>

karthikeyann approved these changes Dec 11, 2023

View reviewed changes

nvdbaranec approved these changes Dec 11, 2023

View reviewed changes

bdice mentioned this pull request Dec 11, 2023

[BUG] compile error tdigest_aggregation.cu on cuda 12.2 on arm64 #14610

Closed

bdice approved these changes Dec 11, 2023

View reviewed changes

rapids-bot bot merged commit fcaebeb into rapidsai:branch-24.02 Dec 11, 2023
67 checks passed

ttnghia deleted the fix_tdigest_agg branch December 12, 2023 18:07

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Remove `cuda::proclaim_return_type` from nested lambda #14607

Remove `cuda::proclaim_return_type` from nested lambda #14607

ttnghia commented Dec 10, 2023 •

edited

Loading

davidwendt commented Dec 11, 2023

bdice commented Dec 11, 2023

bdice left a comment

ttnghia commented Dec 11, 2023

Remove cuda::proclaim_return_type from nested lambda #14607

Remove cuda::proclaim_return_type from nested lambda #14607

Conversation

ttnghia commented Dec 10, 2023 • edited Loading

davidwendt commented Dec 11, 2023

bdice commented Dec 11, 2023

bdice left a comment

Choose a reason for hiding this comment

ttnghia commented Dec 11, 2023

Remove `cuda::proclaim_return_type` from nested lambda #14607

Remove `cuda::proclaim_return_type` from nested lambda #14607

ttnghia commented Dec 10, 2023 •

edited

Loading