cxx

Here are 369 public repositories matching this topic...

thrust
nv-dlasalle commented Mar 19, 2021

Problem

CUB can be placed into a custom namespace via CUB_NS_PREFIX and CUB_NS_POSTFIX, so that multiple shared libraries can each use their own copy of it (and thus different versions can safely coexist). Without this, static variables used for caching could cause problems (e.g., https://github.com/NVIDIA/cub/blob/main/cub/util_device.cuh#L212).

Thrust, however, depends on CUB and
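
The prefix/postfix mechanism described in this issue can be sketched in a single file. This is only an illustration of the idea, not CUB's actual code: MINI_NS_PREFIX, MINI_NS_POSTFIX, mini_lib, cached_device_count, lib_a and lib_b are all hypothetical names, and the "library" is stamped out with a macro where a real project would include the same header from two shared libraries.

```cpp
#include <cassert>

// Hypothetical stand-in for a header-only library that caches a value in a
// function-local static. A macro is used so the sketch fits in one file.
#define MINI_LIB_BODY                       \
    namespace mini_lib {                    \
    inline int& cached_device_count() {     \
        static int cache = -1;              \
        return cache;                       \
    }                                       \
    }

// Copy wrapped for "library A" (hypothetical prefix/postfix macros).
#define MINI_NS_PREFIX namespace lib_a {
#define MINI_NS_POSTFIX }
MINI_NS_PREFIX
MINI_LIB_BODY
MINI_NS_POSTFIX
#undef MINI_NS_PREFIX
#undef MINI_NS_POSTFIX

// Copy wrapped for "library B": same source, different enclosing namespace.
#define MINI_NS_PREFIX namespace lib_b {
#define MINI_NS_POSTFIX }
MINI_NS_PREFIX
MINI_LIB_BODY
MINI_NS_POSTFIX
#undef MINI_NS_PREFIX
#undef MINI_NS_POSTFIX

// Writing to one copy's cache leaves the other copy's cache untouched,
// because the two statics live in differently named functions.
void demo_independent_caches() {
    lib_a::mini_lib::cached_device_count() = 4;
    assert(lib_b::mini_lib::cached_device_count() == -1);
}
```

Because each copy ends up with a distinct fully qualified name, the two caches cannot collide at link or load time, which is the property the issue relies on.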

scullionw commented Apr 26, 2021

Expected Behavior

I can pass std::vector<int32_t>, but the C++ library I want to use uses int, not int32_t

std::vector<int> build_array() {
    auto values = std::vector<int> {1,2,3,4,5};
    for (auto& v: values) v = v * 2;
    return values;
}

Not sure how to get around this since I can't impl VectorElement for c_int due to the orphan rule? There must b
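
One possible workaround, which the excerpt above does not confirm, is a thin C++ shim that re-exposes the result as std::vector<std::int32_t>. The sketch below assumes int is no wider than 32 bits on the target platform; build_array_i32 and check_shim are hypothetical names, and whether such a shim suits the bindings in question is an assumption.

```cpp
#include <cassert>
#include <cstdint>
#include <vector>

// Function from the issue, reproduced so the sketch is self-contained.
std::vector<int> build_array() {
    auto values = std::vector<int> {1, 2, 3, 4, 5};
    for (auto& v : values) v = v * 2;
    return values;
}

// Hypothetical shim: copies the int elements into a fixed-width vector.
// The static_assert rejects platforms where the copy could lose data.
std::vector<std::int32_t> build_array_i32() {
    static_assert(sizeof(int) <= sizeof(std::int32_t),
                  "int must fit in int32_t for a lossless copy");
    const std::vector<int> values = build_array();
    return std::vector<std::int32_t>(values.begin(), values.end());
}

// Small sanity check: the shim preserves the element count.
void check_shim() {
    assert(build_array_i32().size() == build_array().size());
}
```

The copy costs one extra allocation per call. On platforms where std::int32_t is an alias for int the two vector types are identical, and the shim could return build_array() directly.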

cub
allisonvacanti commented Apr 23, 2021

DeviceSegmentedReduce reduces each segment using the same kernel as DeviceReduce, and thus provides the same run-to-run consistency guarantee for non-associative types. The documentation for DeviceSegmentedReduce should be updated to reflect this.

E.g. this bit: https://github.com/NVIDIA/cub/blob/a8910accebe74ce043a13026f8e71d678cddd6c1/cub/device/device_reduce.cuh#L91-L95
needs to be ad
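
As background for the "non-associative types" mentioned in this issue: floating-point addition is a common example of an operation whose result depends on grouping, which is why a fixed reduction order matters for run-to-run consistency. The sketch below uses plain C++ rather than CUB, the function names are hypothetical, and it assumes default floating-point semantics (no fast-math style reassociation).

```cpp
#include <cassert>

// The same three values summed with two different groupings.
double sum_left_to_right(double a, double b, double c) { return (a + b) + c; }
double sum_right_to_left(double a, double b, double c) { return a + (b + c); }

void demo_grouping_changes_result() {
    const double big = 1e20;
    // (big + -big) is exactly 0, so adding 1.0 afterwards gives 1.0.
    assert(sum_left_to_right(big, -big, 1.0) == 1.0);
    // (-big + 1.0) rounds back to -big, so the 1.0 is lost and the sum is 0.
    assert(sum_right_to_left(big, -big, 1.0) == 0.0);
}
```

A parallel reduction that grouped its inputs differently from run to run could therefore return different results for the same data.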
