[BUG]: PSTL algorithms don't launch on stream's device #9212

Currently, when a PSTL algorithm is called with a `gpu` execution policy that carries a stream, we don't set the current device to the stream's device:
```cpp
cuda::device_ref device{cuda::devices[0]};
cuda::stream stream{device};
const auto policy = cuda::execution::gpu.with(cuda::get_stream, stream);

cudaSetDevice(1);
cuda::std::find(policy, ...); // oops, launches work on device 1
```
As things stand, it is the user's responsibility to make sure the current device matches the stream's device.

However, this is inconsistent with `cuda::launch`, which ignores the current device and always sets it to the stream's device. I think we should fix this by guarding all device-related operations in PSTL algorithms with `__ensure_current_context`, so that the example behaves as expected:
```cpp
cuda::device_ref device{cuda::devices[0]};
cuda::stream stream{device};
const auto policy = cuda::execution::gpu.with(cuda::get_stream, stream);

cudaSetDevice(1);
cuda::std::find(policy, ...); // ok, launches work on device 0
```
