Special case bool dereferencing (consistent with PyTorch) by jrbyrnes · Pull Request #410 · ROCm/rocPRIM

jrbyrnes · 2023-02-09T01:32:25Z

A PyTorch test has (roughly) the following implementation:

  uint8_t *x = /* some data: 0 or 2-255 */;  // host code
  void *temp = x;                             // host code
  bool *val = static_cast<bool *>(temp);      // host code
  // host or device code: dereference val and use the result

This is invoking undefined behavior.

A PyTorch PR identified this problem and resolved it for the CPU case by special-casing the dereference of bool * (via c10::load). c10::load is then used when iterating over the tensor data to perform the nonzero op.

This patch extends the fix to our iterators.
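For illustration, the special-casing described above can be sketched in plain C++. This is a minimal host-only sketch, not the actual c10::load or rocPRIM code: the names load_impl and load_value are hypothetical, and device code would additionally need the HIP __host__ __device__ qualifiers. The idea is to read the storage as a byte first and only then convert it to bool, so that no out-of-range bool is ever read.

```cpp
#include <cstdint>
#include <cstring>

// Hypothetical helper, for illustration only: generic load that copies
// the object representation out of the source storage.
template <typename T>
struct load_impl {
    static T apply(const void* src) {
        T value;
        std::memcpy(&value, src, sizeof(T));
        return value;
    }
};

// Special case for bool: load the storage as a byte, then convert.
// Any nonzero byte (1, 2, ..., 255) becomes true; 0 becomes false.
template <>
struct load_impl<bool> {
    static bool apply(const void* src) {
        static_assert(sizeof(bool) == sizeof(unsigned char),
                      "this sketch assumes a one-byte bool");
        unsigned char byte;
        std::memcpy(&byte, src, sizeof(byte));
        return byte != 0;
    }
};

// Hypothetical entry point: load a T from raw storage.
template <typename T>
T load_value(const void* src) {
    return load_impl<T>::apply(src);
}
```

With this helper, a caller would write load_value<bool>(ptr) instead of dereferencing a bool * that points at uint8_t data.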

nolmoonen · 2023-02-10T12:04:24Z

@jrbyrnes bool* val = static_cast<bool*>(temp) breaks strict aliasing rules and is undefined behavior. If PyTorch wants to invoke undefined behavior, that's up to PyTorch, but it isn't the responsibility of rocPRIM to account for this.

To work around your problem, you could create a rocprim::transform_iterator to wrap the val pointer, and pass that to rocPRIM.
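A rough sketch of that workaround follows, written as standalone C++ so it can be compiled without HIP. It assumes the caller still has access to the original uint8_t pointer and wraps that (rather than the bool pointer), so that no bool is ever read from the raw bytes. simple_transform_iterator, byte_to_bool and count_true are hypothetical stand-ins for illustration; they are not rocPRIM's classes.

```cpp
#include <cstddef>
#include <cstdint>

// Hypothetical functor: maps any nonzero byte to true.
struct byte_to_bool {
    bool operator()(std::uint8_t b) const { return b != 0; }
};

// Minimal stand-in for a transform iterator (illustration only):
// dereferencing applies the functor to the underlying element.
template <typename InputIterator, typename UnaryFunction, typename ValueType>
class simple_transform_iterator {
public:
    simple_transform_iterator(InputIterator it, UnaryFunction fn)
        : it_(it), fn_(fn) {}

    ValueType operator*() const { return fn_(*it_); }
    ValueType operator[](std::ptrdiff_t i) const { return fn_(it_[i]); }
    simple_transform_iterator& operator++() { ++it_; return *this; }

private:
    InputIterator it_;
    UnaryFunction fn_;
};

// Example consumer: counts elements that evaluate to true,
// reading the data only as bytes.
inline std::size_t count_true(const std::uint8_t* data, std::size_t n) {
    simple_transform_iterator<const std::uint8_t*, byte_to_bool, bool>
        it(data, byte_to_bool());
    std::size_t count = 0;
    for (std::size_t i = 0; i < n; ++i, ++it) {
        if (*it) {
            ++count;
        }
    }
    return count;
}
```

In a real rocPRIM call, the equivalent would presumably be a rocprim::transform_iterator built over the uint8_t pointer with a device-callable functor, passed wherever the bool pointer was passed before.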

jrbyrnes · 2023-02-10T17:53:17Z

Hi @nolmoonen, thanks for your thoughts! In fact, pushing it back to PyTorch was my first response. However, there has been a specific ask to solve this via ROCm, since the same behavior (i.e. the aliasing violation) lets the test pass transparently on CUDA, yet it fails on HIP (ref SWDEV-357998). If you are fundamentally opposed to this patch, please confirm and I will see what I can do in PyTorch. Thanks.

bcahoon · 2023-02-14T19:56:27Z

Hi @nolmoonen, my understanding is that bool *val = static_cast<bool *>(temp) doesn't break the strict aliasing rule if the underlying object type is a character type. In the code segment above, it depends on the underlying object assigned to x. That said, I'm not implying that rocPRIM is the correct place to fix this issue, as I'm sure there are other users of transform_iterator that would be impacted by this change.

nolmoonen · 2023-02-15T16:48:13Z

We are currently investigating the consequences of applying the PR.

Aside from that:
@bcahoon bool* val = static_cast<bool*>(temp) breaks the strict aliasing rule, even if the underlying object type is a character type. The following would be allowed:

bool* x = // some data, true or false
uint8_t* tmp = reinterpret_cast<uint8_t*>(x);
// dereference tmp and write 0 if evals to false, 2-255 if evals to true, but do check that sizeof(bool) == sizeof(uint8_t)
// dereference x and check that it is equal to original array

https://en.cppreference.com/w/cpp/language/reinterpret_cast section "Type aliasing" states these rules.

nolmoonen · 2023-02-17T12:11:11Z

Sorry, correction: the example I gave is also not allowed. While manipulating the data through the char* is allowed, it is not allowed to write a value outside of the range of values that the bool can represent.
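One way to stay within these rules, sketched here under the assumption that an extra buffer is acceptable, is to convert the raw bytes into real bool objects before any bool is read. The function name bytes_to_bools is hypothetical. The source is only ever accessed as bytes, and every value stored into the destination is a valid bool.

```cpp
#include <cstddef>
#include <cstdint>

// Hypothetical helper: converts raw bytes into genuine bool objects.
// Each nonzero byte yields true, each zero byte yields false, so no
// out-of-range value is ever written into a bool.
inline void bytes_to_bools(const std::uint8_t* src, bool* dst, std::size_t n) {
    for (std::size_t i = 0; i < n; ++i) {
        dst[i] = (src[i] != 0);
    }
}
```

The cost is one extra pass and a second buffer, which may or may not be acceptable for large tensors.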

doctorcolinsmith · 2023-12-01T21:21:29Z

We think this may be fixed in PyTorch and therefore not require any change to rocPRIM. @stanleytsang-amd will confirm.

Special case bool dereferencing (consistent with PyTorch)

56757bb

stanleytsang-amd requested review from nolmoonen and vince-streamhpc February 9, 2023 20:03

stanleytsang-amd mentioned this pull request Feb 7, 2025

[ROCm] sorting torch.bool tensor viewed from torch.uint8 type produces incorrect results pytorch/pytorch#139972

Closed

stanleytsang-amd closed this May 13, 2025

jrbyrnes wants to merge 1 commit into ROCm:develop from jrbyrnes:PytorchBool
