Conversation

mikaylagawarecki
Contributor

@mikaylagawarecki commented May 22, 2025

Added a `torch.hash_tensor` reduction function with a `mode` argument that defaults to reduction with xor.

  • The hash is always uint64.
  • Integers are cast to uint64 before the xor_sum reduction.
  • Floats are upcast to double and then bitcast to uint64 before the xor_sum reduction.

We don't provide an ordering-aware hash function yet because we don't have an easy way to write a fast CUDA kernel for it.
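A minimal usage sketch based on the description above (it assumes a PyTorch build that includes this PR; the final assertion follows from xor being commutative):

```python
import torch

x = torch.randn(1024)

h = torch.hash_tensor(x)  # scalar hash of the whole tensor
print(h.dtype)            # torch.uint64, per the description above

# xor is commutative, so the default mode is order-insensitive:
# any permutation of the same elements hashes to the same value.
assert torch.hash_tensor(x.flip(0)) == torch.hash_tensor(x)
```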

Stack from ghstack (oldest at bottom):


pytorch-bot bot commented May 22, 2025

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/154149

Note: Links to docs will display an error until the docs builds have been completed.

⏳ 1 Pending, 1 Unrelated Failure

As of commit 5c8132c with merge base f168cf4:

UNSTABLE - The following job is marked as unstable, possibly due to flakiness on trunk:

This comment was automatically generated by Dr. CI and updates every 15 minutes.

@pytorch-bot added the module: cpu label May 22, 2025
mikaylagawarecki added a commit that referenced this pull request May 22, 2025
ghstack-source-id: 00cf75d
Pull Request resolved: #154149
@mikaylagawarecki added the module: python frontend and ciflow/trunk labels and removed the module: cpu label May 22, 2025
Contributor

Attention! native_functions.yaml was changed

If you are adding a new function or defaulted argument to native_functions.yaml, you cannot use it from pre-existing Python frontend code until our FC window passes (two weeks). Split your PR into two PRs, one which adds the new C++ functionality, and one that makes use of it from Python, and land them two weeks apart. See https://github.com/pytorch/pytorch/wiki/PyTorch's-Python-Frontend-Backward-and-Forward-Compatibility-Policy#forwards-compatibility-fc for more info.


Caused by:

@pytorch-bot added the ciflow/inductor, module: cpu, module: dynamo, and module: inductor labels May 27, 2025
mikaylagawarecki added a commit that referenced this pull request May 27, 2025
ghstack-source-id: 0da1f03
Pull Request resolved: #154149
@mikaylagawarecki changed the title from "Add basic xor_sum op" to "Add basic torch.hash_tensor op" Jun 26, 2025
mikaylagawarecki added a commit that referenced this pull request Jun 26, 2025
ghstack-source-id: 12a1271
Pull Request resolved: #154149
@mikaylagawarecki added the release notes: python_frontend label Jun 26, 2025
@mikaylagawarecki requested a review from albanD July 15, 2025
Added a `torch.hash_tensor` reduction function with a `mode` argument that defaults to multiply/shift via range, then reduction with xor.

The tensor is always viewed as int64 (applying padding as necessary) before the reduction, so the result is always int64.

I chose int64 rather than uint64 because, e.g., `a * x + b` is not implemented for uint64 in torch on CUDA.
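Roughly what that default does, sketched in plain PyTorch (the mixing constants and exact scheme below are made up for illustration, not taken from the kernel):

```python
import torch
from functools import reduce
from operator import xor

def mix_then_xor(t: torch.Tensor) -> int:
    # View the element bits as int64 (floats via double); padding is omitted here.
    bits = t.to(torch.float64).view(torch.int64) if t.is_floating_point() else t.to(torch.int64)
    # Affine mix a * x + b per element; int64 arithmetic wraps mod 2**64.
    a, b = 6364136223846793005, 1442695040888963407  # hypothetical constants
    mixed = bits * a + b
    # Reduce with xor; the mask only presents the result as a uint64 value.
    return reduce(xor, mixed.flatten().tolist(), 0) & 0xFFFFFFFFFFFFFFFF
```

Using int64 for the bits matches the note above: the `a * x + b` step has no uint64 implementation on CUDA.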




mikaylagawarecki added a commit that referenced this pull request Jul 21, 2025
ghstack-source-id: 1ed6070
Pull Request resolved: #154149
mikaylagawarecki added a commit that referenced this pull request Jul 22, 2025
ghstack-source-id: 25eda40
Pull Request resolved: #154149
mikaylagawarecki added a commit that referenced this pull request Jul 22, 2025
ghstack-source-id: 4385e53
Pull Request resolved: #154149
mikaylagawarecki added a commit that referenced this pull request Jul 22, 2025
ghstack-source-id: 8da7498
Pull Request resolved: #154149
Added a `torch.hash_tensor` reduction function with a `mode` argument that defaults to reduction with xor.

- The hash is always uint64.
- Integers are cast to uint64 before the xor_sum reduction.
- Floats are upcast to double and then bitcast to uint64 before the xor_sum reduction.
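A reference model of those three rules in plain PyTorch (a sketch only; it models the uint64 bits with int64, which is better supported, and masks at the end):

```python
import torch
from functools import reduce
from operator import xor

def ref_hash_tensor(t: torch.Tensor) -> int:
    if t.is_floating_point():
        # Upcast to double, then reinterpret the IEEE-754 bits.
        bits = t.to(torch.float64).view(torch.int64)
    else:
        # Stands in for the cast to uint64.
        bits = t.to(torch.int64)
    # xor_sum reduction over all elements, presented as a uint64 value.
    return reduce(xor, bits.flatten().tolist(), 0) & 0xFFFFFFFFFFFFFFFF
```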





mikaylagawarecki added a commit that referenced this pull request Jul 22, 2025
ghstack-source-id: 6669fda
Pull Request resolved: #154149
mikaylagawarecki added a commit that referenced this pull request Jul 22, 2025
ghstack-source-id: 7726e76
Pull Request resolved: #154149
Comment on lines +184 to +185
// return a double, otherwise uint64_t will be cast to double
// when accumulating and the result will be wrong
Collaborator
Oh, why?

Contributor Author

@mikaylagawarecki commented Jul 23, 2025

Let's say we have inputs:

a = 3.14159, a_bits = 4614256650576692846
b = 1.61803, b_bits = 4609965778477721196
a_bits ^ b_bits = 9219082337818812418

If we returned the result as int64_t, the next time it is used as an input it would be value-converted to the double nearest 9219082337818812418, whose bit pattern is 4890905006165143848, which would corrupt the xor reduction here.
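The corruption is easy to reproduce in plain Python, where `struct` does the bit reinterpretation (the numbers below are the ones from the example above):

```python
import struct

def double_bits(x: float) -> int:
    # Reinterpret a double's raw IEEE-754 bits as an unsigned 64-bit int.
    return struct.unpack("<Q", struct.pack("<d", x))[0]

a_bits = double_bits(3.14159)  # 4614256650576692846
b_bits = double_bits(1.61803)  # 4609965778477721196
acc = a_bits ^ b_bits          # 9219082337818812418

# Round-tripping the accumulator through a double value-converts (rounds)
# the integer, so its bit pattern no longer matches the xor result.
assert double_bits(float(acc)) != acc  # 4890905006165143848 != 9219082337818812418
```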

mikaylagawarecki added a commit that referenced this pull request Jul 23, 2025
ghstack-source-id: 906195d
Pull Request resolved: #154149
mikaylagawarecki added a commit that referenced this pull request Jul 23, 2025
ghstack-source-id: fadf8f5
Pull Request resolved: #154149
mikaylagawarecki added a commit that referenced this pull request Jul 23, 2025
ghstack-source-id: 9ac76da
Pull Request resolved: #154149
Collaborator

@albanD left a comment

Sounds good to me!

@mikaylagawarecki
Contributor Author

@pytorchbot merge

@pytorchmergebot
Collaborator

Merge started

Your change will be merged once all checks pass (ETA 0-4 Hours).

Learn more about merging in the wiki.

Questions? Feedback? Please reach out to the PyTorch DevX Team

Advanced Debugging
Check the merge workflow status here.

yangw-dev pushed a commit that referenced this pull request Aug 1, 2025

Pull Request resolved: #154149
Approved by: https://github.com/albanD
@github-actions deleted the gh/mikaylagawarecki/313/head branch August 24, 2025 02:19