Skip to content

Conversation

bddppq
Copy link
Contributor

@bddppq bddppq commented Nov 10, 2018

  • Switch to __ffsll in Embedding which is the correct intrinsic here.
  • Fix WARP_BALLOT and ffsll in LookupTable as well.

Fix comes from @iotamudelta

#12899

* Switch to __ffsll in Embedding which is the correct intrinsic here.
* Fix WARP_BALLOT and ffsll in LookupTable as well.
@bddppq bddppq added the module: rocm AMD GPU support for Pytorch label Nov 10, 2018
Copy link
Contributor

@facebook-github-bot facebook-github-bot left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

@bddppq has imported this pull request. If you are a Facebook employee, you can view this diff on Phabricator.

zdevito pushed a commit to zdevito/ATen that referenced this pull request Nov 10, 2018
Summary:
* Switch to __ffsll in Embedding which is the correct intrinsic here.
* Fix WARP_BALLOT and ffsll in LookupTable as well.

Fix comes from iotamudelta

Pull Request resolved: pytorch/pytorch#13804

Differential Revision: D13016184

Pulled By: bddppq

fbshipit-source-id: 2287a78ee9e592630336a073ad1e55a90e1f946d
@bddppq bddppq deleted the ffs_fix branch November 11, 2018 04:56
@ezyang ezyang added the merged label Jun 25, 2019
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
module: rocm AMD GPU support for Pytorch
Projects
None yet
Development

Successfully merging this pull request may close these issues.

4 participants