@antonwolfy antonwolfy commented Mar 5, 2023

This PR removes the excess copy of shapes and strides to shared memory for elementwise and bitwise dpnp functions.
A USM host allocator is used to transfer the stride data from host to device memory.
The shape data does not need to be moved into device memory at all.
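
As a rough illustration of why strides alone can be enough on the device side, the sketch below maps a flat result index to an element offset in a strided input using only stride arrays. It is plain host-side C++ rather than SYCL, and the helper names `contiguous_strides` and `input_offset` are hypothetical, not taken from dpnp; the actual kernels may compute indices differently.

```cpp
#include <cassert>
#include <cstddef>
#include <vector>

// Hypothetical helper: row-major (C-contiguous) strides of the result
// shape, counted in elements. This is the only place the shape is read,
// and it runs on the host.
std::vector<std::size_t> contiguous_strides(const std::vector<std::size_t>& shape) {
    std::vector<std::size_t> strides(shape.size(), 1);
    for (std::size_t i = shape.size(); i-- > 1;) {
        strides[i - 1] = strides[i] * shape[i];
    }
    return strides;
}

// Hypothetical helper: converts a flat index into the result array to an
// element offset in a strided input. Only the two stride arrays are needed;
// a broadcast dimension can be expressed as an input stride of 0.
std::size_t input_offset(std::size_t flat_id,
                         const std::vector<std::size_t>& result_strides,
                         const std::vector<std::size_t>& input_strides) {
    assert(result_strides.size() == input_strides.size());
    std::size_t offset = 0;
    for (std::size_t i = 0; i < result_strides.size(); ++i) {
        const std::size_t coord = flat_id / result_strides[i]; // index along dim i
        flat_id %= result_strides[i];                          // remainder for later dims
        offset += coord * input_strides[i];
    }
    return offset;
}
```

For a 2x3 result read from a transposed view with element strides `{1, 2}`, flat index 4 maps to input offset 3. The shape is consulted only to derive the result strides, which is consistent with keeping it on the host.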

  • Have you provided a meaningful PR description?
  • Have you added a test or reproducer, or referred to an issue with a reproducer?
  • Have you tested your changes locally for CPU and GPU devices?
  • Have you made sure that new changes do not introduce compiler warnings?
  • If this PR is a work in progress, are you filing the PR as a draft?

@antonwolfy antonwolfy self-assigned this Mar 5, 2023
@antonwolfy antonwolfy merged commit cd24184 into IntelPython:master Mar 5, 2023
@antonwolfy antonwolfy deleted the memcpy_in_elem_bitwise branch March 5, 2023 13:19