[BUGFIX] Excessive number of copy kernel calls #795

Merged
oleksandr-pavlyk merged 1 commit into master from avoid-extra-copy-call on Mar 21, 2022

Conversation

@oleksandr-pavlyk
Contributor

@oleksandr-pavlyk commented Mar 21, 2022

Running ze_tracer from the pti-gpu repo showed two copy kernel calls for every setitem call, twice as many as expected.

A missing return statement was the culprit.
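
For readers unfamiliar with this class of bug, here is a minimal sketch of how a missing return can double the number of copy calls. It is not the dpctl code: `CopyCounter`, `setitem_buggy`, and `setitem_fixed` are hypothetical names, the scalar fast path is an assumed structure, and the counter merely stands in for what a tracer would report.

```python
# Hypothetical sketch; none of these names come from dpctl.
class CopyCounter:
    """Stands in for a tracer: counts how many copy operations are issued."""

    def __init__(self):
        self.calls = 0

    def copy(self, dst, key, value):
        self.calls += 1
        dst[key] = value


def setitem_buggy(counter, dst, key, value):
    if isinstance(value, (int, float)):
        counter.copy(dst, key, value)  # assumed fast path for scalars
        # BUG: no `return` here, so execution falls through
    counter.copy(dst, key, value)      # general path runs as well


def setitem_fixed(counter, dst, key, value):
    if isinstance(value, (int, float)):
        counter.copy(dst, key, value)
        return                         # the fix: stop after the fast path
    counter.copy(dst, key, value)


buggy, fixed = CopyCounter(), CopyCounter()
data_buggy, data_fixed = [0, 0, 0], [0, 0, 0]
setitem_buggy(buggy, data_buggy, 1, 7)
setitem_fixed(fixed, data_fixed, 1, 7)
print(buggy.calls, fixed.calls)  # prints "2 1"
```

Both variants leave the data in the same state, which is why a bug like this tends to surface only under a profiler or tracer rather than in functional tests.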

@oleksandr-pavlyk changed the title from "onetrace revealed twice as many copy kernel calls as expected" to "[BUGFIX] Excessive number of copy kernel calls" on Mar 21, 2022
@coveralls
Collaborator

Coverage Status

Coverage decreased (-0.01%) to 81.87% when pulling 60e06dd on avoid-extra-copy-call into 8e06a6c on master.

@oleksandr-pavlyk merged commit c274840 into master on Mar 21, 2022
@oleksandr-pavlyk deleted the avoid-extra-copy-call branch on March 21, 2022 19:39
@github-actions

Deleted rendered PR docs from intelpython.github.com/dpctl, latest should be updated shortly. 🤞
