Skip to content

Conversation

@rohany
Copy link
Contributor

@rohany rohany commented Mar 16, 2021

Fixes #422.

This commit ensures that the allocation clearing logic is applied to
the CUDA backend as well. The windowing test caught this because TACO
was automatically parallelizing the loop onto the GPU.

Fixes tensor-compiler#422.

This commit ensures that the allocation clearing logic is applied to
the CUDA backend as well. The windowing test caught this because TACO
was automatically parallelizing the loop onto the GPU.
@rohany rohany force-pushed the winfowing-cuda-bug branch from 432c25d to 557294e Compare March 16, 2021 23:11
@rohany
Copy link
Contributor Author

rohany commented Mar 16, 2021

Seems like one of these tests is failing because a dependency being apt-get installed is not available.

@Infinoid
Copy link
Contributor

Yep, #424 fixes the apt-get issue.

@Infinoid
Copy link
Contributor

I tested this locally. It fixes windowing/slicedOutput.windowing/0 and does not break anything new. It also fixes a more recently introduced failure in windowing.lhsIndexSet.

Thanks!

(There are other failures in the CUDA build right now, but I will report those separately.)

@stephenchouca stephenchouca merged commit c9bd10d into tensor-compiler:master Mar 17, 2021
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

windowing test failure in cuda builds

3 participants