GPU runtime optimisation #896

Open: antoniupop wants to merge 9 commits into main from antoniu/gpu_optimization_wip

antoniupop
Contributor

This PR reduces the number of copy operations by transferring ownership of data buffers to the runtime system, and by merging the outputs of SDFG compute graphs in place.
It also fixes some bugs and minor issues.
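
To illustrate the ownership-transfer idea in general terms, here is a minimal C++ sketch. It is not taken from this PR's diff: `Stream`, `put_copy`, `put_owned` and `transfer_keeps_allocation` are hypothetical names, and the real runtime interface may look quite different. The sketch only shows why handing a buffer over to the consumer avoids the element-wise copy that a copying put would need.

```cpp
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <utility>
#include <vector>

// Hypothetical stand-in for a runtime stream; NOT the project's actual API.
class Stream {
public:
  // Copying put: the runtime duplicates the caller's buffer.
  void put_copy(const std::vector<uint64_t> &data) { slot_ = data; }

  // Owning put: the caller hands the buffer over, so no elements are copied.
  void put_owned(std::vector<uint64_t> &&data) { slot_ = std::move(data); }

  const uint64_t *data() const { return slot_.data(); }
  std::size_t size() const { return slot_.size(); }

private:
  std::vector<uint64_t> slot_;
};

// Returns true if the stream ends up holding the very same allocation
// that the producer filled, i.e. the transfer did not copy the payload.
bool transfer_keeps_allocation(std::size_t n) {
  assert(n > 0);
  std::vector<uint64_t> buf(n, 42);
  const uint64_t *before = buf.data();
  Stream s;
  s.put_owned(std::move(buf)); // buf must not be used after this point
  return s.data() == before;
}
```

After an owning put the producer must treat its buffer as gone, which is the usual price of this design: the runtime can keep the data alive for asynchronous work, but the caller can no longer reuse or free it.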

@cla-bot cla-bot bot added the cla-signed label Jun 17, 2024
@antoniupop
Contributor Author

@slab-ci compiler-cpu-build-distributed

@antoniupop
Contributor Author

@slab-ci compiler-cpu-build

@antoniupop antoniupop force-pushed the antoniu/gpu_optimization_wip branch 3 times, most recently from 93575ee to 13dc8d8 Compare June 24, 2024 13:31
@antoniupop antoniupop force-pushed the antoniu/gpu_optimization_wip branch 5 times, most recently from 1a6375b to 6b6cf2a Compare June 28, 2024 09:03
@antoniupop antoniupop marked this pull request as ready for review June 28, 2024 09:04
@antoniupop antoniupop force-pushed the antoniu/gpu_optimization_wip branch from 6b6cf2a to c3a776a Compare June 29, 2024 07:00
Member

@BourgerieQuentin BourgerieQuentin left a comment

Minor comments only. The one about the search space can be addressed in another PR if needed, so open an issue if you want to merge without fixing it.

…ns on multiple keys in KS/BS operations for GPU.
…s to the GPU backend when already lowering through the SDFG dialect. Otherwise this forces GPU offload even for very fine granularity operations, bypassing SDFG and Batching.
…Add environment variable CONCRETELANG_TIMING_ENABLED as prerequisite to activation of timing logs.
…he SDFG runtime on Put operations. This avoids an extra copy of the data for use in asynchronous operations.
… by default and use runtime environment variable to activate as required.
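
As a rough illustration of gating timing logs on an environment variable, the following C++ sketch reads `CONCRETELANG_TIMING_ENABLED` once and caches the result. Only the variable name comes from the commit message above. The helper names `timing_enabled_from` and `timing_enabled`, and the rule that any value other than an empty string or `0` enables timing, are assumptions made for the example and may not match what the PR implements.

```cpp
#include <cstdlib>
#include <string>

// Assumption: timing counts as enabled when the variable is set to anything
// other than "" or "0". The accepted values in the real runtime may differ.
bool timing_enabled_from(const char *value) {
  if (value == nullptr)
    return false; // variable not set
  const std::string v(value);
  return !v.empty() && v != "0";
}

// Reads the environment once, then reuses the cached answer on later calls.
bool timing_enabled() {
  static const bool enabled =
      timing_enabled_from(std::getenv("CONCRETELANG_TIMING_ENABLED"));
  return enabled;
}
```

Caching the lookup keeps the check cheap on hot paths, at the cost of ignoring changes made to the variable after the first call.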
@antoniupop antoniupop force-pushed the antoniu/gpu_optimization_wip branch from 023db39 to 2c244df Compare July 5, 2024 09:46
@antoniupop antoniupop force-pushed the antoniu/gpu_optimization_wip branch from 2c244df to 3346127 Compare July 5, 2024 09:54

2 participants