Add SPIR-V matmul variants/specialization constants for various workloads and targets #44

Closed
benvanik opened this issue Oct 13, 2019 · 1 comment

Comments

@benvanik
Collaborator

Enabling extensions (such as cooperative matrix) and specialized codepaths for runtime-detectable capabilities would be a good way to evaluate the flexibility of the compiler. Most of this work depends on support for such features in MLIR's SPIR-V dialect (if we want to go that route) and on the design of the integration; however, we could do some proof-of-concept work for benchmarking independently.
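As a rough illustration of the idea (not anything from the project itself), variant selection driven by runtime-detected capabilities could look like the sketch below. Every name here (`DeviceCaps`, `MatmulVariant`, `select_matmul_variant`), the capability fields, and the tile sizes are hypothetical; in a real integration the chosen tile sizes would presumably be passed to the shader as specialization constants.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class DeviceCaps:
    # Hypothetical capability record; field names are illustrative only.
    cooperative_matrix: bool
    max_workgroup_size: int


@dataclass(frozen=True)
class MatmulVariant:
    # Hypothetical description of one compiled matmul variant.
    name: str
    tile_m: int
    tile_n: int


def select_matmul_variant(caps: DeviceCaps) -> MatmulVariant:
    """Pick a matmul variant from capabilities detected at runtime."""
    if caps.cooperative_matrix:
        # Prefer the extension-backed codepath when the device reports it.
        return MatmulVariant("coop_matrix", 16, 16)
    if caps.max_workgroup_size >= 256:
        # A 16x16 tile needs 256 invocations per workgroup.
        return MatmulVariant("tiled", 16, 16)
    # Conservative fallback for devices with small workgroups.
    return MatmulVariant("tiled", 8, 8)
```

The thresholds and tile shapes are placeholders chosen for the example; actual values would come from benchmarking each workload and target.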

@benvanik benvanik added this to Ideas in Runtime Optimization via automation Oct 13, 2019
@benvanik
Collaborator Author

This was done and is now being removed.

Runtime Optimization automation moved this from Ideas to Done Mar 19, 2020
stellaraccident pushed a commit that referenced this issue Sep 24, 2023
Also fixes some other odds and ends:

* Switches to larger runners.
* Uses the compiler's native support for memory outputs (vs direct use of memfd). This removes special casing that would be needed when building for old glibc versions (the compiler already does the right thing in these cases).
* Adds a :plugins target to build everything.
* Fixes the CUDA SDK env var to include the "_DIR" suffix.
* Installs the needed parts of the CUDA SDK.
* Forks the build_linux_packages.sh that everyone uses and builds binary plugins. Will extend this later to build Python wheels that auto-configure.