Implement metal kernel for GPUs on Mac #44

casper-hansen · 2023-09-11T09:10:59Z

MIT released their TinyChatEngine which includes kernels for every kind of platform. To ensure wide availability, we should integrate the kernels for Metal which are kernels for running quantized on MacOS.

Requirements:

Run the Metal kernel through Python (maybe run on the fly?)
Automatically switch to the Metal kernel if torch.backends.mps.is_available(), otherwise use CUDA.
Figure out: How to support either GEMV/GEMM format with Metal

Kernel:
https://github.com/mit-han-lab/TinyChatEngine/blob/main/kernels/metal/kernel/op.metal#L10

The text was updated successfully, but these errors were encountered:

casper-hansen added the help wanted Extra attention is needed label Sep 11, 2023

casper-hansen mentioned this issue Sep 11, 2023

📌 AutoAWQ Roadmap #32

Closed

30 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Implement metal kernel for GPUs on Mac #44

Implement metal kernel for GPUs on Mac #44

casper-hansen commented Sep 11, 2023 •

edited

Loading

Implement metal kernel for GPUs on Mac #44

Implement metal kernel for GPUs on Mac #44

Comments

casper-hansen commented Sep 11, 2023 • edited Loading

casper-hansen commented Sep 11, 2023 •

edited

Loading