Kernel Tuner
-
Updated
Jun 9, 2024 - Python
Kernel Tuner
Stretching GPU performance for GEMMs and tensor contractions.
SOTA low-bit LLM quantization (INT8/FP8/INT4/FP4/NF4) & sparsity; leading model compression techniques on TensorFlow, PyTorch, and ONNX Runtime
This software package accompanies the paper "A Methodology for Comparing Auto-Tuning Optimization Algorithms" (https://doi.org/10.1016/j.future.2024.05.021), making the guidelines in the methodology easy to apply.
Must read research papers and links to tools and datasets that are related to using machine learning for compilers and systems optimisation
A GPU benchmark suite for autotuners
Autotuner for Spark applications
A self-hosted language learning website
CLTune: An automatic OpenCL & CUDA kernel tuner
Disabling auto-tuning on Windows will limit your internet speeds. This script checks to see if your auto-tuning level is enabled or not on Windows 10 and prompts if you want to enable it to fix your internet download speeds.
Benchmark scripts for TVM
Machine Learning Framework for Operating Systems - Brings ML to Linux kernel
A Generic Distributed Auto-Tuning Infrastructure
Collective Knowledge crowd-tuning extension to let users crowdsource their experiments (using portable Collective Knowledge workflows) such as performance benchmarking, auto tuning and machine learning across diverse platforms with Linux, Windows, MacOS and Android provided by volunteers. Demo of DNN crowd-benchmarking and crowd-tuning:
Phoebe
ebpf profiler for jvm
Four assignments from two LabVIEW Modules. LabVIEW Visual Programming and LabVIEW App Development
Add a description, image, and links to the auto-tuning topic page so that developers can more easily learn about it.
To associate your repository with the auto-tuning topic, visit your repo's landing page and select "manage topics."