SOTA low-bit LLM quantization (INT8/FP8/INT4/FP4/NF4) & sparsity; leading model compression techniques on TensorFlow, PyTorch, and ONNX Runtime
Must-read research papers, plus links to tools and datasets, related to using machine learning for compiler and systems optimisation
Kernel Tuner
Machine Learning Framework for Operating Systems - Brings ML to Linux kernel
Stretching GPU performance for GEMMs and tensor contractions.
CLTune: An automatic OpenCL & CUDA kernel tuner
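Kernel tuners such as CLTune and Kernel Tuner search a space of tunable parameters (block sizes, tiling factors, unroll depths) and time each configuration to find the fastest one. A minimal, generic sketch of that search loop is shown below; the function names and grid-search strategy are illustrative assumptions, not the API of any of the tools listed here.

```python
import itertools
import time

def timeit_once(run_kernel, params):
    # Time a single kernel execution with the given parameters.
    start = time.perf_counter()
    run_kernel(params)
    return time.perf_counter() - start

def autotune(run_kernel, param_space, repeats=3):
    """Exhaustively search a parameter grid, keeping the fastest config.

    `run_kernel(params)` is assumed to launch the kernel once with the
    given parameters. Real tuners also prune invalid configurations and
    offer smarter search strategies than brute-force enumeration.
    """
    best_params, best_time = None, float("inf")
    keys = list(param_space)
    for values in itertools.product(*(param_space[k] for k in keys)):
        params = dict(zip(keys, values))
        # Take the minimum over several runs to reduce timing noise.
        elapsed = min(timeit_once(run_kernel, params) for _ in range(repeats))
        if elapsed < best_time:
            best_params, best_time = params, elapsed
    return best_params, best_time
```

For example, `autotune(my_kernel, {"block": [32, 64, 128], "tile": [2, 4]})` would time all six configurations and return the fastest.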
Phoebe
Benchmark scripts for TVM
eBPF profiler for the JVM
Collective Knowledge crowd-tuning extension that lets users crowdsource experiments (via portable Collective Knowledge workflows) such as performance benchmarking, auto-tuning, and machine learning across diverse Linux, Windows, macOS, and Android platforms provided by volunteers. Includes a demo of DNN crowd-benchmarking and crowd-tuning.
A Generic Distributed Auto-Tuning Infrastructure
A GPU benchmark suite for autotuners
Backoff uses an exponential backoff algorithm to space out retries, with optional auto-tuning functionality.
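Exponential backoff doubles the wait between successive retries, usually with a cap and random jitter to avoid synchronized retry storms. A minimal sketch of the idea (the function names and defaults are hypothetical, not this library's API):

```python
import random
import time

def retry_with_backoff(operation, max_retries=5, base_delay=0.1, cap=10.0):
    """Retry `operation` with capped exponential backoff plus full jitter.

    The delay before attempt k is drawn uniformly from
    [0, min(cap, base_delay * 2**k)], a common jitter scheme.
    """
    for attempt in range(max_retries):
        try:
            return operation()
        except Exception:
            if attempt == max_retries - 1:
                raise  # out of retries: surface the last error
            delay = min(cap, base_delay * (2 ** attempt))
            time.sleep(random.uniform(0, delay))
```

An auto-tuning variant could adjust `base_delay` at runtime based on the observed failure rate.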
Autotuner for Spark applications
A pattern-based algorithmic auto-tuner for graph processing on GPUs
MarGotAspect: an AspectC++ code generator for the mARGOt framework