A high-throughput and memory-efficient inference and serving engine for LLMs (a minimal usage sketch follows this list)
Open deep learning compiler stack for CPU, GPU, and specialized accelerators
Burn is a next-generation deep learning framework that doesn't compromise on flexibility, efficiency, or portability.
Simple, scalable AI model deployment on GPU clusters
Stable Diffusion web UI
A deep learning package for many-body potential energy representation and molecular dynamics
Large-scale LLM inference engine
stdgpu: Efficient STL-like Data Structures on the GPU
Main repository for QMCPACK, an open-source, production-level many-body ab initio Quantum Monte Carlo code for computing the electronic structure of atoms, molecules, and solids, with fully performance-portable GPU support
Agenium Scale vectorization library for CPUs and GPUs
Kubernetes (k8s) device plugin that enables registration of AMD GPUs with a container cluster
AMD GPU (ROCm) programming in Julia
Self-host the powerful Chatterbox TTS model. This server offers a user-friendly web UI, flexible API endpoints (including OpenAI-compatible ones; see the client sketch after this list), predefined voices, voice cloning, and audiobook-scale text processing. Runs accelerated on NVIDIA (CUDA) and AMD (ROCm) GPUs, or on CPU.
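The first entry above is vLLM's tagline. For orientation, here is a minimal offline-inference sketch using vLLM's Python API; it assumes a working CUDA or ROCm build of vLLM is installed, and the model name is just a small example checkpoint, not a recommendation:

```python
from vllm import LLM, SamplingParams

# Load a model; "facebook/opt-125m" is only a small placeholder checkpoint.
llm = LLM(model="facebook/opt-125m")

# Sampling settings for generation.
params = SamplingParams(temperature=0.8, top_p=0.95, max_tokens=64)

# Batched generation; vLLM schedules and batches the requests internally.
outputs = llm.generate(["ROCm is", "GPU inference engines are"], params)

for out in outputs:
    print(out.prompt, "->", out.outputs[0].text)
```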
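Several of the servers listed above advertise OpenAI-compatible HTTP endpoints (vLLM's serving mode and the Chatterbox TTS server, per their descriptions), so the stock openai Python client can talk to them. A hedged sketch against a hypothetical local deployment; the base URL, port, API key, and model name are all assumptions to replace with your server's actual values:

```python
from openai import OpenAI

# Point the standard OpenAI client at a locally served endpoint.
# The URL, port, and api_key below are placeholders; check your
# server's own docs for its real address and auth requirements.
client = OpenAI(base_url="http://localhost:8000/v1", api_key="EMPTY")

response = client.completions.create(
    model="facebook/opt-125m",  # placeholder model name
    prompt="ROCm is",
    max_tokens=32,
)
print(response.choices[0].text)
```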