AMD ROCm™ LLMExt (ROCm-LLMExt) is an open-source software toolkit built on the ROCm platform for large language model (LLM) extensions, integrations, and performance enablement on AMD GPUs. It brings together training, post-training, inference, and orchestration components to make modern LLM stacks practical and reproducible on AMD hardware.
**Training**
- Large-scale transformer training
- Distributed parallelism (data, tensor, pipeline)
- Mixed precision and performance tuning
- Mixture-of-Experts (MoE) enablement

**Post-training**
- Reinforcement learning and post-training workflows
- Scalable experimentation
- Reproducible configurations

**Inference**
- High-throughput decoding and low-latency serving
- Optimized attention and inference operators
- Lightweight and edge-friendly inference paths

**Orchestration**
- Multi-node orchestration
- Cluster bring-up and scheduling
- Batch and online inference pipelines
ROCm-LLMExt provides reference integrations, build instructions, patches when required, benchmarks, and examples for the following projects:
- Verl: reinforcement learning and post-training workflows for LLMs
- Ray: distributed execution framework for training, inference, and serving
- FlashInfer: optimized inference operators such as attention and decoding kernels
- MegaBlocks: high-performance Mixture-of-Experts building blocks
- Stanford Megatron-LM: large-scale transformer training using Megatron-style parallelism
- Llama.cpp: lightweight and portable LLM inference for servers, desktops, edge devices, and HPC environments
Refer to the individual component pages for system requirements, installation instructions, and examples.