AMD ROCm™ Software

AMD ROCm software is AMD's Open Source stack for GPU computation.

To learn more about ROCm, check out our Documentation, Examples, and Developer Hub.

If you have questions or need help, reach out to us on GitHub.

Popular repositories

  1. ROCm (Public)

    AMD ROCm™ Software - GitHub Home

    Shell · 5.1k stars · 416 forks

  2. hip (Public)

    HIP: C++ Heterogeneous-Compute Interface for Portability

    C++ · 3.9k stars · 551 forks

  3. MIOpen (Public)

    AMD's Machine Intelligence Library

    Assembly · 1.1k stars · 243 forks

  4. tensorflow-upstream (Public)

    Forked from tensorflow/tensorflow

    TensorFlow ROCm port

    C++ · 689 stars · 96 forks

  5. HIPIFY (Public)

    HIPIFY: Convert CUDA to Portable C++ Code

    C++ · 563 stars · 84 forks

  6. ROCm-docker (Public)

    Dockerfiles for the various software layers defined in the ROCm software platform

    Shell · 453 stars · 69 forks

Repositories

Showing 10 of 310 repositories
  • apex Public Forked from NVIDIA/apex

    A PyTorch Extension: Tools for easy mixed precision and distributed training in PyTorch

    Python · 21 stars · BSD-3-Clause · 1,455 forks · 13 issues · 6 pull requests · Updated Mar 24, 2025
  • Megatron-LM Public Forked from NVIDIA/Megatron-LM

    Ongoing research training transformer models at scale

    Python · 17 stars · 2,729 forks · 0 issues · 13 pull requests · Updated Mar 24, 2025
  • hipBLASLt Public

    hipBLASLt is a library that provides general matrix-matrix operations with a flexible API and extends functionality beyond that of a traditional BLAS library

    Assembly · 83 stars · MIT · 111 forks · 11 issues · 92 pull requests · Updated Mar 24, 2025
  • triton Public Forked from triton-lang/triton

    Development repository for the Triton language and compiler

    Python · 114 stars · MIT · 1,906 forks · 5 issues · 54 pull requests · Updated Mar 24, 2025
  • aiter Public

    AI Tensor Engine for ROCm

    Python · 103 stars · MIT · 19 forks · 8 issues · 12 pull requests · Updated Mar 24, 2025
  • llvm-project Public Forked from llvm/llvm-project

    This is the AMD-maintained fork of the LLVM git repository. This repository accepts pull requests and issues related to AMD fork-specific topics (amd/*). For all other issues/PRs, please submit upstream at https://github.com/llvm/llvm-project.

    LLVM · 140 stars · 13,242 forks · 16 issues · 5 pull requests · Updated Mar 24, 2025
  • tensorflow-upstream Public Forked from tensorflow/tensorflow

    TensorFlow ROCm port

    C++ · 689 stars · Apache-2.0 · 91,078 forks · 26 issues · 74 pull requests · Updated Mar 24, 2025
  • composable_kernel Public

    Composable Kernel: Performance Portable Programming Model for Machine Learning Tensor Operators

    C++ · 368 stars · 165 forks · 30 issues (1 issue needs help) · 67 pull requests · Updated Mar 24, 2025
  • flash-attention Public Forked from Dao-AILab/flash-attention

    Fast and memory-efficient exact attention

    Python · 162 stars · BSD-3-Clause · 1,568 forks · 14 issues · 2 pull requests · Updated Mar 24, 2025
  • onnxruntime Public Forked from microsoft/onnxruntime

    ONNX Runtime: cross-platform, high performance scoring engine for ML models

    C++ · 6 stars · MIT · 3,175 forks · 0 issues · 6 pull requests · Updated Mar 24, 2025