AMD ROCm™ Software

AMD ROCm software is AMD's Open Source stack for GPU computation.

To learn more about ROCm, check out our Documentation, Examples, and Developer Hub.

If you have questions or need help, reach out to us on GitHub.

Popular repositories

  1. ROCm Public

    AMD ROCm™ Software - GitHub Home

    Shell · 5.1k stars · 415 forks

  2. hip Public

    HIP: C++ Heterogeneous-Compute Interface for Portability

    C++ · 3.9k stars · 551 forks

  3. MIOpen Public

    AMD's Machine Intelligence Library

    Assembly · 1.1k stars · 243 forks

  4. tensorflow-upstream Public

    Forked from tensorflow/tensorflow

    TensorFlow ROCm port

    C++ · 689 stars · 96 forks

  5. HIPIFY Public

    HIPIFY: Convert CUDA to Portable C++ Code

    C++ · 563 stars · 84 forks

  6. ROCm-docker Public

    Dockerfiles for the various software layers defined in the ROCm software platform

    Shell · 453 stars · 69 forks

Repositories

Showing 10 of 310 repositories
  • MIOpen Public

    AMD's Machine Intelligence Library

    Assembly · 1,133 stars · 243 forks · 248 issues (4 issues need help) · 112 pull requests · Updated Mar 22, 2025
  • llvm-project Public Forked from llvm/llvm-project

    This is the AMD-maintained fork of the LLVM git repository. This repository accepts pull requests and issues related to AMD fork-specific topics (amd/*). For all other issues/PRs, please submit upstream at https://github.com/llvm/llvm-project.

    LLVM · 140 stars · 13,233 forks · 16 issues · 5 pull requests · Updated Mar 22, 2025
  • hipBLASLt Public

    hipBLASLt is a library that provides general matrix-matrix operations with a flexible API and extends functionalities beyond a traditional BLAS library

    Assembly · 83 stars · MIT license · 111 forks · 11 issues · 93 pull requests · Updated Mar 22, 2025
  • composable_kernel Public

    Composable Kernel: Performance Portable Programming Model for Machine Learning Tensor Operators

    C++ · 366 stars · 165 forks · 30 issues (1 issue needs help) · 69 pull requests · Updated Mar 22, 2025
  • rocm-examples Public

    A collection of examples for the ROCm software stack

    C++ · 193 stars · MIT license · 56 forks · 4 issues · 3 pull requests · Updated Mar 22, 2025
  • flash-attention Public Forked from Dao-AILab/flash-attention

    Fast and memory-efficient exact attention

    Python · 162 stars · BSD-3-Clause license · 1,562 forks · 14 issues · 2 pull requests · Updated Mar 22, 2025
  • ROCgdb Public

    This is ROCgdb, the ROCm source-level debugger for Linux, based on GDB, the GNU source-level debugger.

    C · 54 stars · GPL-2.0 license · 11 forks · 4 issues · 1 pull request · Updated Mar 22, 2025
  • aomp Public

    AOMP is an open source Clang/LLVM based compiler with added support for the OpenMP® API on Radeon™ GPUs. Use this repository for releases, issues, documentation, packaging, and examples.

    Fortran · 211 stars · Apache-2.0 license · 50 forks · 2 issues · 47 pull requests · Updated Mar 22, 2025
  • aiter Public

    AI Tensor Engine for ROCm

    Python · 85 stars · MIT license · 18 forks · 9 issues · 16 pull requests · Updated Mar 22, 2025
  • rocAL Public

    AMD rocAL is designed to efficiently decode and process images and videos from a variety of storage formats and to modify them through a user-programmable processing graph.

    C++ · 15 stars · MIT license · 16 forks · 4 issues · 3 pull requests · Updated Mar 22, 2025