Skip to content

AMD ROCm™ Software

AMD ROCm software is AMD's Open Source stack for GPU computation.

To learn more about ROCm, check out our Documentation, Examples, and Developer Hub.

If you have questions or need help, reach out to us on GitHub.

Popular repositories Loading

  1. ROCm ROCm Public

    AMD ROCm™ Software - GitHub Home

    Shell 5.2k 426

  2. hip hip Public

    HIP: C++ Heterogeneous-Compute Interface for Portability

    C++ 4k 551

  3. MIOpen MIOpen Public

    AMD's Machine Intelligence Library

    Assembly 1.1k 247

  4. tensorflow-upstream tensorflow-upstream Public

    Forked from tensorflow/tensorflow

    TensorFlow ROCm port

    C++ 690 99

  5. HIPIFY HIPIFY Public

    HIPIFY: Convert CUDA to Portable C++ Code

    C++ 571 85

  6. ROCm-docker ROCm-docker Public

    Dockerfiles for the various software layers defined in the ROCm software platform

    Shell 459 69

Repositories

Showing 10 of 313 repositories
  • rocprofiler-systems Public

    ROCm Systems Profiler

    C++ 17 MIT 13 3 11 Updated Apr 18, 2025
  • llvm-project Public Forked from llvm/llvm-project

    This is the AMD-maintained fork of the LLVM git repository. This repository accepts pull requests and issues related to AMD fork-specific topics (amd/*). For all other issues/PRs, please submit upstream at https://github.com/llvm/llvm-project.

    LLVM 143 13,485 17 5 Updated Apr 18, 2025
  • composable_kernel Public

    Composable Kernel: Performance Portable Programming Model for Machine Learning Tensor Operators

    C++ 379 174 35 (1 issue needs help) 77 Updated Apr 18, 2025
  • hipBLASLt Public

    hipBLASLt is a library that provides general matrix-matrix operations with a flexible API and extends functionalities beyond a traditional BLAS library

    Assembly 90 MIT 117 15 86 Updated Apr 18, 2025
  • DeepSpeed Public Forked from deepspeedai/DeepSpeed

    DeepSpeed is a deep learning optimization library that makes distributed training easy, efficient, and effective.

    Python 6 Apache-2.0 4,507 6 2 Updated Apr 18, 2025
  • amdsmi Public

    AMD SMI

    C++ 60 MIT 34 8 9 Updated Apr 18, 2025
  • aiter Public

    AI Tensor Engine for ROCm

    Python 166 MIT 28 13 18 Updated Apr 18, 2025
  • rocFFT Public

    Next generation FFT implementation for ROCm

    C++ 191 91 2 4 Updated Apr 18, 2025
  • vllm Public Forked from vllm-project/vllm

    A high-throughput and memory-efficient inference and serving engine for LLMs

    Python 74 Apache-2.0 7,018 10 27 Updated Apr 18, 2025
  • rccl Public

    ROCm Communication Collectives Library (RCCL)

    C++ 317 149 11 35 Updated Apr 18, 2025