Skip to content
This repository has been archived by the owner on Jan 29, 2024. It is now read-only.

ROCm/ROCm-ComputeABI-Doc

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

26 Commits
 
 
 
 

Repository files navigation

WARNING: this information is obsolete

ROCm - AMDGPU Compute Application Binary Interface

Table of Contents

  • Introduction
  • Finalizer, Code Object, Executable and Loader
  • Kernel dispatch
  • Hardware registers setup
  • Initial kernel register state
  • Kernel prolog code
  • Global/Readonly/Kernarg segments
  • Scratch memory swizzling
  • Flat addressing
  • Flat scratch
  • M0 Register
  • Dynamic call stack
  • Memory model
  • Memory model overview
  • Memory operation constraints for global segment
  • Memory operation constraints for group segment
  • Memory fence constraints
  • Instruction set architecture
  • AMD Kernel Code
  • AMD Kernel Code Object amd_kernel_code_t
  • Compute shader program settings 1 amd_compute_pgm_rsrc1_t
  • Compute shader program settings 2 amd_compute_pgm_rsrc2_t
  • AMD Machine Kind amd_machine_kind_t
  • Float Round Mode amd_float_round_mode_t
  • Denorm Mode amd_float_denorm_mode_t
  • PCIe Gen3 Atomic Operations
  • AMD Queue
  • HSA AQL Queue Object hsa_queue_t
  • AMD AQL Queue Object amd_queue_t
  • Queue operations
  • Signals
  • Signals overview
  • Signal kind amd_signal_kind_t
  • Signal object amd_signal_t
  • Signal kernel machine code
  • Debugtrap
  • References