A high-throughput and memory-efficient inference and serving engine for LLMs
-
Updated
Jun 2, 2024 - Python
A high-throughput and memory-efficient inference and serving engine for LLMs
AWS CloudFormation templates to provision Linux or Windows EC2 instances with GUI running NICE DCV remote display server. Includes option to install GPU drivers
Two-moment AMR radiation hydrodynamics (with self-gravity, particles, and chemistry) on CPUs/GPUs for astrophysics
Open deep learning compiler stack for cpu, gpu and specialized accelerators
A deep learning package for many-body potential energy representation and molecular dynamics
MIVisionX toolkit is a set of comprehensive computer vision and machine intelligence libraries, utilities, and applications bundled into a single toolkit. AMD MIVisionX also delivers a highly optimized open-source implementation of the Khronos OpenVX™ and OpenVX™ Extensions.
hipBLASLt is a library that provides general matrix-matrix operations with a flexible API and extends functionalities beyond a traditional BLAS library
Setup for new Ubuntu or macOS
DLA-Future
Distributed Communication-Optimal Matrix-Matrix Multiplication Algorithm
Domain specific library for electronic structure calculations
Add a description, image, and links to the rocm topic page so that developers can more easily learn about it.
To associate your repository with the rocm topic, visit your repo's landing page and select "manage topics."