Skip to content

Introduce Quadtrix benchmark suite with Python and C++ support (#44)#45

Merged
Eamon2009 merged 1 commit into
masterfrom
exp
May 21, 2026
Merged

Introduce Quadtrix benchmark suite with Python and C++ support (#44)#45
Eamon2009 merged 1 commit into
masterfrom
exp

Conversation

@Eamon2009
Copy link
Copy Markdown
Owner

Summary

  • Project Versioning: Sets the starting project version to 0.1.0.

  • Code Shortcuts (Macros): Creates clean shorthand terms for CUDA keywords (like wrapping device into QX_DEVICE) to make writing GPU kernels cleaner.

  • Math & Memory Utilities: Adds fast math helpers for aligning memory, rounding numbers, and calculating power-of-two boundaries quickly.

  • Memory Optimization: Forces a 128-byte memory alignment to ensure the GPU can read data as fast as possible (coalesced memory access).

  • Automatic Error Checking: Introduces safety wrappers (CUDA_CHECK, CUBLAS_CHECK, NCCL_CHECK) that instantly watch for crashes or failures in Nvidia's core hardware and math libraries, making debugging much easier.

## Summary
- Project Versioning: Sets the starting project version to 0.1.0.

- Code Shortcuts (Macros): Creates clean shorthand terms for CUDA
keywords (like wrapping __device__ into QX_DEVICE) to make writing GPU
kernels cleaner.

- Math & Memory Utilities: Adds fast math helpers for aligning memory,
rounding numbers, and calculating power-of-two boundaries quickly.

- Memory Optimization: Forces a 128-byte memory alignment to ensure the
GPU can read data as fast as possible (coalesced memory access).

- Automatic Error Checking: Introduces safety wrappers (CUDA_CHECK,
CUBLAS_CHECK, NCCL_CHECK) that instantly watch for crashes or failures
in Nvidia's core hardware and math libraries, making debugging much
easier.
@Eamon2009 Eamon2009 self-assigned this May 21, 2026
@Eamon2009 Eamon2009 merged commit 57311e8 into master May 21, 2026
6 checks passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant