Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Minimal padding in pool allocator #298

Open
awnawab opened this issue Apr 23, 2024 · 0 comments
Open

Minimal padding in pool allocator #298

awnawab opened this issue Apr 23, 2024 · 0 comments
Labels
enhancement New feature or request

Comments

@awnawab
Copy link
Contributor

awnawab commented Apr 23, 2024

Currently, to protect from misaligned addresses on device, every allocation is padded to 8 bytes in the pool allocator. This would mitigate a lot of the device memory bandwidth benefits from running in single precision. A potential fix could be to only pad those allocations that are not multiples of nproma, as these are for all intents and purposes guaranteed to be multiples of 8.

@awnawab awnawab added the enhancement New feature or request label Apr 23, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
enhancement New feature or request
Projects
None yet
Development

No branches or pull requests

1 participant