@sfudally-nvidia
Contributor

Add support for DGX Spark/UMA (unified memory architecture) devices on Linux, following the note on detecting available memory, "Memory reporting differences with unified memory architecture".
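
The thread does not spell out the mechanism, so the following is only a minimal sketch of one way to detect available memory on a UMA Linux system. It assumes the workaround reads the kernel's `MemAvailable` figure from `/proc/meminfo` rather than relying on `cudaMemGetInfo`. The helper name `meminfo_field_bytes` is hypothetical and is not taken from the PR.

```cpp
#include <cstdint>
#include <istream>
#include <optional>
#include <sstream>
#include <string>

// Hypothetical helper (not from the PR): scan /proc/meminfo-style text for
// a field such as "MemAvailable:" and return its value converted to bytes.
// /proc/meminfo reports these values in kB, e.g. "MemAvailable:  123456 kB".
std::optional<std::uint64_t> meminfo_field_bytes(std::istream & in, const std::string & key) {
    std::string line;
    while (std::getline(in, line)) {
        // Skip lines that do not start with the requested key.
        if (line.compare(0, key.size(), key) != 0) {
            continue;
        }
        // Parse the number that follows the key; the trailing unit is ignored.
        std::istringstream fields(line.substr(key.size()));
        std::uint64_t kib = 0;
        if (fields >> kib) {
            return kib * 1024;
        }
        return std::nullopt; // key found, but no parsable value
    }
    return std::nullopt; // key not present
}
```

A caller could pass a `std::ifstream` opened on `/proc/meminfo` with the key `"MemAvailable:"`, and keep the value reported by `cudaMemGetInfo` whenever the helper returns `std::nullopt`. Whether a device is treated as UMA could be decided from the `integrated` field of `cudaDeviceProp`, though that choice is also an assumption here.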

Member

@ggerganov ggerganov left a comment

Can we expect cudaMemGetInfo to be fixed in the future and not need this workaround?

@github-actions github-actions bot added the labels "Nvidia GPU" (issues specific to Nvidia GPUs) and "ggml" (changes relating to the ggml tensor library for machine learning) on Nov 19, 2025
Co-authored-by: Georgi Gerganov <ggerganov@gmail.com>
@sfudally-nvidia
Contributor Author

> Can we expect cudaMemGetInfo to be fixed in the future and not need this workaround?

Yes. We have discussed this with the CUDA team, but there is no ETA yet for when a fix would be released.

@ggerganov ggerganov requested a review from slaren November 20, 2025 06:18
@ggerganov ggerganov merged commit a7784a8 into ggml-org:master Nov 20, 2025
1 check passed
TianHao324 pushed a commit to TianHao324/llama.cpp that referenced this pull request Nov 20, 2025
* DGX Spark: UMA support

* Updates from PR feedback

* More PR feedback cleanup

* Update ggml/src/ggml-cuda/ggml-cuda.cu

Co-authored-by: Georgi Gerganov <ggerganov@gmail.com>

* Remove trailing whitespace

* Update ggml/src/ggml-cuda/ggml-cuda.cu

---------

Co-authored-by: Georgi Gerganov <ggerganov@gmail.com>
TianHao324 pushed a commit to TianHao324/llama.cpp that referenced this pull request Nov 21, 2025