Guard gpu_rtx hillshade and viewshed against unbounded GPU allocations (#1308) #1310
Merged
brendancol merged 3 commits into main on Apr 29, 2026
Conversation
Fixes #1308.

`hillshade_rtx` and `viewshed_gpu` were allocating cupy device buffers sized by raster shape with no upfront memory check. A 30000x30000 raster asked for 90-108 GB of VRAM (around 100-120 B/pixel) before cupy gave up with an internal allocator error that didn't name the input shape, so users had no clean way to know what to turn down.

This PR adds `gpu_rtx/_memory.py` with `_available_gpu_memory_bytes()` and `_check_gpu_memory(func_name, h, w)`, the same shape as cost_distance (#1262) and sky_view_factor (#1299). A single 120 B/pixel budget covers both entry points (the worst case is `viewshed_gpu`). The guard is wired in after the `cupy.ndarray` type check and before `create_triangulation`, and skips silently when `cupy.cuda.runtime.memGetInfo()` isn't available.

Tests in `xrspatial/tests/test_gpu_rtx_memory.py`: 5 helper-unit tests that don't need RTX, and 4 end-to-end tests through `hillshade()`/`viewshed()` gated on `has_rtx()`. The existing 81 hillshade/viewshed tests still pass.
Contributor, Author

@copilot resolve the merge conflicts in this pull request
Co-authored-by: brendancol <433221+brendancol@users.noreply.github.com>
This was referenced Apr 30, 2026
brendancol added a commit that referenced this pull request on Apr 30, 2026
…1381) `create_triangulation()` computed `scale = max(H, W) / cupy.amax(raster.data)` without checking that the max was positive and finite. An all-zero raster gave `scale = inf` and an all-NaN raster gave `scale = nan`, both of which propagated into vertex z-coordinates and produced garbage geometry that the OptiX raytracer would silently render.

Add a guard that raises `ValueError` when `maxH` is non-finite or non-positive, before any hash or device-buffer work. This is the deferred Cat 3 finding from the gpu_rtx security audit (#1308 / PR #1310). Tests cover the all-zero, all-NaN, all-negative, and single-positive-pixel cases plus the error-message format.
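A sketch of that scale guard, assuming the `max(H, W) / amax` formula quoted in the commit message; numpy stands in for cupy here so the sketch runs on CPU, and the function name `compute_scale` and the exact message are illustrative:

```python
import numpy as np


def compute_scale(data, H, W):
    """Sketch of the guarded scale computation from create_triangulation."""
    # cupy.amax(raster.data) in the real code; np.max also propagates NaN,
    # so an all-NaN raster yields maxH = nan and an all-zero one yields 0.
    maxH = float(np.max(data))
    # Reject a non-finite or non-positive maximum before any hash or
    # device-buffer work, instead of letting inf/nan reach the vertices.
    if not np.isfinite(maxH) or maxH <= 0:
        raise ValueError(
            f"create_triangulation: raster maximum must be a finite "
            f"positive value, got {maxH!r}"
        )
    return max(H, W) / maxH
```

With this guard, the all-zero case (`maxH = 0`, which previously produced `scale = inf`) and the all-NaN case (`maxH = nan`) both fail fast with an actionable error, while a raster with even a single positive pixel still computes a finite scale.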
Summary
- `hillshade_rtx` and `viewshed_gpu` allocated cupy device buffers sized by raster shape with no memory check. A 30000x30000 raster asked for 90-108 GB of VRAM (100-120 B/pixel) before cupy surfaced an opaque allocator error.
- Adds `gpu_rtx/_memory.py` with `_available_gpu_memory_bytes()` and `_check_gpu_memory(func_name, h, w)`, same shape as "cost_distance: missing memory guard on CuPy backend" #1262 / "sky_view_factor(): numpy and cupy backends have no memory guard" #1299.
- A 120 B/pixel budget covers both entry points (worst case is `viewshed_gpu`). Wired in at the top of `hillshade_rtx` and `viewshed_gpu`.
- `xrspatial/tests/test_gpu_rtx_memory.py`: 5 helper-unit tests that don't need RTX, 4 end-to-end through `hillshade()`/`viewshed()` gated on `has_rtx()`.

Closes #1308.
Test plan
- `pytest xrspatial/tests/test_gpu_rtx_memory.py -v` (9 passed)
- `pytest xrspatial/tests/test_hillshade.py xrspatial/tests/test_viewshed.py` (81 passed, no regressions)
- Unit coverage of `_available_gpu_memory_bytes`
- `MemoryError` raised end-to-end from `hillshade(shadows=True)` and `viewshed()` when free VRAM is patched to a tiny value