Bug Description
We're seeing a consistent CUDA access violation when running the VBD integrator with self-collisions enabled and CUDA graph capture turned on. To reproduce, switch to the VBD integrator in example_cloth.py, enable self-contact resolution, and enable graph capture. The crash appears to happen during the first attempt to execute the previously captured graph. The call chain is array_fill_device, which in turn calls alloc_device, then alloc_device_async, and the crash occurs inside cuda_stream_is_capturing.
System Information
No response