You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Potential bug with allocating zero size View. Debug output shared by a user.
Program received signal SIGSEGV, Segmentation fault.
0x0000000008bc8f37 in
Kokkos::Experimental::Impl::SharedAllocationRecord<Kokkos::CudaUVMSpace,
void>::SharedAllocationRecord (this=0x26210120, arg_space=...,
arg_label=..., arg_alloc_size=0,
arg_dealloc=0x7bb565a <Kokkos::Experimental::Impl::(anonymous
namespace)::deallocate<Kokkos::CudaUVMSpace,
Kokkos::Experimental::Impl::ViewValueFunctor<Kokkos::Cuda,
Kokkos::pair<int, int>, false>
(Kokkos::Experimental::Impl::SharedAllocationRecord<void, void>*)>) at
/home/mbetten/Trilinos/Trilinos/packages/kokkos/core/src/Cuda/Kokkos_CudaSpace.cpp:497
497 RecordBase::m_alloc_ptr->m_record = this ;
(cuda-gdb) where
Added unit test for zero length allocation and CUDA-UVM, test passes.
Will add precondition-checks at potential failure points in this stack to attempt to catch this kind of error before a segfault.
hcedwar
added
Enhancement
Improve existing capability; will potentially require voting
and removed
Bug
Broken / incorrect code; it could be Kokkos' responsibility, or others’ (e.g., Trilinos)
labels
Mar 18, 2016
Issue was downstream bug in Tpetra. trilinos/Trilinos#227
Adding a few precondition checks to shared allocation tracker that were motivated by this issue.
Potential bug with allocating zero size View. Debug output shared by a user.
Program received signal SIGSEGV, Segmentation fault.
0x0000000008bc8f37 in
Kokkos::Experimental::Impl::SharedAllocationRecord<Kokkos::CudaUVMSpace,
void>::SharedAllocationRecord (this=0x26210120, arg_space=...,
arg_label=..., arg_alloc_size=0,
arg_dealloc=0x7bb565a <Kokkos::Experimental::Impl::(anonymous
namespace)::deallocate<Kokkos::CudaUVMSpace,
Kokkos::Experimental::Impl::ViewValueFunctor<Kokkos::Cuda,
Kokkos::pair<int, int>, false>
The text was updated successfully, but these errors were encountered: