Skip to content

Assertion `0 != r_mkey->len' failed in spml_yoda.c:1113:mca_spml_yoda_get #367

@bertwesarg

Description

@bertwesarg

I have now a reproducer which fails for me with 1.8.4 and dev-791-g03c2adf but works with 1.8.2. SHMMAX is:

$ cat /proc/sys/kernel/shmmax
268435456

config.log and ompi_info --all is attached for both 1.8.2 and dev-791-g03c2adf.

$ oshcc -W -Wall -o shmem_hello_world shmem_hello_world.c
$ oshrun -np 2 -x SHMEM_SYMMETRIC_HEAP_SIZE=$((32*1024*1024)) ./shmem_hello_world
Hello from PE 2 of 2
Hello from PE 1 of 2
shmem_hello_world: ../../../../../oshmem/mca/spml/yoda/spml_yoda.c:1113: mca_spml_yoda_get: Assertion `0 != r_mkey->len' failed.
[tudtug:25168] *** Process received signal ***
[tudtug:25168] Signal: Aborted (6)
[tudtug:25168] Signal code:  (-6)
shmem_hello_world: ../../../../../oshmem/mca/spml/yoda/spml_yoda.c:1113: mca_spml_yoda_get: Assertion `0 != r_mkey->len' failed.
[tudtug:25169] *** Process received signal ***
[tudtug:25169] Signal: Aborted (6)
[tudtug:25169] Signal code:  (-6)

Backtrace for PE 0 is:

#0  0x00007ffff749f0d5 in __GI_raise (sig=<optimized out>)
    at ../nptl/sysdeps/unix/sysv/linux/raise.c:64
#1  0x00007ffff74a283b in __GI_abort () at abort.c:91
#2  0x00007ffff7497d9e in __assert_fail_base (fmt=<optimized out>,
    assertion=0x7fffd38c2658 "0 != r_mkey->len",
    file=0x7fffd38c21f8 "../../../../../oshmem/mca/spml/yoda/spml_yoda.c",
    line=<optimized out>, function=<optimized out>) at assert.c:94
#3  0x00007ffff7497e42 in __GI___assert_fail (
    assertion=0x7fffd38c2658 "0 != r_mkey->len",
    file=0x7fffd38c21f8 "../../../../../oshmem/mca/spml/yoda/spml_yoda.c",
    line=1113, function=0x7fffd38c26e0 "mca_spml_yoda_get") at assert.c:103
#4  0x00007fffd38c0d61 in mca_spml_yoda_get (src_addr=0xff004100, size=8,
    dst_addr=0x7fffffffc9f0, src=1)
    at ../../../../../oshmem/mca/spml/yoda/spml_yoda.c:1113
#5  0x00007fffd36b5f19 in _algorithm_f_recursive_doubling (group=0xc3fb90,
    target=0xff0020d8, source=0xff0000d0, nlong=4, pSync=0xff004100)
    at ../../../../../oshmem/mca/scoll/basic/scoll_basic_collect.c:472
#6  0x00007fffd36b4bfd in mca_scoll_basic_collect (group=0xc3fb90,
    target=0xff0020d8, source=0xff0000d0, nlong=4, pSync=0xff004100,
    nlong_type=true, alg=2)
    at ../../../../../oshmem/mca/scoll/basic/scoll_basic_collect.c:93
#7  0x00007ffff7a76b2c in _shmem_collect (target=0xff0020d8,
    source=0xff0000d0, nbytes=4, PE_start=0, logPE_stride=0, PE_size=2,
    pSync=0xff004100, array_type=true) at pshmem_collect.c:87
#8  0x00007ffff7a77236 in pshmem_fcollect32 (target=0xff0020d8,
    source=0xff0000d0, nelems=1, PE_start=0, logPE_stride=0, PE_size=2,
    pSync=0xff004100) at pshmem_collect.c:115
#9  0x00000000004009d8 in main ()

Metadata

Metadata

Assignees

Labels

Type

No type

Projects

No projects

Relationships

None yet

Development

No branches or pull requests

Issue actions