Skip to content

Conversation

@hjelmn
Copy link
Member

@hjelmn hjelmn commented Mar 10, 2015

This commit should resolve an issue seen with CUDA-aware support. The
problem came in with BTL 3.0. Before 3.0 the size of the copy was
stored in the incoming segment's des_remote_count field. This field
does not exist in BTL 3.0 so I stored the value in the
des_segment_count field. This caused problems with the cuda support
code. To fix the issue the endpoint pointer is now stored in the in
fragment's endpoint pointer which free's up the segment's des_cbdata
pointer for storing the transfer size.

Signed-off-by: Nathan Hjelm hjelmn@lanl.gov

This commit should resolve an issue seen with CUDA-aware support. The
problem came in with BTL 3.0. Before 3.0 the size of the copy was
stored in the incoming segment's des_remote_count field. This field
does not exist in BTL 3.0 so I stored the value in the
des_segment_count field. This caused problems with the cuda support
code. To fix the issue the endpoint pointer is now stored in the in
fragment's endpoint pointer which free's up the segment's des_cbdata
pointer for storing the transfer size.

Signed-off-by: Nathan Hjelm <hjelmn@lanl.gov>
@hjelmn
Copy link
Member Author

hjelmn commented Mar 10, 2015

@rolfv This may be a better way to fix async-copy support. Please try it out and let me know what you think.

@hjelmn hjelmn added the bug label Mar 10, 2015
@hjelmn hjelmn added this to the Open MPI 1.9.0 milestone Mar 10, 2015
@hjelmn hjelmn self-assigned this Mar 10, 2015
@mellanox-github
Copy link

Refer to this link for build results (access rights to CI server needed):
http://bgate.mellanox.com/jenkins/job/gh-ompi-master-pr/340/
Test PASSed.

hjelmn added a commit that referenced this pull request Mar 11, 2015
btl/openib: cuda: fix CUDA-aware support with async copy
@hjelmn hjelmn merged commit ce6caab into open-mpi:master Mar 11, 2015
jsquyres added a commit to jsquyres/ompi that referenced this pull request Nov 10, 2015
mtl/ofi: Merge inline progress function from master
@hjelmn hjelmn deleted the cuda_async branch May 23, 2016 17:43
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants