PageRank radixsort error on directed graphs. #14

Closed
yzhwang opened this issue Feb 6, 2015 · 3 comments

@yzhwang
Member

yzhwang commented Feb 6, 2015

[../../gunrock/app/pr/pr_enactor.cuh, 466] cub::DeviceRadixSort::SortPairsDescending failed (CUDA error 9: invalid configuration argument)
[test_pr.cu, 339] pr Problem Enact Failed (CUDA error 9: invalid configuration argument)

@1duo
Member

1duo commented Feb 6, 2015

This issue might not actually be related to the CUB sort. memcheck reports some invalid reads, and the answers are not always correct even for undirected graphs, so this needs to be checked in detail.

1duo added the 🐛 bug label Feb 6, 2015
1duo closed this as completed May 10, 2015
@sashkiani

@ydwu Could you fix this? On a different topic, I'm having exactly the same error in my own code using CUB.
I just realized that I get this error whenever I also include thrust/sort.h in my code. Without that include, everything works perfectly.

@1duo
Member

1duo commented Jul 8, 2015

Hi @sashkiani, this problem was originally observed when running bin/pagerank market directed_graph.mtx, but now I cannot reproduce the issue when I switch back to using CUB's sort (for safety I changed it to mgpu::sort in the current master branch). Changing pr_enactor.cuh L393 back to:

util::CUBRadixSort<Value, VertexId>(
        false, graph_slice->nodes,
        problem->data_slices[0]->d_rank_curr,
        problem->data_slices[0]->d_node_ids);

With that change, running bin/pagerank causes no problems on any of the datasets we have. I guess it was fixed somewhere around commit e7dbca5, but I'm not sure why.
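
For cross-checking the sorted output on the host, a minimal CPU reference of a descending key-value sort might look like the sketch below. It assumes the `false` argument in the call above selects descending order (consistent with the `SortPairsDescending` call in the error log). The function name `sort_pairs_descending` and the plain `float`/`int` types are hypothetical stand-ins, not Gunrock or CUB API.

```cpp
#include <algorithm>
#include <cassert>
#include <cstddef>
#include <numeric>
#include <vector>

// Hypothetical host-side reference (not part of Gunrock or CUB):
// reorders ranks (keys) and node ids (values) together so that ranks
// end up in descending order. The sort is stable, so nodes with equal
// rank keep their original relative order.
void sort_pairs_descending(std::vector<float>& ranks,
                           std::vector<int>& node_ids) {
    assert(ranks.size() == node_ids.size());
    const std::size_t n = ranks.size();

    // Sort a permutation of indices instead of the data itself,
    // so keys and values can be moved consistently afterwards.
    std::vector<std::size_t> order(n);
    std::iota(order.begin(), order.end(), 0);
    std::stable_sort(order.begin(), order.end(),
                     [&](std::size_t a, std::size_t b) {
                         return ranks[a] > ranks[b];
                     });

    // Apply the permutation to both arrays.
    std::vector<float> sorted_ranks(n);
    std::vector<int> sorted_ids(n);
    for (std::size_t i = 0; i < n; ++i) {
        sorted_ranks[i] = ranks[order[i]];
        sorted_ids[i] = node_ids[order[i]];
    }
    ranks.swap(sorted_ranks);
    node_ids.swap(sorted_ids);
}
```

Copying `d_rank_curr` and `d_node_ids` back to the host before and after the GPU sort and comparing against this reference would show whether the sort itself produces wrong output or whether the data is already corrupted before it runs.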

Weirdly enough, though, I ran more tests and observed that in the gunrock simple interface test, if I run BFS and/or BC before calling PageRank, this error occurs, while the other two primitives, SSSP and CC, have no impact. However, if I comment out L29 and L30, the error disappears. In my case, I didn't include any other library such as thrust/sort.h, and I didn't touch any input to the PageRank function call. Can you find any connection with your situation?

Also, I recall that "CUDA error 9" is possibly caused by the kernel launch parameter settings.
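
To illustrate what an invalid launch configuration can look like, here is a host-only sketch that checks grid and block dimensions before a launch. `Dim3`, `LaunchLimits`, `is_valid_launch`, and `blocks_for` are hypothetical names for illustration, not CUDA, CUB, or Gunrock API. The limits are assumed to be those of compute capability 3.0 and later, and real code should query them from the device instead.

```cpp
#include <cassert>
#include <cstdint>

// Hypothetical stand-in for CUDA's dim3, so the sketch compiles
// without the CUDA toolkit.
struct Dim3 {
    std::uint64_t x, y, z;
};

// Assumed limits (compute capability 3.0 and later). Real code should
// read these from cudaGetDeviceProperties rather than hard-coding them.
struct LaunchLimits {
    std::uint64_t max_threads_per_block = 1024;
    Dim3 max_block = {1024, 1024, 64};
    Dim3 max_grid = {2147483647ull, 65535, 65535};
};

// Returns true if the configuration stays within the assumed limits.
bool is_valid_launch(const Dim3& grid, const Dim3& block,
                     const LaunchLimits& lim = LaunchLimits()) {
    // Any zero-sized dimension is rejected.
    if (grid.x == 0 || grid.y == 0 || grid.z == 0) return false;
    if (block.x == 0 || block.y == 0 || block.z == 0) return false;

    // Per-dimension and total block size limits.
    if (block.x > lim.max_block.x || block.y > lim.max_block.y ||
        block.z > lim.max_block.z) return false;
    if (block.x * block.y * block.z > lim.max_threads_per_block) return false;

    // Per-dimension grid size limits.
    if (grid.x > lim.max_grid.x || grid.y > lim.max_grid.y ||
        grid.z > lim.max_grid.z) return false;
    return true;
}

// Ceiling division commonly used to size a 1D grid. Note that it
// yields 0 blocks when num_items is 0, which is not a valid grid size.
std::uint64_t blocks_for(std::uint64_t num_items, std::uint64_t block_size) {
    return (num_items + block_size - 1) / block_size;
}
```

Under these assumptions, an element count of zero or a corrupted count passed to a sort or kernel wrapper is one way to end up with such a configuration, which may be worth checking alongside the invalid reads reported by memcheck.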

neoblizz added a commit that referenced this issue Sep 9, 2022
Reworking the graph_t and graph::build interface.