Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

关于qp及通信的相关问题 #5

Closed
submission-550 opened this issue Sep 18, 2018 · 10 comments
Closed

关于qp及通信的相关问题 #5

submission-550 opened this issue Sep 18, 2018 · 10 comments

Comments

@submission-550
Copy link

  1. qps的含义是什么,对客户端和服务器来说内容有区别吗?
    2.get_local_qp_attr函数利用获取的qid返回的是发送方还是接收方的qp_attr?
    3.设备间建立连接的过程是在init2rtr还是在connect中change_qp_states之前?
@wxdwfc
Copy link

wxdwfc commented Sep 22, 2018

Hi,

I think this is related to how to use RDMA. I strongly suggest to read this document http://www.mellanox.com/related-docs/prod_software/RDMA_Aware_Programming_user_manual.pdf at first.

In a short reply:

  1. What is qps ?
  • qps are connections between different nodes.
  1. get_local_attr get local QP's attribute.
  • It is called when a remote QP wants to connect( to this local QP).
    3.The connections are established in change_qp_states().

Thanks.

@submission-550
Copy link
Author

感谢您的回答,我的RDMA网络是RoCE的,在源代码基础上做了调整,RDMA write时poll阶段报错“got bad completion with status: 0xc, vendor syndrome: 0x81, with error transport retry counter exceeded, qp n:1 t:0
”,请问你们有遇到过这一问题或者知道该如何解决吗?

@wxdwfc
Copy link

wxdwfc commented Sep 25, 2018 via email

@wxdwfc
Copy link

wxdwfc commented Sep 25, 2018 via email

@submission-550
Copy link
Author

我看了一下你们是改用了rdma_cm重写了这一模块是吗,我还是在你们原来libib的基础上加了gid,代码能运行到poll这里是不是说明connection已经成功了呢,有做网络的同学建议过我抓包看看数据是否异常,ibststus的输出您指的是什么呢?wc的status是IBV_WC_RETRY_EXC_ERR

@submission-550
Copy link
Author

ibstatus和ib_send_bw的输出情况如下,目测没什么问题:
_20180926091147
_20180926091211

@wxdwfc
Copy link

wxdwfc commented Sep 26, 2018 via email

@submission-550
Copy link
Author

嗯嗯我也发现我昨天看错了,那我直接用你们的新lib好了,谢谢你啦

@submission-550
Copy link
Author

wechatimg637
请问query分发在代码中只体现了按照subject分发,没有object和predicate的分类吗?是与论文中所写不同还是我漏看了哪里?烦请指教。

@realstolz
Copy link
Collaborator

Please open a new issue.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

3 participants