Skip to content

Enable test_rref_timeout for Tensorpipe agent #39468

@rohan-varma

Description

@rohan-varma

🚀 Feature

https://github.com/pytorch/pytorch/pull/38590/files is adding support for RRef timeouts, but the necessary error handling is not yet added to Tensorpipe agent. The test test_rref_timeout is disabled for tensorpipe in that PR, so we should add the necessary support to TP and enable this test and verify that RRef timeouts work appropriately.

cc @pietern @mrshenli @pritamdamania87 @zhaojuanmao @satgera @gqchen @aazzolini @rohan-varma @xush6528 @jjlilley @osalpekar @jiayisuse @lw @beauby

Metadata

Metadata

Assignees

No one assigned

    Labels

    module: rpcRelated to RPC, distributed autograd, RRef, and distributed optimizermodule: tensorpipeRelated to Tensorpipe RPC AgenttriagedThis issue has been looked at a team member, and triaged and prioritized into an appropriate module

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions