TCPStore: Address already in use in test_distributed #12876

@pietern

🐛 Bug

test_distributed is flaky: TCPStore intermittently fails with "Address already in use" because of port conflicts.

This can happen in two ways: through interference within the same test process (subprocesses not exiting fast enough before the next test starts; I need to double-check whether this is possible at all), or through interference between processes, e.g. when the same tests run on the same machine at the same time and try to bind to the same port.
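For illustration, the underlying failure can be reproduced with plain sockets, without PyTorch. This is only a sketch of the mechanism, not the code path TCPStore actually takes; `bind_listener` and `second_bind_errno` are hypothetical helper names.

```python
import errno
import socket
from typing import Optional


def bind_listener(port: int) -> socket.socket:
    """Bind and listen on 127.0.0.1:port, roughly as a TCP-based store server would."""
    sock = socket.socket(socket.AF_INET, socket.SOCK_STREAM)
    try:
        sock.bind(("127.0.0.1", port))
        sock.listen(1)
    except OSError:
        sock.close()
        raise
    return sock


def second_bind_errno() -> Optional[int]:
    """Return the errno raised when a second listener binds to a port already in use."""
    first = bind_listener(0)  # port 0 lets the OS pick a free port
    port = first.getsockname()[1]
    try:
        second = bind_listener(port)  # same port while the first listener is still open
    except OSError as exc:
        return exc.errno
    else:
        second.close()
        return None
    finally:
        first.close()


if __name__ == "__main__":
    print(second_bind_errno() == errno.EADDRINUSE)
```

Two test runs that hard-code the same port hit this error whenever they overlap in time on one machine.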

We can fix this by making these tests use the FileStore on a unique path instead.
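A minimal sketch of what "a unique path" could look like, assuming the tests rendezvous through a `file://` init method (the issue does not say whether the tests would use `init_method` or construct a FileStore directly). `unique_file_init_method` is a hypothetical helper, not part of test_distributed.

```python
import os
import tempfile


def unique_file_init_method(prefix: str = "test_distributed_") -> str:
    """Return a file:// URL pointing into a freshly created, unique directory.

    The rendezvous file itself is not created here; only its parent
    directory is, so the path cannot collide with another test run.
    """
    directory = tempfile.mkdtemp(prefix=prefix)  # unique per call
    return "file://" + os.path.join(directory, "rendezvous")


if __name__ == "__main__":
    url = unique_file_init_method()
    print(url)
    # Assumed usage: generate the URL once in the parent process and pass
    # the same value to every rank, e.g.
    #   torch.distributed.init_process_group(
    #       backend="gloo", init_method=url, rank=rank, world_size=world_size)
```

Because every test gets its own directory, neither leftover subprocesses from an earlier test nor a concurrent run on the same machine can contend for the same resource, which is the failure mode a fixed TCP port has.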

cc @ezyang @teng-li @ssnl

Labels

oncall: distributed
