Skip to content

Conversation

teng-li
Copy link
Contributor

@teng-li teng-li commented Aug 22, 2018

Added Prefix Store support.

This will make group be backward compatible.

Test is covered too.

tengli@devfair033:~/new_pytorch/pytorch/torch/lib/build/c10d/test$ ./FileStoreTest
Using temporary file: /tmp/testoglRl4
Using temporary file: /tmp/testepZIpB
Test succeeded
tengli@devfair033:~/new_pytorch/pytorch/torch/lib/build/c10d/test$ ./TCPStoreTest
Test succeeded

@teng-li teng-li added the oncall: distributed Add this issue/PR to distributed oncall triage queue label Aug 22, 2018
Copy link
Contributor

@pietern pietern left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I think we should approach this differently. Instead of each store itself supporting a prefix, we can add a PrefixStore that wraps an existing store and adds the prefix to the key for every function call. This is the same approach as taken in gloo itself and allows for reuse of the underlying store. For example the TCP store won't start twice on the same address, but if you keep the original one around and create N prefix stores on top of it, they'll all share the same underlying TCP store.

This comment was marked as off-topic.

@teng-li
Copy link
Contributor Author

teng-li commented Aug 22, 2018

@pietern good point, let me add a prefix store for it

@teng-li teng-li changed the title [c10d] Added prefix support to FileStore and TCPStore for group backward compatibility [c10d] [c10d] Added Prefix Store for group backward compatibility Aug 22, 2018
@pietern pietern changed the title [c10d] [c10d] Added Prefix Store for group backward compatibility [c10d] Added Prefix Store for group backward compatibility Aug 22, 2018
@teng-li
Copy link
Contributor Author

teng-li commented Aug 23, 2018

@pytorchbot retest this please

@teng-li teng-li changed the title [c10d] Added Prefix Store for group backward compatibility [c10d] Added PrefixStore, pybind, test for group backward compatibility Aug 23, 2018
@pietern
Copy link
Contributor

pietern commented Aug 23, 2018

@pytorchbot retest this please

1 similar comment
@teng-li
Copy link
Contributor Author

teng-li commented Aug 23, 2018

@pytorchbot retest this please

@teng-li teng-li changed the title [c10d] Added PrefixStore, pybind, test for group backward compatibility [c10d] Added PrefixStore, pybind, test for group backward compatibility Aug 23, 2018
Copy link
Contributor

@facebook-github-bot facebook-github-bot left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

teng-li has imported this pull request. If you are a Facebook employee, you can view this diff on Phabricator.

@teng-li
Copy link
Contributor Author

teng-li commented Aug 23, 2018

@pietern stamp?

PenghuiCheng pushed a commit to PenghuiCheng/pytorch that referenced this pull request Sep 11, 2018
…orch#10762)

Summary:
Added Prefix Store support.

This will make group be backward compatible.

Test is covered too.
```
tengli@devfair033:~/new_pytorch/pytorch/torch/lib/build/c10d/test$ ./FileStoreTest
Using temporary file: /tmp/testoglRl4
Using temporary file: /tmp/testepZIpB
Test succeeded
tengli@devfair033:~/new_pytorch/pytorch/torch/lib/build/c10d/test$ ./TCPStoreTest
Test succeeded
```
Pull Request resolved: pytorch#10762

Differential Revision: D9484032

Pulled By: teng-li

fbshipit-source-id: 85754af91fe3f5605087c4a2f79ae930a9fd1387
@ezyang ezyang added the merged label Jun 26, 2019
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

oncall: distributed Add this issue/PR to distributed oncall triage queue

Projects

None yet

Development

Successfully merging this pull request may close these issues.

4 participants