This repository has been archived by the owner on Jan 6, 2023. It is now read-only.
Currently torchelastic-0.1.0rc2 (trunk) depends on torch nightly to use EtcdStore instead of the TCPStore. The following two things need to happen when torch 1.5.0 releases:
In requirements.txt, change the dependency from torch>1.4.0 to torch>=1.5.0
Edit .circleci/config.yml#install_dep target to remove the manual install of torch nightly
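The difference between the two specifiers can be sketched in a few lines of Python. This is a simplified illustration, not pip's actual resolver: the helper names (parse_version, sort_key, satisfies) are hypothetical, the version 1.5.0.dev20200301 is a made-up nightly tag, and the ordering only models the rule that a .dev build sorts before the final release with the same number. Versions with a different number of components (1.5 vs 1.5.0) are not normalized here.

```python
import re


def parse_version(text):
    """Split a version such as '1.5.0.dev20200301' into (release tuple, is_dev)."""
    match = re.match(r"^(\d+(?:\.\d+)*)(\.dev\d*)?", text)
    if match is None:
        raise ValueError(f"unrecognized version: {text!r}")
    release = tuple(int(part) for part in match.group(1).split("."))
    return release, match.group(2) is not None


def sort_key(text):
    """Order versions so that a dev build sorts before its final release."""
    release, is_dev = parse_version(text)
    return release, 0 if is_dev else 1


def satisfies(installed, op, required):
    """Check a single '>' or '>=' constraint (simplified, illustrative only)."""
    if op == ">":
        return sort_key(installed) > sort_key(required)
    if op == ">=":
        return sort_key(installed) >= sort_key(required)
    raise ValueError(f"unsupported operator: {op!r}")


nightly = "1.5.0.dev20200301"  # hypothetical nightly build tag
print(satisfies(nightly, ">", "1.4.0"))    # nightly is accepted by torch>1.4.0
print(satisfies(nightly, ">=", "1.5.0"))   # nightly is rejected by torch>=1.5.0
print(satisfies("1.5.0", ">=", "1.5.0"))   # the final release is accepted
```

Under this ordering, torch>1.4.0 admits the nightly build while torch>=1.5.0 only admits the final 1.5.0 release or later, which is why the requirement can be tightened once 1.5.0 ships.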
Known workarounds
Manually install torch nightly (1.5.0.dev+). For more details see https://pytorch.org/
Summary:
Pull Request resolved: pytorch/torchx#39
The diff merges Container and Role together to make torchx APIs simpler. The diff also modifies consumers of the torchx lib.
Reviewed By: d4l3k, wilson100hong
Differential Revision: D28915998
fbshipit-source-id: e7a4702c1c4039f1caa0823da8a7f8db7278a248
fotstrt pushed a commit to eth-easl/elastic that referenced this issue on Feb 17, 2022
🐛 Bug
Currently torchelastic-0.1.0rc2 (trunk) depends on torch nightly to use EtcdStore instead of the TCPStore. The following two things need to happen when torch 1.5.0 releases:
requirements.txt: change the dependency from torch>1.4.0 to torch>=1.5.0
.circleci/config.yml#install_dep: edit the target to remove the manual install of torch nightly
Known workarounds
Manually install torch nightly (1.5.0.dev+). For more details see https://pytorch.org/
torchelastic (where ~/elastic is the repo root)
Component (check all that applies):
state api
train_step api
train_loop
rendezvous
checkpoint
rollback
metrics
petctl
examples
docker
To Reproduce
N/A
Expected behavior
python setup.py test should pass all tests
Environment
(conda, pip, source, docker): from source
Additional context
See #37, #34