Commit cc897ec
resolve KeyError: 'PDSH_SSH_ARGS_APPEND' (#5318)
When starting a job with `deepspeed --hostfile hostfile --master_addr
$MASTER_IP --ssh_port 20023 src/train_bash.py`,
the launcher raises `KeyError: 'PDSH_SSH_ARGS_APPEND'` at
https://github.com/microsoft/DeepSpeed/blob/master/deepspeed/launcher/multinode_runner.py#L77
because `PDSH_SSH_ARGS_APPEND` is not set in the environment.
---------
Co-authored-by: Logan Adams <114770087+loadams@users.noreply.github.com>
1 parent b5e2045 commit cc897ec
1 file changed, +2 -1 lines changed

| Original file line number | Diff line number | Diff line change |
|---|---|---|
| 74 | 74 | |
| 75 | 75 | |
| 76 | 76 | |
| 77 | | - |
| | 77 | + |
| | 78 | + |
| 78 | 79 | |
| 79 | 80 | |
| 80 | 81 | |
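The changed lines themselves did not survive in this view, so the following is only a minimal sketch of the general technique the commit message points to: reading `PDSH_SSH_ARGS_APPEND` with a default instead of indexing it directly. The helper name `append_ssh_port` and the exact string format are assumptions made for illustration; this is not the code from `multinode_runner.py`.

```python
import os


def append_ssh_port(environment, ssh_port):
    """Return a copy of `environment` with `-p <ssh_port>` appended to
    PDSH_SSH_ARGS_APPEND (hypothetical helper, for illustration only)."""
    env = dict(environment)
    # `env["PDSH_SSH_ARGS_APPEND"] += ...` raises KeyError when the variable
    # is unset; .get() with an empty default avoids that.
    existing = env.get("PDSH_SSH_ARGS_APPEND", "")
    env["PDSH_SSH_ARGS_APPEND"] = f"{existing} -p {ssh_port}".strip()
    return env


if __name__ == "__main__":
    # Works whether or not the variable is present in the real environment.
    print(append_ssh_port(os.environ, 20023)["PDSH_SSH_ARGS_APPEND"])
```

Working on a copy keeps the caller's mapping unchanged, which makes the behavior easy to check with a plain `dict` in place of `os.environ`.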