FabSim3 provides multi-threading functionality to reduce the total job submission time for a large number of ensemble/replica runs.
However, when the number of ensemble runs is very high (>30k), the large number of SSH connections can cause the submission process to fail:
...
raise SSHException("SSH session not active")
paramiko.ssh_exception.SSHException: SSH session not active
There are a number of ways we can handle this issue, but each of them will come at the cost of a longer total submission time.
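One possible direction, as a rough sketch only and not FabSim3's actual code: cap the number of concurrent submissions with a bounded thread pool and retry a failed submission with exponential backoff. Everything named here is hypothetical: `submit_job` stands for whatever callable performs a single submission, `TransientSSHError` is a stand-in for `paramiko.ssh_exception.SSHException`, and the default values for `max_workers`, `retries`, and `base_delay` are arbitrary.

```python
import time
from concurrent.futures import ThreadPoolExecutor


class TransientSSHError(Exception):
    """Hypothetical stand-in for paramiko.ssh_exception.SSHException."""


def submit_with_retry(submit_job, job, retries=3, base_delay=1.0, sleep=time.sleep):
    """Call submit_job(job), retrying with exponential backoff on SSH errors."""
    for attempt in range(retries + 1):
        try:
            return submit_job(job)
        except TransientSSHError:
            if attempt == retries:
                raise  # give up after the last allowed retry
            # Wait base_delay, 2*base_delay, 4*base_delay, ... before retrying.
            sleep(base_delay * 2 ** attempt)


def submit_all(submit_job, jobs, max_workers=8, **retry_kwargs):
    """Submit all jobs using at most max_workers concurrent threads."""
    with ThreadPoolExecutor(max_workers=max_workers) as pool:
        futures = [
            pool.submit(submit_with_retry, submit_job, job, **retry_kwargs)
            for job in jobs
        ]
        # Results are returned in the same order as the input jobs.
        return [future.result() for future in futures]
```

A smaller `max_workers` keeps fewer SSH sessions open at once, and the backoff gives the remote side time to recover, which is where the extra submission time comes from.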
What do you think, @djgroen? What are your suggestions for tackling this issue?