Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Shared filesystem for multiple nodes - clarification #5802

Open
OgnjenMilicevic opened this issue Aug 25, 2020 · 1 comment
Open

Shared filesystem for multiple nodes - clarification #5802

OgnjenMilicevic opened this issue Aug 25, 2020 · 1 comment

Comments

@OgnjenMilicevic
Copy link

I am reading your tutorial on HPC and I have a question that could be very uneducated. The shared filesystem section talks about the localization strategies for inputs, which is certainly an issue, but the outputs are not mentioned.

Let's say I have several nodes in a cluster and a single shared volume between them, either physical or software one (like Lustre). I am using Slurm backend and any node can end up running any task based on internal Slurm scheduling. Ideally I would want each task to copy the inputs from the shared volume to a local folder, create outputs, and then copy outputs to the shared volume. I know one can output final outputs anywhere, but how can one control what happens to intermediate files? The problem would arise if the subsequent tasks in the workflow are done on different nodes, but is enforcing (one node)/(one wf execution) even possible? Even if it is it beats the point of scheduling resources by availability.

The solution I can see is running Cromwell FROM the shared volume, but then everything would happen there and tiny inputs and outputs would choke the job and possibly cause wear on hardware. Unless I can set a temp directory while the outputs are written?

I am asking because I am not experienced and would like to know if there are solutions I am missing before I end up doing development on my own.
Thanks!

@EugeneEA
Copy link

If some one can comment on this issue or point to the right reference for investigation I would be very gratefull ...
@OgnjenMilicevic did you find answer for your question eventually?
Best, Eugene

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants