File object management #129
Comments
It looks like @andre-merzky and @mturilli does it seem reasonable to build on SAGA here? How would I extract a Presumably, somewhere under the hood, RP translates its URL scheme to SAGA URLs? Is there an accessible function that client software could use to get a URL with a scheme radical.saga would understand? Or does RP register its extra schemes with the underlying SAGA resolver?
Yes, I assume so
The
It is not exposed. The URL translation code in RP is complex and behaves differently depending on where the URL is used, which component requests the translation, etc., so I doubt it is immediately useful to ScaleMS.
Specifically, I am trying to figure out the easiest way to extract a saga File object from a path which may be based on one of the RP-specific URIs provided by RP objects. It isn't always obvious to a programmer whether an attribute is going to need extra processing. The RP documentation mentions However, it appears that But what would be the best way to insert the appropriate SAGA access scheme? (I think this amounts to the
You can always get the task and pilot sandboxes via What operations do you intend to implement (beyond those provided by the staging ops)?
I don't think there is a need for anything beyond the staging ops. But I don't need to write a wrapper for RP-based file references that includes a bound Pilot, Task, and/or Session if I can easily get a
It looks like Pilot stores the various sandbox URLs as RU URL objects internally, but pilot_sandbox specifically is converted to
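For illustration, the scheme substitution discussed above could look roughly like the sketch below. It is not RP's internal translation code (which, as noted, is not exposed). It assumes sandbox-relative URIs of the form pilot:///... and task:///..., and it assumes the client has already obtained the sandbox base URLs from the RP objects. The resolve function, the SANDBOXES mapping, and the host and paths are all hypothetical.

```python
import posixpath
from urllib.parse import urlsplit, urlunsplit

# Hypothetical sandbox base URLs. In practice these would be read from the
# RP Pilot and Task objects; the host and paths here are made up.
SANDBOXES = {
    "pilot": "sftp://login.example.org/scratch/pilot.0000/",
    "task": "sftp://login.example.org/scratch/pilot.0000/task.000000/",
}


def resolve(uri, sandboxes):
    """Replace a sandbox-relative scheme with the concrete sandbox URL.

    URIs whose scheme is not a key of `sandboxes` are returned unchanged.
    """
    parts = urlsplit(uri)
    base = sandboxes.get(parts.scheme)
    if base is None:
        return uri
    base_parts = urlsplit(base)
    # Join the sandbox directory with the path relative to the sandbox root.
    path = posixpath.join(base_parts.path, parts.path.lstrip("/"))
    return urlunsplit(
        (base_parts.scheme, base_parts.netloc, path, parts.query, parts.fragment)
    )
```

The resulting string carries an ordinary access scheme, so it could in principle be handed to whatever file API the client uses. Whether that matches what RP does for a given component is exactly the open question in this thread.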
Within the scope of this issue, we should make sure to support a user-provided "label" that can be easily cross-referenced with the local workflow metadata to locate file identifiers in a flexible and user-friendly way.
Establish an abstract interface for filesystem objects that can be implemented for RP. File references should allow data flow to be defined with minimal coupling to actual data location at the time of expression. File Futures allow reference to file objects that do not yet exist.
A File reference must be easily localized to the contexts of different workflow managers, such as the client environment to the execution environment and back again.
Data localization and path management must be handled automatically by the WorkflowManager instances.
Unnecessary data transfers must be avoidable through optimization code in WorkflowManager.
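As a rough sketch of what such an interface might look like (not the scalems design; every name below is hypothetical), a file reference can expose a location-independent key, report whether it is already available in a given context, and return a Future for its localized path. The toy implementation only records where the file would be placed, and it skips the "transfer" when the file is already present in the requested context.

```python
import abc
from concurrent.futures import Future
from pathlib import PurePosixPath


class FileReference(abc.ABC):
    """Hypothetical location-independent handle to a file."""

    @abc.abstractmethod
    def key(self) -> str:
        """Identifier that stays the same wherever the file is stored."""

    @abc.abstractmethod
    def is_local(self, context: str) -> bool:
        """Whether the file is already available in the named context."""

    @abc.abstractmethod
    def localize(self, context: str) -> Future:
        """Return a Future whose result is the path in the named context."""


class InMemoryFileReference(FileReference):
    """Toy implementation: tracks locations instead of moving real data."""

    def __init__(self, key, locations=None):
        self._key = key
        self._locations = dict(locations or {})
        self.transfers = 0  # number of simulated transfers

    def key(self):
        return self._key

    def is_local(self, context):
        return context in self._locations

    def localize(self, context):
        future = Future()
        if context not in self._locations:
            # Stand-in for a real staging operation handled by a manager.
            self._locations[context] = PurePosixPath("/") / context / self._key
            self.transfers += 1
        future.set_result(self._locations[context])
        return future


ref = InMemoryFileReference("abc123", {"client": PurePosixPath("/home/user/conf.tpr")})
client_path = ref.localize("client").result()        # already local, no transfer
execution_path = ref.localize("execution").result()  # one simulated transfer
```

In a real implementation the Future would be resolved by the WorkflowManager once staging completes, which is what would let a reference stand in for a file that does not exist yet.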
Relates to #75
scalems.executable as an abstract operation Director that generates file staging and subprocess primitives.
scale-ms/tests/test_rp_exec.py, line 155 in cfc4364