Remote processes need to be considered in the OpenAPI spec #35

francbartoli · 2019-06-20T11:06:27Z

I'd like to declare a process as executed remotely, something like:

process:
    allOf:
       - $ref: 'processSummary.yaml'
       - type: object
         required:
           - id
         properties:
            provider:
              type: string
            inputs:
              type: array
              items: 
                 $ref: 'inputDescription.yaml'
            outputs:
              type: array
              items: 
                 $ref: 'outputDescription.yaml'
            links:
              type: array
              items:
                 $ref: 'link.yaml'

provider might be simply local or remote with the name (for instance Google Earth Engine or GeoScope) of the cloud infrastructure

The text was updated successfully, but these errors were encountered:

christophenoel · 2019-06-20T11:52:00Z

Some comments:

From my point of view, the nominal case for a WPS implementation is that your server supports various profiles for the processes: e.g. it may support Java processes, Python processes, BPEL workflow processes, but also Docker contains executed locally, or containers executed on Google Cloud, Amazon, Azure... The remote backend is specific to your WPS implementation, and all my docker containers will be executed on the same specific cloud (for a given implementation). I admit you can imagine having different processes with totally different implementations, but it sounds not the nominal case.
Despite this is process specific (process description) or server specific (capabitilies document), this information (the remote backend / cloud server, etc.) is not an execution mandatory input, but only a document information intended to the user.
--> such information is intended to be included either in a ows:Metadata element, or in the generic ows:additionalProperties element and do not require a modification to the spec.

My two cents...

gfenoy · 2019-06-20T12:05:04Z

Now I think I understand better this issue.

It does not seem to be only related to providing informations of where the service has been run on or can be run but also give the opportunity to the client to choose which host will run the service effectively. So the client can use this initial information provided in the DescribeProcess to decide where to run the service.

I think this is a great addition which should also be investigated for WPS 2.0 version (the previous one) probably as an extension as explained y Benjamin.

christophenoel · 2019-06-20T12:50:55Z

At least, the remote host can be specified in a process input, as any other execution parameter.

francbartoli · 2019-06-20T12:56:03Z

@gfenoy also an additional explanation could be that the client has to be aware of some sort of authentication to run specific processes. For example, in Google Earth Engine if you want to execute a process through their API you have to provide a service account but I would imagine the same sort of mechanism for other providers.
Likely it might end up with an extension in the processSummary

bradh · 2019-06-20T21:34:18Z

@francbartoli What kind of information would the user/consumer (or different kinds of users/consumers, if there are several) need to have to make an sensible choice?

I worry that this is an "unbreak it" option, where if the process can only really work locally (or remotely - one but not the other; or on a specific machine), and we offer the option, then the user needs to guess the right answer. It seems better to just hide the complexity as much as possible.

The authentication part is different - that should be a mandatory parameter if a service needs it. If you are providing a facade, then maybe the facade encapsulates that.

christophenoel · 2019-06-21T07:44:16Z

I think the required information would strongly depends on the implementation: some would require an endpoint (some several), a user and password, or a token. We have also identified (in TB14 NExtGen) that CPU/RAM resources assigned could be driven by the user).

Therefore, the Process Description simply needs to include the required Cloud/Grid/Kubernetes input description specific to the server, and the user will be able to submit this info.

 "inputs" : [
        {
          "id" : "kubernetesEndpoint",
          "title" : "Kubernetes Endpoint URL",
          "formats" : [
            {
              "mimeType" : "text/plain",
              "default" : true
            }
          ],
          "minOccurs" : 1,
          "maxOccurs" : 1,
          "LiteralDataDomain" : {
            "dataType" : "String"
          }
        },
        {
          "id" : "kubernetesToken",
          "title" : "Kubernetes Token",
          "formats" : [
            {
              "mimeType" : "text/plain",
              "default" : true
            }
          ],
          "minOccurs" : 1,
          "maxOccurs" : 1,
          "LiteralDataDomain" : {
            "dataType" : "String"
          }
        },
        {
          "id" : "kubernetesCluster",
          "title" : "Kubernetes Cluster id",
          "formats" : [
            {
              "mimeType" : "text/plain",
              "default" : true
            }
          ],
          "minOccurs" : 1,
          "maxOccurs" : 1,
          "LiteralDataDomain" : {
            "dataType" : "String"
          }
        },

francbartoli · 2019-06-21T07:55:46Z

@spacebel that's interesting, I still believe that qualifying the provider (local/remote) which will execute your process is a benefit

bpross-52n · 2020-08-17T13:29:16Z

Discussion will continue here: #47

bpross-52n added the OGC API Hackathon OGC API Hackathon 2019 label Jun 20, 2019

jerstlouis mentioned this issue Jul 30, 2019

Highly scalable, chainable and flexible processes centered around 'Collections' as inputs & outputs #47

Open

bpross-52n closed this as completed Aug 17, 2020

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Remote processes need to be considered in the OpenAPI spec #35

Remote processes need to be considered in the OpenAPI spec #35

francbartoli commented Jun 20, 2019 •

edited

christophenoel commented Jun 20, 2019

gfenoy commented Jun 20, 2019

christophenoel commented Jun 20, 2019

francbartoli commented Jun 20, 2019

bradh commented Jun 20, 2019

christophenoel commented Jun 21, 2019

francbartoli commented Jun 21, 2019

bpross-52n commented Aug 17, 2020

Remote processes need to be considered in the OpenAPI spec #35

Remote processes need to be considered in the OpenAPI spec #35

Comments

francbartoli commented Jun 20, 2019 • edited

christophenoel commented Jun 20, 2019

gfenoy commented Jun 20, 2019

christophenoel commented Jun 20, 2019

francbartoli commented Jun 20, 2019

bradh commented Jun 20, 2019

christophenoel commented Jun 21, 2019

francbartoli commented Jun 21, 2019

bpross-52n commented Aug 17, 2020

francbartoli commented Jun 20, 2019 •

edited