Replace tag_search with tag_key and tag_value. #33

tetron · 2018-06-07T20:39:37Z

Spinoff from discussion on #30

We allow key-value "tags" on workflow runs. These are most useful if the client can filter on them. The current description for "tag_search" is too vague. This adds query parameters tag_key and tag_value and describes their usage more precisely.

Also adds a "state" query parameter for filtering by state.

tetron · 2018-06-07T20:40:30Z

ping @geoffjentry @mckinsel @jaeddy @briandoconnor @dglazer

jaeddy

I vote for simplifying the tag_key/tag_value combo to a single filter object parameter.

jaeddy · 2018-06-15T02:47:19Z

openapi/workflow_execution_service.swagger.yaml

+          in: query
+          required: false
+          type: string
+        - name: tag_value


If I'm understanding this right, this filter would look like GET ".../workflows?tag_key=foo&tag_value=bar", correct? I can see the argument for arbitrarily complex queries, but it doesn't seem like the current OpenAPI spec supports that very well. Because we're already defining tags in the POST request to be an object, I think it's reasonable to use a single parameter with the same format here, e.g.:

    - name: filter
      description: |-
        OPTIONAL
        Filter workflow runs to only return those where exact key-value pairs are present in the "tags" object specified by the client to track submissions.
      in: query
      required: false
      type: object

tetron · 2018-06-18

Does OpenAPI specify how type: object is encoded when it is part of a query? The main reason I proposed doing it this way is to avoid jamming ugly URL-encoded JSON into the query portion.

jaeddy · 2018-06-18 (edited)

The default style (form) and "explode" behavior (true) for object parameter serialization in queries would apparently look like this:

Object id = {"role": "admin", "firstName": "Alex"} => /users?role=admin&firstName=Alex
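As a rough illustration of the form/explode=true serialization shown above, here is a minimal Python sketch using only the standard library. It reuses the example object from the comment and is not taken from any WES implementation.

```python
from urllib.parse import urlencode

# Example object from the comment above; dicts keep insertion order.
params = {"role": "admin", "firstName": "Alex"}

# With style=form and explode=true, each property becomes its own query parameter.
query = urlencode(params)
url = f"/users?{query}"
print(url)
```

Note that the parameter name itself ("id" in the example) does not appear in the resulting URL, which is why the server cannot tell these keys apart from other query parameters.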

You can specify more fine-grained and/or complex types for the parameter with schema (example from this page):

    parameters:
      - in: query
        name: filter
        # Wrap 'schema' into 'content.<media-type>'
        content:
          application/json:  # <---- media type indicates how to serialize / deserialize the parameter content
            schema:
              type: object
              properties:
                type:
                  type: string
                color:
                  type: string

... but I think that only works if we define specific keys for the filter. (?)

Not sure if that answers your question, or avoids encoding issues. I can play around with some examples after today's call to see how it looks.

tetron · 2018-06-18

Oh, I see. I think you're proposing that any query parameter that isn't one of page_size, page_token or state would be considered a filter on tags. That seems quite reasonable.
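A minimal sketch of how a server might apply that rule, assuming the only reserved parameters are the three named above. The function name split_query and the choice to let the last value win for repeated keys are hypothetical, not part of the spec.

```python
from urllib.parse import parse_qsl

# Parameters named in the discussion as reserved; everything else is a tag filter.
RESERVED_PARAMS = {"page_size", "page_token", "state"}


def split_query(query_string):
    """Split a raw query string into (reserved, tag_filters) dicts."""
    reserved, tag_filters = {}, {}
    for key, value in parse_qsl(query_string, keep_blank_values=True):
        target = reserved if key in RESERVED_PARAMS else tag_filters
        target[key] = value  # assumption: the last value wins if a key repeats
    return reserved, tag_filters
```

One consequence of this design is that a tag can never be named page_size, page_token or state, and any reserved parameter added later could collide with tags that clients already use.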

brucehoff · 2018-07-02

An alternative to URL-encoding a JSON object and including it as a query parameter is to put the object in the request body (and use POST instead of GET for the HTTP verb). The advantage is that if the object becomes large, the URL will not run up against the maximum URL length.

geoffjentry · 2018-07-02

FWIW we ran into the URL length limit in Cromwell, which led to us having both a GET and a POST for the equivalent behavior (why not just POST? Because people normally see it as a pain). So this is definitely a potential issue.
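To make the GET/POST trade-off above concrete, here is a hedged Python sketch of a client that uses GET with URL-encoded JSON while the URL stays short and falls back to POST otherwise. The "filter" parameter name comes from the proposal earlier in this thread; the 2000-character limit, the function name, and posting to the same URL are assumptions for illustration only.

```python
import json
from urllib.parse import quote

# Assumed limit for illustration; real limits depend on the server and client.
MAX_URL_LENGTH = 2000


def build_filter_request(base_url, tags, max_url_length=MAX_URL_LENGTH):
    """Return (method, url, body) for a tag-filter query.

    Uses GET with URL-encoded JSON when the URL is short enough,
    otherwise falls back to POST with the JSON in the request body.
    """
    encoded = json.dumps(tags, separators=(",", ":"), sort_keys=True)
    get_url = f"{base_url}?filter={quote(encoded, safe='')}"
    if len(get_url) <= max_url_length:
        return "GET", get_url, None
    return "POST", base_url, encoded
```

For a single tag such as {"foo": "bar"} this yields a GET whose query is filter=%7B%22foo%22%3A%22bar%22%7D, which also shows the "ugly URL-encoded JSON" concern raised earlier.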

jaeddy · 2018-06-15T03:02:08Z

openapi/workflow_execution_service.swagger.yaml

+          in: query
+          required: false
+          type: string
+        - name: state


Can the state parameter description just be replaced with:

- $ref: '#/definitions/State'

...?

tetron · 2018-06-18

Probably, since it only makes sense to accept a string in the state enum.

jaeddy · 2018-06-15T03:04:22Z

openapi/workflow_execution_service.swagger.yaml

+          A key-value map of arbitrary metadata which may be used by
+          the client to organize and manage workflow runs.  Tags must
+          not be considered part of the workflow run input.  Clients may
+          use the "tag_key" and "tag_value" query parameters to filter


... use the "tag_key" and "tag_value" query parameters to filter ...

to

... use the key-value query parameters to filter ...

(see above comment re: filter parameter)

tetron · 2018-06-25T18:52:41Z

Just to surface this: I think @jaeddy was proposing that any URL query parameter that isn't one of page_size, page_token or state would be considered a filter on tags. I don't know how to express that with Swagger, but otherwise I don't have a problem with it in principle. What do you think, @geoffjentry @mckinsel?

geoffjentry · 2018-06-26T20:20:03Z

@tetron I'm not sure if I think that'd be more or less confusing to a user, but have no strong feelings.

briandoconnor · 2018-07-02T14:35:00Z

@mckinsel is this a requirement for HCA?

@jaeddy is this needed for the testbed?

What about other drivers?

briandoconnor · 2018-07-02T14:36:06Z

What about @geoffjentry ? He said if it's on the spec they will implement...

geoffjentry · 2018-07-02T16:41:10Z

@briandoconnor I don't have a strong opinion on the topic and we'll likely need to do roughly the same amount of work either way

briandoconnor · 2018-07-09T17:24:52Z

@jaeddy will resolve the conflicts after the other PRs get merged

jaeddy

OpenAPI 2.0 doesn't support object types for query parameters. My suggestion of mirroring the "tags" map from the workflow run request with a single "filter" parameter won't work for now. We could investigate @brucehoff's suggestion of a "POST /workflows/_search" type endpoint as a workaround, but I think it's OK to merge the current PR and move forward.

david4096 · 2018-07-09T18:19:39Z

I might suggest that filtering on this service is a nice-to-have, but unless there is a specific use case that requires it I would suggest leaving this as an optional upgrade a WES implementing service could provide. WES is not a system of record for provenance and even if the number of workflows is in the thousands, retrieving the entirety of the index of running workflows will be in the kilobytes, making client side filtering a breeze.
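As a sketch of the client-side filtering described above, assuming the client has already fetched the full list of runs: the field names "workflow_id", "state" and "tags" and the function name filter_runs are assumptions for illustration, not taken from the spec text in this thread.

```python
def filter_runs(runs, tags=None, state=None):
    """Return the runs that match the given state and contain every given tag.

    A tag matches only on an exact key-value pair; runs without tags
    match only when no tag filter is requested.
    """
    wanted = tags or {}
    return [
        run
        for run in runs
        if (state is None or run.get("state") == state)
        and all((run.get("tags") or {}).get(k) == v for k, v in wanted.items())
    ]
```

This is a linear scan per query, which fits the claim above that an index of a few thousand runs is small enough to filter on the client.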

jaeddy · 2018-07-09T18:30:31Z

Thanks, @david4096. I think we were hoping to get @mckinsel's input on whether this would be required for HCA (not sure about other Driver Projects at this point). I agree that paging and limits should work fine for most current use cases.

Originally, this PR was to clarify the "tag_search" parameter in the WES spec — maybe the better answer for now is to remove it?

dglazer · 2018-07-10T20:24:37Z

If none of our driver projects need search now, I agree that removing it is the right short-term answer.

(And if we do decide to keep it in, I'd like to confirm that whatever syntax we agree on meets the needs of Cromwell's existing search API users -- otherwise we're pretty much guaranteeing we'll have to rev it soon.)

geoffjentry · 2018-07-10T20:26:56Z

@dglazer if it helps, I'm fairly confident (read: I've been starting to argue for w/ an increasing amount of vigor) that Cromwell will need to change its search API in the foreseeable future. My $0.02 is to find something off the shelf which makes sense instead of going down the otherwise inevitable path of making a bespoke query language (the irony of that last statement is not lost on me)

dglazer · 2018-07-10T20:34:55Z

It does help, in that it makes me feel even more strongly that since WES and Cromwell are both inventing new search APIs, we should only invent it once. And since we'd probably like to take more time to get it right, leaving it out entirely from WES 1.0 would be ideal.

(Unless of course an existing driver project needs it -- then we can understand their needs, do something minimal, and expect it to change later.)

david4096 · 2018-07-11T04:49:57Z

Added this issue to remove tag_search: #66

mckinsel · 2018-07-12T03:13:41Z

I believe the HCA interest in filtering and querying is mostly driven by Job Manager UI, which presumably is relying on some intersection of the filtering capabilities of Cromwell and dsub. @dglazer, you may have more detail about that than anybody.

It doesn't sound like there's confidence that that intersection is something we'd want to write into an API standard at this point, so let's not. I assume we could have a situation where Cromwell implements WES 1.0 and a couple additional routes for JMUI, and more filtering eventually is included in WES when it's a little more fully baked.

geoffjentry · 2018-07-12T03:29:20Z

@mckinsel yeah i'd rather not codify JMUI needs as A Thing yet as i can already see it's going in a bad direction and For Now we can handle it w/ Cromwell. I do agree that there's a There There but we can discuss it after Basel (again, ideally w/ OTS solutions)

dglazer · 2018-07-12T12:41:11Z

Thanks @mckinsel for confirming that you only depend on filtering as implemented by JMUI. Given that, I agree with you and @geoffjentry that API standardization is premature.

In the future, I agree with Jeff that the Right Thing is to define a WES API that meets JMUI (and other) needs directly, and have that be the one way that Cromwell supports filtering.

geoffjentry · 2018-07-12T14:35:58Z

To phrase what I stated yesterday in a slightly less confusing way: a lot of the changes we've added to the Cromwell search API in the recent past have been specifically to support JMUI, but they also highlight why I think we need to start from scratch. So I think that JMUI makes for a good use case for a redesign, but the current JMUI/Cromwell interaction does not.

pgrosu · 2018-07-12T17:42:05Z

Who is JMUI, and what are the search projects being worked on? Is there a place where the specifications for such use cases are defined in order to better understand the deliverables?

Also was there a call yesterday I was not aware of? If there are meeting notes for that call that would be very helpful, in order to catch up on what was discussed.

Thanks,
Paul

geoffjentry · 2018-07-12T18:34:59Z

@pgrosu JMUI is the Job Manager UI from Data Biosphere. It was originally created by and for HCA (and thus something a driver project relies on).

There was no call yesterday, at least not one I was on.

pgrosu · 2018-07-12T19:17:04Z

Thanks Jeff (@geoffjentry) for the clarification - appreciate it,
Paul

Replace tag_search with tag_key and tag_value.

05ccb1d

Also add filtering by state.

rishidev mentioned this pull request Jun 11, 2018

Setup key-value filtering of workflow list #38

Closed

jaeddy requested changes Jun 15, 2018

denis-yuen mentioned this pull request Jun 15, 2018

TRS /tools filters dockstore/dockstore#1486

Closed

briandoconnor requested a review from mckinsel July 9, 2018 17:22

jaeddy approved these changes Jul 9, 2018

geoffjentry mentioned this pull request Jul 25, 2018

Checkpoint update to wes2cromwell broadinstitute/cromwell#3932

Merged

tetron closed this Jul 31, 2018

jaeddy deleted the filter-tags branch October 4, 2018 23:39
