Support _total parameter in search requests #1353

punktilious · 2020-07-23T12:49:27Z

Is your feature request related to a problem? Please describe.
https://www.hl7.org/fhir/search.html#total

The _total parameter is trial-use, but could be useful in cases where the client wants to improve performance by bypassing the resource count calculation (which requires a separate database query).

Per the spec:

none        There is no need to populate the total count; the client will not use it
estimate    A rough estimate of the number of matching resources is sufficient
accurate	   The client requests that the server provide an exact total of the number of matching resources

Team Discussion:
In our implementation we are only going to support 'estimate' as 'accurate'

Describe the solution you'd like
Implement _total as described in the spec.

Describe alternatives you've considered
Ignore _total and rely on existing count query implementation.

Additional context
N/A

The text was updated successfully, but these errors were encountered:

prb112 · 2021-03-01T15:02:43Z

Added the label to bulkdata, it would cut the queries in nearly a half. For a 12K export that would mean about 24 queries at 100ms each is 2.4 seconds of saved time. In very large environments, this adds up.
More specifically this benefits PatientChunkReader

lmsurpre · 2021-03-05T14:54:21Z

We may want to add support for this _total parameter on the history operations as well. For whole-system history, we think it should continue to default to no total, because that is the current behavior and its the only way to make it performant enough for a "changes feed".

tbieste · 2021-03-09T23:12:09Z

There is a little bit of validation with paging and the calculation of the next page link that detects if a request would be beyond the last page. If the _total is not "accurate", then if we skip the total page calculation to save on performance, then we would need to ensure that requests beyond the last page behave in a reasonable matter, since we would no longer know which page is the last without having the total.

If we don't have the total, then we don't know if we are on the last page. Today, if we know we are on the last page, we don't return a "next" link. Without the total, seems like we would always return it if the current page is full (or maybe just always).
If we don't have the total, then we don't know if the caller is requesting a page beyond the last page. As long as the internal query can handle it, I think we would just try to run the query and get nothing back. Unless there's some problem with requesting _page=12345678, when there's only 1 total record.

tbieste · 2021-03-10T22:40:56Z

I dug into it a bit more.

If the total is known, then the _page automatically gets reduced back to the last page if you pick too big of a number. It generates an OperationOutcome for it, but just throws it away.
If we did not do this, then the OFFSET used in the internal query would just return no results back, so that looks good.

tbieste · 2021-03-11T16:43:01Z

In FHIRPersistenceJDBCImpl.search(…) the total is used to determine matched vs included, if the _include/_revcinclude parameters are used. So may need to disallow _total in combination with _include/_revinclude.

tbieste · 2021-03-11T17:38:58Z

I think there would be a way out needing to do the match vs included calculation at all by adding "includeType" to the ResourceDTO since the underlying query does use a SORT_ORDER specifically for that. If it's worth doing, that is.

Signed-off-by: Troy Biesterfeld <tbieste@us.ibm.com>

prb112 · 2021-03-11T20:57:04Z

There is a little bit of validation with paging and the calculation of the next page link that detects if a request would be beyond the last page. If the _total is not "accurate", then if we skip the total page calculation to save on performance, then we would need to ensure that requests beyond the last page behave in a reasonable matter, since we would no longer know which page is the last without having the total.

I think we would always return a next page blindly.
For BulkData I would run a total=accurate enabled search, and switch to _total=none, and control the paging in the bulkdata code.

1. If we don't have the total, then we don't know if we are on the last page. Today, if we know we are on the last page, we don't return a "next" link.  Without the total, seems like we would always return it if the current page is full (or maybe just always).

We could just return the next page, and the next page would be empty.

2. If we don't have the total, then we don't know if the caller is requesting a page beyond the last page. As long as the internal query can handle it, I think we would just try to run the query and get nothing back.  Unless there's some problem with requesting _page=12345678, when there's only 1 total record.

👍🏻 I like this approach.

I dug into it a bit more.

1. If the total is known, then the _page automatically gets reduced back to the last page if you pick too big of a number.  It generates an OperationOutcome for it, but just throws it away.

Interesting, did not know we did that....

2. If we did not do this, then the OFFSET used in the internal query would just return no results back, so that looks good.

👍🏻

In FHIRPersistenceJDBCImpl.search(…) the total is used to determine matched vs included, if the _include/_revcinclude parameters are used. So may need to disallow _total in combination with _include/_revinclude.

👍🏻 I totally agree on the disallowed combination and we'd just update our conformance.md

I think there would be a way out needing to do the match vs included calculation at all by adding "includeType" to the ResourceDTO since the underlying query does use a SORT_ORDER specifically for that. If it's worth doing, that is.
It's intriguing. It may be beyond the scope here?

tbieste · 2021-03-11T21:06:16Z

Since there's an issue for allowing the currently-disallowed combination of _sort + _include/_revinclude, #1915, I think deferring this as well. Then the combination of _total + _include/_revinclude and be considered a future enhancement. I opened issue #2070 to size and prioritize that enhancement.

Signed-off-by: Troy Biesterfeld <tbieste@us.ibm.com>

Issue #1353 - Add support for _total search parameter

punktilious · 2021-03-23T18:43:49Z

Verified:

?_total=nonsense             error, as expected
?_total=                     error, as expected
?_total=none                 no total
?_total=estimated            total calculated and included in bundle
?_total=accurate             total calculated and included in bundle
?_total=none&_include...     error as expected
?_total=accurate&_include... error, but ought to succeed. Can be addressed in #2070

punktilious added performance performance trial-use search labels Jul 23, 2020

prb112 added the enhancement New feature or request label Oct 12, 2020

lmsurpre mentioned this issue Nov 6, 2020

Observation search performing slowly in PostgreSQL on IBM Cloud #1673

Closed

prb112 added the bulk-data label Mar 1, 2021

lmsurpre added the P2 Priority 2 - Should Have label Mar 1, 2021

lmsurpre mentioned this issue Mar 4, 2021

Support resource change tracking with custom operation to fetch list of changes since a known point #1955

Closed

tbieste self-assigned this Mar 9, 2021

tbieste added this to the Sprint 2021-04 milestone Mar 9, 2021

tbieste added a commit that referenced this issue Mar 11, 2021

Issue #1353 - Start implementing _total search parameter

cdbbd81

Signed-off-by: Troy Biesterfeld <tbieste@us.ibm.com>

tbieste added a commit that referenced this issue Mar 11, 2021

Issue #1353 - Add support to _total search parameter

1755ca2

Signed-off-by: Troy Biesterfeld <tbieste@us.ibm.com>

tbieste added a commit that referenced this issue Mar 11, 2021

Issue #1353 - Add support to _total search parameter

9a5cb90

Signed-off-by: Troy Biesterfeld <tbieste@us.ibm.com>

tbieste added a commit that referenced this issue Mar 11, 2021

Issue #1353 - Add support for _total search parameter

27b7230

Signed-off-by: Troy Biesterfeld <tbieste@us.ibm.com>

tbieste mentioned this issue Mar 11, 2021

Issue #1353 - Add support for _total search parameter #2071

Merged

tbieste added a commit that referenced this issue Mar 12, 2021

Issue #1353 - Add support for _total search parameter

28ad7e0

Signed-off-by: Troy Biesterfeld <tbieste@us.ibm.com>

tbieste added a commit that referenced this issue Mar 12, 2021

Issue #1353 - Add support for _total search parameter

06a5284

Signed-off-by: Troy Biesterfeld <tbieste@us.ibm.com>

tbieste added a commit that referenced this issue Mar 12, 2021

Issue #1353 - Add support for _total search parameter

4df476b

Signed-off-by: Troy Biesterfeld <tbieste@us.ibm.com>

tbieste added a commit that referenced this issue Mar 12, 2021

Issue #1353 - Add support for _total search parameter

d547f26

Signed-off-by: Troy Biesterfeld <tbieste@us.ibm.com>

lmsurpre mentioned this issue Mar 15, 2021

Conditional reference while processing transaction bundle #1329

Closed

tbieste added a commit that referenced this issue Mar 15, 2021

Issue #1353 - Updates after code review

57c3f37

Signed-off-by: Troy Biesterfeld <tbieste@us.ibm.com>

tbieste added a commit that referenced this issue Mar 15, 2021

Issue #1353 - Updates after code review

c40b8e1

Signed-off-by: Troy Biesterfeld <tbieste@us.ibm.com>

tbieste added a commit that referenced this issue Mar 16, 2021

Issue #1353 - Updates after code review

a2e349a

Signed-off-by: Troy Biesterfeld <tbieste@us.ibm.com>

tbieste added a commit that referenced this issue Mar 16, 2021

Merge pull request #2071 from IBM/tbieste-issue-1353

0636671

Issue #1353 - Add support for _total search parameter

punktilious closed this as completed Mar 23, 2021

tbieste added the showcase Used to Identify End-of-Sprint Demos label Apr 20, 2021

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Support _total parameter in search requests #1353

Support _total parameter in search requests #1353

punktilious commented Jul 23, 2020 •

edited by prb112

Loading

prb112 commented Mar 1, 2021

lmsurpre commented Mar 5, 2021

tbieste commented Mar 9, 2021 •

edited

Loading

tbieste commented Mar 10, 2021

tbieste commented Mar 11, 2021

tbieste commented Mar 11, 2021

prb112 commented Mar 11, 2021

tbieste commented Mar 11, 2021 •

edited

Loading

punktilious commented Mar 23, 2021 •

edited

Loading

Support _total parameter in search requests #1353

Support _total parameter in search requests #1353

Comments

punktilious commented Jul 23, 2020 • edited by prb112 Loading

prb112 commented Mar 1, 2021

lmsurpre commented Mar 5, 2021

tbieste commented Mar 9, 2021 • edited Loading

tbieste commented Mar 10, 2021

tbieste commented Mar 11, 2021

tbieste commented Mar 11, 2021

prb112 commented Mar 11, 2021

tbieste commented Mar 11, 2021 • edited Loading

punktilious commented Mar 23, 2021 • edited Loading

punktilious commented Jul 23, 2020 •

edited by prb112

Loading

tbieste commented Mar 9, 2021 •

edited

Loading

tbieste commented Mar 11, 2021 •

edited

Loading

punktilious commented Mar 23, 2021 •

edited

Loading