Implement Batching #99

lukasgraf · 2016-05-19T09:52:32Z

Implements batching for

Plone Site Root children
Dexterity Folder
Archetypes Folders
Collections
Search Results

Also changes implementation for GET on folderish contexts so catalog queries are used instead of objectValues().

TODO:

Documentation
Use Plone's default batch size as the default (turns out Plone doesn't have a configurable default batch sizes. It seems to be hard coded to 25 in most places, so I changed the default in plone.restapi.batching to that as well)
Rename items_count to total_items
Refactor tests

Closes #8

lukasgraf · 2016-05-19T14:27:38Z

@tisto this one is ready for review

lukasgraf · 2016-05-20T10:25:58Z

@buchi fixed some flaky tests and rebased onto master - please review

lukasgraf · 2016-05-20T13:50:04Z

@buchi updated to not have HypermediaBatch inherit from Batch - it was only supposed to proxy to a batch.

buchi · 2016-05-21T16:35:31Z

Still need some time for the review. Will do asap.

@lukasgraf PR needs rebase

lukasgraf · 2016-05-21T22:23:22Z

@buchi rebased

tisto · 2016-05-22T11:03:16Z

docs/source/_json/batching.json

+content-type: application/json
+
+{
+  "@id": "http://localhost:55001/plone/folder/@search?sort_on=path", 


@lukasgraf if the request is on "/folder/@search?b_size=5&sort_on=path", why is the @id equals "/folder/@search?sort_on=path" (without the "b_size")? Shouldn't this be either the base request or the full request with all params?

@tisto no, this is supposed to be the canonical ID to the base collection without any batching information (loosely based on https://www.w3.org/community/hydra/wiki/Collection_Design#Pagination / https://www.w3.org/community/hydra/wiki/Pagination#PartialCollection). The ID for the current batch is in batching/@id.

I guess the sort_on parameter doesn't really affect the collection's contents, just their ordering. But if you consider other query string parameters for @search (or possibly other endpoints), the query string params can directly affect what resource will be returned, so I'm preserving them and just stripping batching related params for the canonical resource URL.

Right. My feeling is that we should remove the sort_on param from the canonical top-level id as well. Do we have any hypermedia controls for changing the sort order? Have you looked into that?

@tisto I briefly looked into it, but couldn't find any spec for hypermedia controls to specify sort order that are standarized in any sense. Updated HypermediaBatch.canonical_url to also remove sorting related params.

buchi · 2016-05-22T13:32:12Z

Maybe we should rather call it pagination or paging. Batching is often used in the context of performing bulk operations. Any thoughts?

buchi · 2016-05-22T13:34:49Z

What about including total_items in batching instead of top-level?

buchi · 2016-05-22T13:37:01Z

docs/source/batching.rst

+        "next": "http://.../plone/folder/search?b_size=10&b_start=30"
+      },
+      "total_items": 175,
+      "member": [


has been renamed to items

Ah, well spotted. I will do one more pass for member -> items.

lukasgraf · 2016-05-22T13:38:32Z

The term "pagination" is definitely much more common outside the Plone world. Since we're doing a rather low-level API that exposes many of the implementation details already I decided to stick with Plone's naming.

I wouldn't be opposed to naming it "pagination", but calling it that and still using the b_size and b_start parameters would be a bit odd. (And we should keep using those IMHO, because they cause some implicit magic to happen under the hood that we would otherwise need to reimplement).

lukasgraf · 2016-05-22T13:47:57Z

Re: moving total_items inside the batching info dict: I also considered that. I ended up going with a top-level attribute that was in an example in one of the earlier design documents, but it would make more sense to move it inside the batching info dict. 👍 I'm in favor of moving it, any objections @tisto?

tisto · 2016-05-22T16:22:55Z

@buch @lukasgraf the rationale behind the top-level total_items is that all items inside "batching" are supposed to be hypermedia controls that the client just exposes 1:1 to the user. If we start adding more items there we will end up with tight coupling between client and server because the client needs to know about the exact structure and names of the API.

lukasgraf · 2016-05-22T16:26:20Z

@tisto I see - we'll leave it like that then. The client shouldn't be relying on total_items for page calculations anyway, because it can only ever be approximate (resultset may change during pagination).

tisto · 2016-05-22T16:33:02Z

docs/source/_json/search.json

@@ -19,5 +25,5 @@ content-type: application/json
      "title": "Test Folder"
    }
  ], 
-  "items_count": 2
+  "total_items": 2


@lukasgraf I'm wondering if it would make sense to rename "total_items" to "items_total". This would make "items_total" show up right after "items". Does not make any difference for the consumer app but for a developer browsing the API.

We could, I don't have any strong feelings about either one ;) Regarding developer-friendly representations though, we might want to look into using OrderedDicts at some point. If given an ordered dict, json.dumps() will preserve the order of the keys. Structurally the order of keys in a dict still has no meaning of course, but that might make it easier to produce JSON that is layed out in a human-friendly way.

buchi · 2016-05-23T09:01:45Z

Right now the batching information is included even when the items count doesn't exceed the batch size.
IMO opinion it doesn't make sense to provide batching links to the first or last batch in this case. A js client will unnecessarily have to check if it should display those links. I would propose to return an empty batching object instead or to not include the batching at all. @tisto @lukasgraf opinions?

lukasgraf · 2016-05-23T09:45:50Z

@buchi sounds good to me. Though if we were to still include the batching object, it probably shouldn't be completely empty, it would still need the @id I believe, otherwise JSON-LD processors would drop it alltogether. So maybe just leave it out entirely?

lukasgraf · 2016-05-24T08:25:02Z

Can we get a consensus on renaming total_items to items_total and omitting the batching dict if len(resultset) <= batch_size? I don't feel strongly about either point, but I'd like to prevent this PR from stalling and move forward towards a first alpha.

tisto · 2016-05-24T08:28:37Z

@lukasgraf +1

lukasgraf · 2016-05-24T11:18:13Z

@tisto @buchi updated:

Total item count is now called items_total
Omit batching links object entirely if resultset isn't batched

I also rebased onto master and squashed my commits in a way that some back and forth changes were eliminated.

The LazyCatalogResultSerializer has been removed in #99 without a pressing need to do so. Therefore we re-introduce it here and make it handle the batching logic that was previously left to the SearchHandler.

tisto added the in progress label May 19, 2016

lukasgraf force-pushed the batching branch from eb8b040 to 151f973 Compare May 19, 2016 14:22

lukasgraf force-pushed the batching branch 4 times, most recently from f9d2bcc to 1541bcd Compare May 20, 2016 10:00

lukasgraf force-pushed the batching branch 2 times, most recently from 4a7630a to 7585eb7 Compare May 20, 2016 13:49

lukasgraf mentioned this pull request May 21, 2016

First alpha release #111

Closed

3 tasks

lukasgraf force-pushed the batching branch from 7585eb7 to e43a274 Compare May 21, 2016 22:04

tisto reviewed May 22, 2016
View reviewed changes

buchi reviewed May 22, 2016
View reviewed changes

tisto reviewed May 22, 2016
View reviewed changes

Use catalog queries to serialize folders instead of objectValues().

bf146b0

lukasgraf added 9 commits May 24, 2016 12:27

Sort on getObjPositionInParent when listing children.

be863f4

Implement batching for search results.

d5e84ca

Implement batching for collections.

c7e18b9

Implement batching for Dexterity folders.

e2403f6

Implement batching for Archetypes folders.

eeec6a5

Implement batching for Plone Site Root children.

458adba

Add documentation for batching.

d243591

Batching: Remove sorting related params from canonical URL as well.

a9096d9

Omit batching links if resultset fits into a single batch page.

581671d

lukasgraf force-pushed the batching branch from 7678838 to 581671d Compare May 24, 2016 11:09

tisto merged commit 1423a7d into master May 24, 2016

tisto deleted the batching branch May 24, 2016 11:22

tisto removed the in progress label May 24, 2016

lukasgraf mentioned this pull request Jun 6, 2016

Reintroduce LazyCatalogResultSerializer with batching logic #118

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Implement Batching #99

Implement Batching #99

lukasgraf commented May 19, 2016 •

edited

lukasgraf commented May 19, 2016

lukasgraf commented May 20, 2016

lukasgraf commented May 20, 2016 •

edited

buchi commented May 21, 2016

lukasgraf commented May 21, 2016

tisto May 22, 2016

lukasgraf May 22, 2016 •

edited

tisto May 22, 2016

lukasgraf May 22, 2016

buchi commented May 22, 2016

buchi commented May 22, 2016

buchi May 22, 2016

lukasgraf May 22, 2016

lukasgraf commented May 22, 2016

lukasgraf commented May 22, 2016

tisto commented May 22, 2016

lukasgraf commented May 22, 2016

tisto May 22, 2016

lukasgraf May 22, 2016

buchi commented May 23, 2016

lukasgraf commented May 23, 2016

lukasgraf commented May 24, 2016

tisto commented May 24, 2016

lukasgraf commented May 24, 2016

Implement Batching #99

Implement Batching #99

Conversation

lukasgraf commented May 19, 2016 • edited

lukasgraf commented May 19, 2016

lukasgraf commented May 20, 2016

lukasgraf commented May 20, 2016 • edited

buchi commented May 21, 2016

lukasgraf commented May 21, 2016

tisto May 22, 2016

Choose a reason for hiding this comment

lukasgraf May 22, 2016 • edited

Choose a reason for hiding this comment

tisto May 22, 2016

Choose a reason for hiding this comment

lukasgraf May 22, 2016

Choose a reason for hiding this comment

buchi commented May 22, 2016

buchi commented May 22, 2016

buchi May 22, 2016

Choose a reason for hiding this comment

lukasgraf May 22, 2016

Choose a reason for hiding this comment

lukasgraf commented May 22, 2016

lukasgraf commented May 22, 2016

tisto commented May 22, 2016

lukasgraf commented May 22, 2016

tisto May 22, 2016

Choose a reason for hiding this comment

lukasgraf May 22, 2016

Choose a reason for hiding this comment

buchi commented May 23, 2016

lukasgraf commented May 23, 2016

lukasgraf commented May 24, 2016

tisto commented May 24, 2016

lukasgraf commented May 24, 2016

lukasgraf commented May 19, 2016 •

edited

lukasgraf commented May 20, 2016 •

edited

lukasgraf May 22, 2016 •

edited