
API v2 proposal: remove PinResults.count #86

Open
lidel opened this issue Feb 8, 2022 · 11 comments
Labels
dif/expert Extensive knowledge (implications, ramifications) required effort/weeks Estimated to take multiple weeks need/analysis Needs further analysis before proceeding need/community-input Needs input from the wider community P3 Low: Not priority right now topic/v2

Comments


lidel commented Feb 8, 2022

Prompted by ipfs/ipfs-webui#1900 – this is a BREAKING CHANGE for the v1 API, so marking it as something we can discuss if we ever want to create a v2.

PinResults.count is used for two things:

(1) Reading pin count per status (used in ipfs pin remote service ls --stat):

GET /pins?limit=1&status=queued
GET /pins?limit=1&status=pinning
GET /pins?limit=1&status=failed
GET /pins?limit=1&status=pinned

(2) Pagination, where it acts as an equivalent of more: true:

If the value in PinResults.count is bigger than the length of PinResults.results, the client can infer there are more results that can be queried.
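As a sketch, the count-based inference above can be written as follows (field names follow the spec's PinResults; the page object itself is illustrative):

```python
# Hedged sketch: how a v1 client infers "more pages" from PinResults.count.
def has_more_results(pin_results):
    """True if the service matched more pins than this page returned."""
    return pin_results["count"] > len(pin_results["results"])

page = {"count": 25, "results": [{"requestid": str(i)} for i in range(10)]}
# count (25) exceeds len(results) (10), so the client keeps paginating
```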

Problem

  • Calculating PinResults.count is expensive, especially when applying filters like cid, name or status
    • ipfs-desktop/webui checks CID status via GET /pins?cid=Qm1,Qm2,...,Qm10

Proposed change

  • remove PinResults.count
  • add PinResults.more (true/false bool)
  • add /service/pins/healthcheck that allows an inexpensive health check
  • add /service/pins/stats that returns total pin counts for each status
  • switch ecosystem to new fields
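Under the proposed more field, client pagination would reduce to a simple drain loop. A minimal sketch (the field names are the proposal's; everything else is hypothetical):

```python
def fetch_all(pages):
    """Drain an iterator of proposed v2 pages until `more` is False."""
    results = []
    for page in pages:
        results.extend(page["results"])
        if not page["more"]:
            break
    return results

# Two illustrative pages as a stand-in for successive GET /pins responses
pages = iter([
    {"more": True,  "results": [{"requestid": "1"}]},
    {"more": False, "results": [{"requestid": "2"}]},
])
```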

This would be a breaking change that requires v2.x.x of this spec and coordinated effort across the stack:
go-ipfs, js-ipfs, ipfs-webui, ipfs-desktop and Pinata.

Is it really breaking? Needs analysis.

I've been thinking a bit about a backward-compatible way to do this: we could make PinResults.count an optional field and always return 1, so that pagination and health checks in old clients still work.

👉 Before we consider doing this, the main question is how an old client in go-ipfs will react to an unexpected PinResults.more – if it is ignored and nothing crashes, then we have a better path forward.

If not, we would break our users, and need to coordinate go-ipfs/ipfs-webui/ipfs-desktop and Pinata to make the change around the same time to limit the damage. TBD if this is acceptable, considering the performance benefits in the long run.

cc @obo20 @aschmahmann

@lidel lidel added dif/expert Extensive knowledge (implications, ramifications) required effort/weeks Estimated to take multiple weeks need/analysis Needs further analysis before proceeding labels Feb 8, 2022
@lidel lidel changed the title BREAKING CHANGE(TBD) Proposal: remove PinResults.count BREAKING CHANGE proposal: remove PinResults.count Feb 8, 2022
@aschmahmann

If not, we would break our users, and need to coordinate go-ipfs/ipfs-webui/ipfs-desktop and Pinata to make the change around the same time to limit the damage. TBD if this is acceptable, considering the performance benefits in the long run.

Can we not just add support for a v2 (and maybe a versions) endpoint and avoid breaking changes here? IIRC when we previously discussed not using libp2p and multistream for protocol negotiation, but HTTP instead, the claim was that version negotiation would be similarly easy.

@lidel lidel changed the title BREAKING CHANGE proposal: remove PinResults.count API v2 proposal: remove PinResults.count Mar 30, 2022
@lidel lidel added topic/v2 P3 Low: Not priority right now need/community-input Needs input from the wider community labels Mar 30, 2022

lidel commented Mar 30, 2022

Yes, if we ever do this, then we would introduce this breaking change as v2 + a versioning scheme.
I've updated the title of this issue + created topic/v2 for changes like this one.

So far this proposal got no feedback from any pinning service – let's park it until there is interest and/or we gather more items under topic/v2.

@guseggert

Besides being expensive, looking at the count while paginating has problematic edge cases, e.g. pins being added/deleted while paginating.

If we're going to make a breaking change, we should consider using an opaque pagination token instead of the created timestamp, which makes fewer assumptions about the capabilities of the underlying DB and can include arbitrary info to help the DB paginate better.
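A minimal sketch of the opaque-token idea (all names here, PINS, list_pins, nextToken, are hypothetical, not part of the published spec):

```python
# The token is opaque to the client; here it happens to encode a list offset,
# but the server is free to change the encoding without breaking clients.
PINS = [{"requestid": str(i)} for i in range(5)]

def list_pins(limit, token=None):
    start = int(token) if token else 0
    page = PINS[start:start + limit]
    more = start + limit < len(PINS)
    return {"results": page, "nextToken": str(start + limit) if more else None}

# Client loop: echo the token back until the server stops returning one
collected, page = [], list_pins(limit=2)
while True:
    collected.extend(page["results"])
    if page["nextToken"] is None:
        break
    page = list_pins(limit=2, token=page["nextToken"])
```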


obo20 commented Jun 16, 2022

@lidel just now seeing this and I definitely support this change. Count is super expensive for larger pinsets and can cause a lot of issues at scale.

I'm in favor of moving towards a created-timestamp paradigm here. We had to switch our normal pinList endpoint to this paradigm, as the previous method, which required counting, was becoming extremely expensive for larger users.


obo20 commented Jun 17, 2022

@lidel I would also add here that getting rid of count would allow us to raise our rate limits, which is something I know the PL team / ipfs-desktop users have been asking for on this specific endpoint.

@SgtPooki

I think we should remove count, and use an opaque token like "nextToken". Gus said it best:

If we're going to make a breaking change, we should consider using an opaque pagination token instead of the created timestamp, which [the pagination token] makes fewer assumptions about the capabilities of the underlying DB and can include arbitrary info to help the DB paginate better.


obo20 commented Jun 29, 2022

We could pretty easily implement things right away if we did time-based pagination, as we already switched the rest of our application over to doing this, but I'm guessing opaque-token pagination might be a much bigger lift.

@SgtPooki @guseggert could you both elaborate on what you have in mind here? Any example articles you could point to?

@SgtPooki

@obo20 absolutely. First here's a quick summary in my own words:

I think of pagination tokens like nextToken as a sort of ephemeral API key for a search/query. Basically, it's a pointer (see: cursor in cursor-based pagination) to a certain record. You could map this back to a time-based token if that's how your service is implemented. When you receive a request with a nextToken, you know exactly where to start searching, instead of always searching/filtering through all records for each query.

Those records can be sorted/filtered however you wish, and you can encode any sorting/filtering/pages/users/etc. within that token. This flexibility is great and does come with drawbacks that I've personally experienced and complained about plenty during my time at Amazon, but for service owners it gives you a lot more control over how your resources are consumed, no matter how many consumers or objects you have.

I believe @guseggert implemented more than a few APIs at Amazon where we were required to use this pagination strategy when DynamoDB was our backend (it was most of the time), so he can correct me or speak further to the details if you need more info from someone who's got more experience.

Below are some great articles I've found that should help the discussion here for anyone interested. The writeup by Slack is a fantastic read that goes into the pros and cons of many approaches.


Apollo GraphQL

Not as good as the others, but informative in a basic sense. Very specific to apollographql

Since numeric offsets within paginated lists can be unreliable, a common improvement is to identify the beginning of a page using some unique identifier that belongs to each element of the list.

If the list represents a set of elements without duplicates, this identifier could simply be the unique ID of each object, allowing additional pages to be requested using the ID of the last object in the list, together with some limit argument. With this approach, the requested cursor ID should not appear in the new page, since it identifies the item just before the beginning of the page.

--- https://www.apollographql.com/docs/react/pagination/cursor-based/

hackernoon article

Great writeup covering unidirectional and bidirectional pagination from the view of an engineer.

For unidirectional pagination, my preferred approach is to use a simple cursor. The important detail here is to make the cursor meaningless.

As far as the client is concerned, it’s just a blob the server returns in the response when there are more results to fetch. The client shouldn’t be able to derive any implementation details from it, and the only thing it can afford to do with this cursor is to send it along in the next request.

--- https://hackernoon.com/guys-were-doing-pagination-wrong-f6c18a91b232

Slack

An extremely thorough writeup of slack's considerations when upgrading their pagination API

Cursor-based pagination works by returning a pointer to a specific item in the dataset. On subsequent requests, the server returns results after the given pointer. This method addresses the drawbacks of using offset pagination, but does so by making certain trade offs:

  • The cursor must be based on a unique, sequential column (or columns) in the source table.
  • There is no concept of the total number of pages or results in the set.
  • The client can’t jump to a specific page.

--- https://slack.engineering/evolving-api-pagination-at-slack/
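The "unique, sequential column" approach Slack describes is often called keyset pagination. A small self-contained sketch with SQLite (the table and column names are made up):

```python
import sqlite3

db = sqlite3.connect(":memory:")
db.execute("CREATE TABLE pins (id INTEGER PRIMARY KEY, cid TEXT)")
db.executemany("INSERT INTO pins (id, cid) VALUES (?, ?)",
               [(i, "Qm%d" % i) for i in range(1, 8)])

def page_after(cursor_id, limit):
    # WHERE id > ? seeks straight to the cursor instead of scanning an OFFSET,
    # so the cost is independent of how deep into the result set the client is
    rows = db.execute(
        "SELECT id, cid FROM pins WHERE id > ? ORDER BY id LIMIT ?",
        (cursor_id, limit)).fetchall()
    next_cursor = rows[-1][0] if rows else None
    return rows, next_cursor

rows, cur = page_after(0, 3)    # ids 1, 2, 3
rows, cur = page_after(cur, 3)  # ids 4, 5, 6
```

Note the trade-offs from the quote apply directly: the cursor (the last id served) must be unique and ordered, and there is no total count to jump pages with.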


guseggert commented Jul 13, 2022

At a high level, you can encode whatever state you need to fetch the next page in the pagination token (you might want to encrypt it too, so clients can't reverse it). E.g. for a relational database, you can encode a JSON object like {"limit":20,"offset": 100}, for key-value stores that paginate with their own tokens, use that token. Some folks need to add extra data like a timestamp, so that the token will eventually expire.

Concretely, GET /pins would return something like:

{
    "nextToken": "eyJsaW1pdCI6MjAsIm9mZnNldCI6IDEwMH0K",
    "results": [...] 
}

and then the subsequent request would be like GET /pins?nextToken=eyJsaW1pdCI6MjAsIm9mZnNldCI6IDEwMH0K, repeat until you decide to stop or nextToken is null.

The downside from a client's perspective is that you can't parallelize queries. That's an upside from a server's perspective, and generally a desirable thing for a multitenant API. But I'm not sure how this would impact existing impls.

Essentially though, I think we should decouple the query params / filters from the pagination mechanism, so that we don't need to make assumptions about the query capabilities of the DB in order to paginate, and the most general way that I know of to do that is with an opaque token.
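For reference, the nextToken in the example above is just base64 of server-side paging state. A sketch of minting and reading such a token (unsigned here, which a real service shouldn't do):

```python
import base64
import json

token = "eyJsaW1pdCI6MjAsIm9mZnNldCI6IDEwMH0K"
state = json.loads(base64.b64decode(token))
# state is {"limit": 20, "offset": 100}: exactly what the server needs to
# resume the query, and meaningless to a well-behaved client

def mint_token(state):
    # Real services should sign or encrypt this so clients can't forge or
    # reverse-engineer it; plain base64 only obscures, it doesn't protect.
    return base64.b64encode(json.dumps(state).encode()).decode()
```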


obo20 commented Aug 3, 2022

@SgtPooki @lidel did we want to make official the decisions from ipfs-thing?


lidel commented Aug 5, 2022

iirc we were discussing backward-compatible ways of removing the cost of returning exact count value to avoid breaking existing clients.

Removing or changing the type of count will break existing clients that use it for pagination (docs). However, if we keep it but relax the requirement around its value to something like "equal to results, or results+1 (or MAX_INT?) to indicate there are more records", then you no longer need to pay the cost of calculating the exact size.

Next steps here (I have no bandwidth for this, but happy to review PRs):

  1. propose a PR against this repo that updates the Pagination-and-filtering section, making it clear pinning services can return an approximate count (like results+1) when the cost of calculating the real one is too high.
  2. test if https://github.com/ipfs/go-pinning-service-http-client (used by Kubo) will correctly paginate if count value is no longer exact.
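A sketch of the relaxed-count idea (names illustrative): the service returns len(results) + 1 when more rows exist, which keeps the v1 client's count > len(results) check working without computing an exact total.

```python
def make_page(results, has_more):
    # Approximate count: exact on the last page, results+1 otherwise
    count = len(results) + 1 if has_more else len(results)
    return {"count": count, "results": results}

page = make_page([{"requestid": "1"}], has_more=True)
# page["count"] (2) > len(results) (1), so an old client still paginates
```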
