Add an experimental service with bulk load APIs. #59
Conversation
// BulkLoadRelationshipsResponse is returned on successful completion of the
// bulk load stream, and contains the total number of relationships loaded.
message BulkLoadRelationshipsResponse {
  uint64 num_loaded = 1;
Is the order guaranteed? Can I figure out which rels haven't been loaded if I get back a number that is half of my request? I guess not, since these are streaming requests?
Right now, all of the implementations are transactional. Yes, they are in order, and you will get back either a response counting ALL of the relationships that you fed the server, or an error with 0. That's up for debate as an implementation detail, though. We could auto-chunk into smaller transactions and make this number represent what you're suggesting, but we never made a decision on auto-chunking.
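To make that contract concrete, here's a minimal client-side sketch in Go. It assumes generated gRPC stubs; the package path, `ExperimentalServiceClient`, and the `Relationships`/`NumLoaded` field names are placeholders, not the final generated names:

```go
package main

import (
	"context"
	"fmt"

	v1alpha1 "example.com/authzed/api/v1alpha1" // hypothetical generated package
	"google.golang.org/grpc"
)

// bulkLoad streams every batch, closes the stream, and checks that
// num_loaded accounts for everything sent. With the current
// transactional implementations, a successful response means ALL
// relationships were committed; an error means none were.
func bulkLoad(ctx context.Context, conn *grpc.ClientConn, batches [][]*v1alpha1.Relationship) error {
	client := v1alpha1.NewExperimentalServiceClient(conn)

	stream, err := client.BulkLoadRelationships(ctx)
	if err != nil {
		return err
	}

	var sent uint64
	for _, batch := range batches {
		if err := stream.Send(&v1alpha1.BulkLoadRelationshipsRequest{Relationships: batch}); err != nil {
			return err
		}
		sent += uint64(len(batch))
	}

	resp, err := stream.CloseAndRecv()
	if err != nil {
		return err // transactional: nothing was committed
	}
	if resp.NumLoaded != sent {
		return fmt.Errorf("expected %d relationships loaded, got %d", sent, resp.NumLoaded)
	}
	return nil
}
```

The all-or-nothing check at the end only holds while the implementations stay transactional; if we ever auto-chunk, `NumLoaded` would become a progress count instead.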
Force-pushed from 2bbdca7 to f2c6de6.
// BulkLoadRelationshipsRequest represents one batch of the streaming
// BulkLoadRelationships API. The maximum size is unlimited, but optimal size
// should be determined by the calling client.
message BulkLoadRelationshipsRequest {
> The maximum size is unlimited
I think we should still set a limit
How does the client figure out the optimal size?
Is the datastore going to make proper batch sizes for the most efficient writes? If so, maybe it's ideal that this is a multiple of that number.
Anyway, there should be some limit here to prevent massive request memory usage/DoS.
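To illustrate, a hedged sketch of what server-side enforcement might look like; `maxBatchSize`, the generated stream interface, and the embedded `Unimplemented...` type are all assumptions, not anything proposed in this PR:

```go
package main

import (
	"io"

	v1alpha1 "example.com/authzed/api/v1alpha1" // hypothetical generated package
	"google.golang.org/grpc/codes"
	"google.golang.org/grpc/status"
)

// maxBatchSize is a placeholder; the actual cap would still need to be chosen.
const maxBatchSize = 10_000

type experimentalServer struct {
	v1alpha1.UnimplementedExperimentalServiceServer // hypothetical embed
}

func (s *experimentalServer) BulkLoadRelationships(stream v1alpha1.ExperimentalService_BulkLoadRelationshipsServer) error {
	var loaded uint64
	for {
		req, err := stream.Recv()
		if err == io.EOF {
			// End of stream: report the total committed.
			return stream.SendAndClose(&v1alpha1.BulkLoadRelationshipsResponse{NumLoaded: loaded})
		}
		if err != nil {
			return err
		}
		// Reject oversized batches up front instead of buffering them.
		if len(req.Relationships) > maxBatchSize {
			return status.Errorf(codes.InvalidArgument,
				"batch of %d relationships exceeds the limit of %d",
				len(req.Relationships), maxBatchSize)
		}
		// ... write the batch to the datastore here ...
		loaded += uint64(len(req.Relationships))
	}
}
```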
I think the issue is that the number is going to vary wildly based on:
- datastore choice
- datastore scaling parameters
- compressibility of the data on the wire
- how many concurrent writers we're using
- how far we are from the receiving servers
This is why I left it as "an exercise for the reader": I don't think there's any way to arrive at those numbers except experimentally. Setting any kind of limit would just cap our throughput in many cases for no reason.
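And since it's experimental by nature, the tuning can live entirely client-side. Something as small as this chunking helper (a sketch using Go generics; `chunk` is not part of the proposed API) would let operators sweep the batch size against their own deployment and keep whatever is fastest:

```go
package main

// chunk splits a large load into batches of a caller-chosen size.
// Because the optimal size depends on all of the factors listed above,
// batchSize is meant to be tuned experimentally, not fixed by the API.
func chunk[T any](items []T, batchSize int) [][]T {
	if batchSize <= 0 {
		return [][]T{items} // degenerate case: send everything as one batch
	}
	var batches [][]T
	for len(items) > batchSize {
		batches = append(batches, items[:batchSize])
		items = items[batchSize:]
	}
	if len(items) > 0 {
		batches = append(batches, items)
	}
	return batches
}
```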
I think we shouldn't say "unlimited" here then; "depends on the datastore" might be better
// subject.object.object_id, subject.optional_relation)
//
// EXPERIMENTAL
rpc BulkLoadRelationships(stream BulkLoadRelationshipsRequest)
I think it might be worth requiring that any experimental API have a referenced GitHub issue, placed here.
Force-pushed from f2c6de6 to 7f33957.
LGTM