Do not buffer entire body in Http Cache #4076

Merged · 16 commits into main · May 11, 2022

Conversation

@BoD (Contributor, author) commented May 3, 2022

See #3981

@netlify bot commented May 3, 2022

Deploy Preview for apollo-android-docs canceled (latest commit 127256a; deploy log: https://app.netlify.com/sites/apollo-android-docs/deploys/627a87465712b60009facd39).

Review thread on this hunk:

    }

    if (read == -1L) {
      // We've read fully, commit the cache edit

@BoD (author):

Not 100% sure this is needed.

@martinbonnin:

The DeferJvmTest tests still pass for me without the closeAndCommitCache(). Can you add one that fails?

@martinbonnin:

Did we ever find the reason behind this?

@BoD (author):

Sorry, I thought your previous comment was about the test passing without committing because it wasn't checking the presence of data in the cache, but now I'm wondering if you meant something else 😅

@martinbonnin:

Let me try again

@BoD (author):

Ah, got it (I thought you were commenting on the inside of closeAndCommitCache).

This path can be reached when an incomplete response is fully read: the JSON reader pulls more bytes, but there aren't any, so originalSource.read returns -1. In that case this bit of code commits early, but:

  1. it's actually not useful, because close will still be called after the parse exception occurs (and close commits at that point)
  2. I'm not 100% sure we actually want to commit in that case. On the one hand, it would be consistent with the current behavior (currently, the data is cached before it's parsed). On the other hand, if we know for sure that -1 here means an incomplete response, it may be nice to avoid caching it.

WDYT?
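
For context, a minimal sketch of the kind of teeing Source being discussed; CacheWritingSource, cacheSink, and commit are assumed names for illustration, not the PR's actual code:

    import okio.Buffer
    import okio.BufferedSink
    import okio.Source
    import okio.Timeout

    // Sketch only: stream the network body to the caller while copying the
    // same bytes into the HTTP cache, instead of buffering the whole body.
    class CacheWritingSource(
        private val originalSource: Source,
        private val cacheSink: BufferedSink,
        private val commit: () -> Unit, // assumed hook into the cache editor
    ) : Source {
      override fun read(sink: Buffer, byteCount: Long): Long {
        val read = originalSource.read(sink, byteCount)
        if (read == -1L) {
          // EOF: the caller has pulled all available bytes. This is the branch
          // discussed above; committing here is redundant since close() commits.
          return -1L
        }
        // Tee the bytes that were just appended to `sink` into the cache
        sink.copyTo(cacheSink.buffer, sink.size - read, read)
        return read
      }

      override fun close() {
        cacheSink.close()
        commit()
        originalSource.close()
      }

      override fun timeout(): Timeout = originalSource.timeout()
    }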

@martinbonnin:

Oh gosh, this is hard. There are 2 error cases:

  • truncated JSON with a 200 response, as you mentioned above. I think this is very unlikely to happen, but it's still something we need to handle
  • valid JSON with a 200 response containing GraphQL errors.

Ideally we don't want to HTTP-cache those, but there's no way to know this at the HTTP level. For now I'd suggest we cache everything that is read fully. The only way I can think of right now is something along these lines:

    override fun close() {
      if (!hasClosedAndCommitted) {
        try {
          sink.close()
          if (originalSource.read(Buffer(), 1) == -1L) {
            // The caller has read everything
            cacheEditor.commit()
          } else {
            cacheEditor.abort()
          }
        } catch (e: Exception) {
          // Silently ignore cache write errors
        } finally {
          hasClosedAndCommitted = true
        }
        originalSource.close()
      }
    }

This should cache "complete 200" responses. We can revisit later to add support for discarding GraphQL errors.

@martinbonnin:

Fell into this rabbit hole, and it turns out it's not too hard to implement. I gave it a try here: #4087. It is wildly untested. Feel free to include it in this PR, or I'll rebase and open a follow-up PR later.

@BoD (author):

Yes this looks good! 🙏 Merging into this one and I'll add 2 tests for it.
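
For reference, the two tests could look something along these lines (a sketch only; mockServer, apolloClient, query, and the response fixtures are assumed to be provided by the test harness, imports omitted):

    @Test
    fun completeResponseIsCached() = runTest {
      mockServer.enqueue(completeJsonResponse)
      apolloClient.query(query).execute()
      // A second execution served from the HTTP cache only should succeed
      val cached = apolloClient.query(query)
          .httpFetchPolicy(HttpFetchPolicy.CacheOnly)
          .execute()
      assertNotNull(cached.data)
    }

    @Test
    fun truncatedResponseIsNotCached() = runTest {
      mockServer.enqueue(truncatedJsonResponse)
      runCatching { apolloClient.query(query).execute() } // parsing fails
      // The truncated body must not have been committed to the cache
      assertFails {
        apolloClient.query(query)
            .httpFetchPolicy(HttpFetchPolicy.CacheOnly)
            .execute()
      }
    }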

@BoD (author):

Hmm, 2 tests failed. I'll have a look before merging :P

@BoD BoD marked this pull request as ready for review May 3, 2022 18:41

Review thread on this hunk:

    var lastEmitTime = currentTimeMillis()
    apolloClient.query(WithFragmentSpreadsQuery()).toFlow().collect {
      // Allow a 10% margin for inaccuracies
      assertTrue(currentTimeMillis() - lastEmitTime >= delay / 1.1)

@martinbonnin:

It'd be nice to have a lower-level MockServer API where we enqueue the chunks one after the other and read them as we go. That would remove the potential for flaky timing issues.
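
Something along these hypothetical lines (enqueueChunkedResponse and sendChunk are made-up names, not an existing API):

    // Push-style API sketch: the test controls exactly when each chunk is
    // sent, so assertions don't depend on timers.
    val response = mockServer.enqueueChunkedResponse(
        headers = mapOf("Content-Type" to "multipart/mixed; boundary=-"),
    )
    // Start collecting apolloClient.query(...).toFlow() in the background, then:
    response.sendChunk(chunk1)
    // ...assert on the first emission, no timing margins involved...
    response.sendChunk(chunk2, last = true)
    // ...assert on the second emission...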

@BoD (author):

Oh yeah 😊 I'll have a look!

Review thread on this hunk:

    @@ -47,25 +47,46 @@ fun createMultipartMixedChunkedResponse(
        responseDelayMillis: Long = 0,
        chunksDelayMillis: Long = 0,
        boundary: String = "-",
        waitForMoreChunks: Boolean = false,

@martinbonnin:

That's a lot of constructor arguments. I think MockResponse is due for a little refactoring.

We could make it take a Source as body:

    class MockResponse(
        val statusCode: Int = 200,
        val body: Source = Buffer(),
        val headers: Map<String, String> = emptyMap(),
    )

Then delay and chunks "just" become specific Source implementations:

    class DelayedSource(
        val delayMillis: Long,
        val wrappedSource: Source,
    ) : Source {
      private var firstRead = true

      override fun close() {
        wrappedSource.close()
      }

      override fun read(sink: Buffer, byteCount: Long): Long {
        if (firstRead) {
          firstRead = false
          // TODO: see if we can implement this more efficiently with timeouts
          // (POSIX usleep takes microseconds, Kotlin/Native)
          usleep((delayMillis * 1000).toUInt())
        }
        return wrappedSource.read(sink, byteCount)
      }

      override fun timeout(): Timeout {
        TODO("Not yet implemented")
      }
    }

    class ChunkedSource : Source {
      // ...
    }
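
A usage sketch under that design (assuming MockServer.enqueue accepts the new MockResponse):

    // Enqueue a response whose body only starts flowing after 500 ms
    mockServer.enqueue(MockResponse(
        statusCode = 200,
        body = DelayedSource(
            delayMillis = 500,
            wrappedSource = Buffer().writeUtf8("""{"data":{}}"""),
        ),
        headers = mapOf("Content-Length" to "11"),
    ))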

@BoD (author):

It's a cool idea, but the ChunkedSource plus a way to enqueue the chunks may be tricky to implement: we'd need a way to make reads blocking. I did it in StreamingNSURLSessionHttpEngine with pthread_cond, but we'd need something for Java (easy) and JS (not sure if easy!).
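
On the JVM, the "easy" blocking version could look like this (a sketch; BlockingChunkSource is a made-up name):

    import java.util.concurrent.LinkedBlockingQueue
    import okio.Buffer
    import okio.ByteString
    import okio.Source
    import okio.Timeout

    // JVM-only sketch: read() blocks until the test enqueues the next chunk,
    // playing the role pthread_cond plays in the Apple implementation.
    class BlockingChunkSource : Source {
      private val queue = LinkedBlockingQueue<ByteString>()

      fun send(chunk: ByteString) = queue.put(chunk)
      fun sendEof() = queue.put(ByteString.EMPTY) // empty chunk as EOF sentinel

      override fun read(sink: Buffer, byteCount: Long): Long {
        val chunk = queue.take() // blocks until a chunk is available
        if (chunk.size == 0) return -1L
        // Note: ignores byteCount for simplicity; fine for small test chunks
        sink.write(chunk)
        return chunk.size.toLong()
      }

      override fun close() {}
      override fun timeout(): Timeout = Timeout.NONE
    }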

@martinbonnin:

Could we do some kind of non-blocking/polling I/O with okio? Have read() return zero in the source implementation while waiting, and something > 0 when data is available again?

@BoD (author):

So I think 0 would make callers call again immediately, in a loop, but we could sleep for a few ms to not hog the CPU? Not ideal, but maybe enough for tests?
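
A rough JVM sketch of that polling idea (made-up names; note that returning 0 bends the okio Source contract, which is where the next comment picks up):

    import java.util.concurrent.ConcurrentLinkedQueue
    import okio.Buffer
    import okio.ByteString
    import okio.Source
    import okio.Timeout

    // Polling sketch: return 0 when no chunk is available yet, sleeping a few
    // ms first so callers that loop on read() don't hog the CPU.
    class PollingChunkSource(
        private val chunks: ConcurrentLinkedQueue<ByteString>,
    ) : Source {
      @Volatile var done = false // set by the producer after the last chunk

      override fun read(sink: Buffer, byteCount: Long): Long {
        val chunk = chunks.poll()
        return when {
          chunk != null -> {
            sink.write(chunk)
            chunk.size.toLong()
          }
          done -> -1L
          else -> {
            Thread.sleep(5) // not ideal, but maybe enough for tests
            0L
          }
        }
      }

      override fun close() {}
      override fun timeout(): Timeout = Timeout.NONE
    }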

@martinbonnin:

Right, okio.Source is not the right abstraction then. But we could write our own Source abstraction with coroutines support, to give JS land a chance to "sleep"/"delay".

@BoD (author):

What about Flow<ByteString>?

    class MockResponse(
        val statusCode: Int = 200,
        val body: Flow<ByteString> = emptyFlow(),
        val headers: Map<String, String> = emptyMap(),
    )

Example usage:

    fun MockServer.enqueue(string: String, delayMillis: Long = 0) {
      enqueue(MockResponse(
          statusCode = 200,
          body = flow {
            delay(delayMillis)
            emit(string.encodeUtf8())
          },
          headers = mapOf("Content-Length" to string.length.toString()),
      ))
    }

And for a chunked body, feed a Channel and expose it as the Flow:

    val chunks = Channel<String>(Channel.UNLIMITED)
    val body: Flow<ByteString> = chunks.receiveAsFlow().map { it.encodeUtf8() }

    // Later, from a coroutine:
    delay(200)
    chunks.send("chunk 1")
    delay(200)
    chunks.send("chunk 2")
    chunks.close()
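
On the MockServer side, consuming such a body could be as simple as collecting the flow into the connection's sink (a sketch; chunked transfer-encoding framing omitted):

    import kotlinx.coroutines.flow.*
    import okio.BufferedSink
    import okio.ByteString

    // Sketch: write each emitted chunk to the client as soon as it arrives
    suspend fun writeBody(sink: BufferedSink, body: Flow<ByteString>) {
      body.collect { chunk ->
        sink.write(chunk)
        sink.flush() // push the chunk to the client immediately
      }
    }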

@BoD (author):

If you're OK with it I can remove the MockServer changes from this PR and open a dedicated one 😅

@martinbonnin:

Sure thing!

@BoD BoD force-pushed the dont-buffer-whole-body-in-http-cache branch from 590902a to ef244eb Compare May 10, 2022 13:13
@martinbonnin left a review comment:

👍

@BoD BoD merged commit 0f50117 into main May 11, 2022
@BoD BoD deleted the dont-buffer-whole-body-in-http-cache branch May 11, 2022 08:29