@luke-lombardi
Contributor

@luke-lombardi luke-lombardi commented Jan 12, 2025

  • Adds a prefetching system to blobfs that pulls data via streaming (GetContentStream)
  • The main benefit is that we avoid gRPC serialization and connection overhead
  • The secondary benefit is that we can read in larger chunks, since the message size is not restricted by the max read-ahead size (enforced by the kernel)
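
For readers unfamiliar with the pattern, here is a minimal sketch of the general idea, not the code in this PR. A background goroutine pulls large chunks from a stream while the caller issues small sequential reads against the buffered data. The `Prefetcher` type, `NewPrefetcher`, `readThroughPrefetcher`, and the chunk size and depth parameters are all hypothetical names chosen for illustration, and a plain `io.Reader` stands in for the content stream.

```go
package main

import (
	"bytes"
	"io"
)

// Prefetcher (hypothetical) buffers large chunks from a stream in the
// background and serves small sequential reads from them.
type Prefetcher struct {
	chunks chan []byte // chunks fetched ahead of the reader
	cur    []byte      // unread remainder of the current chunk
	srcErr error       // set by the fetch goroutine before it closes chunks
}

// NewPrefetcher starts fetching chunkSize-byte chunks from src, keeping up
// to depth chunks queued ahead of the consumer.
// Assumption: a real implementation would also need cancellation so the
// goroutine does not leak when the consumer stops reading early.
func NewPrefetcher(src io.Reader, chunkSize, depth int) *Prefetcher {
	p := &Prefetcher{chunks: make(chan []byte, depth)}
	go func() {
		defer close(p.chunks)
		for {
			buf := make([]byte, chunkSize)
			n, err := io.ReadFull(src, buf)
			if n > 0 {
				p.chunks <- buf[:n]
			}
			if err != nil {
				// io.EOF and io.ErrUnexpectedEOF just mean the stream ended.
				if err != io.EOF && err != io.ErrUnexpectedEOF {
					p.srcErr = err
				}
				return
			}
		}
	}()
	return p
}

// Read implements io.Reader on top of the prefetched chunks.
func (p *Prefetcher) Read(dst []byte) (int, error) {
	if len(p.cur) == 0 {
		chunk, ok := <-p.chunks
		if !ok {
			if p.srcErr != nil {
				return 0, p.srcErr
			}
			return 0, io.EOF
		}
		p.cur = chunk
	}
	n := copy(dst, p.cur)
	p.cur = p.cur[n:]
	return n, nil
}

// readThroughPrefetcher is a demo helper: it streams data through a
// Prefetcher and reads it back in readSize pieces, the way a caller
// limited to small reads might.
func readThroughPrefetcher(data []byte, chunkSize, readSize int) ([]byte, error) {
	p := NewPrefetcher(bytes.NewReader(data), chunkSize, 2)
	var out []byte
	buf := make([]byte, readSize)
	for {
		n, err := p.Read(buf)
		out = append(out, buf[:n]...)
		if err == io.EOF {
			return out, nil
		}
		if err != nil {
			return out, err
		}
	}
}
```

In this sketch the chunk size is independent of the size of each caller read, which is the property the second bullet describes: the stream can deliver data in pieces much larger than any single read request.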

Benchmarks below (tested on a local machine):

Without prefetching/streaming:
[Image: Screenshot 2025-01-12 at 8 58 36 PM]

With prefetching/streaming:
[Image: Screenshot 2025-01-12 at 8 57 19 PM]

Still testing some edge cases, but checksums match, so it seems to be working.
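
A checksum comparison of this kind could be sketched as follows. This is only an illustration: the PR does not say which hash or tooling was used, so SHA-256 and the helper names `digest`, `sameContent`, and `compareBytes` are assumptions.

```go
package main

import (
	"bytes"
	"crypto/sha256"
	"encoding/hex"
	"io"
)

// digest streams r through SHA-256 and returns the hex-encoded sum.
// SHA-256 is an assumed choice; any stable hash would serve the same purpose.
func digest(r io.Reader) (string, error) {
	h := sha256.New()
	if _, err := io.Copy(h, r); err != nil {
		return "", err
	}
	return hex.EncodeToString(h.Sum(nil)), nil
}

// sameContent reports whether two read paths (for example, the same file
// read with and without prefetching) yield identical bytes.
func sameContent(a, b io.Reader) (bool, error) {
	da, err := digest(a)
	if err != nil {
		return false, err
	}
	db, err := digest(b)
	if err != nil {
		return false, err
	}
	return da == db, nil
}

// compareBytes is a convenience wrapper for in-memory data.
func compareBytes(a, b []byte) (bool, error) {
	return sameContent(bytes.NewReader(a), bytes.NewReader(b))
}
```

Because `sameContent` takes `io.Reader` values, the same check works on open files from two mounts without loading either file fully into memory.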

@luke-lombardi luke-lombardi marked this pull request as draft January 12, 2025 18:19
@luke-lombardi luke-lombardi marked this pull request as ready for review January 12, 2025 19:00
@luke-lombardi luke-lombardi changed the title from "Feat: Allow prefetching of data to leverage GetContentStream in blobfs" to "[WIP] Feat: Allow prefetching of data to leverage GetContentStream in blobfs" Jan 13, 2025
@luke-lombardi luke-lombardi merged commit 4d18e2a into master Jan 13, 2025
@luke-lombardi luke-lombardi deleted the ll/prefetch branch January 13, 2025 22:31

3 participants