fix(examples): use pgvector image and fix arm64 startup failures#10
Draft
deangoodmanson wants to merge 1 commit intodevelopfrom
Draft
fix(examples): use pgvector image and fix arm64 startup failures#10deangoodmanson wants to merge 1 commit intodevelopfrom
deangoodmanson wants to merge 1 commit intodevelopfrom
Conversation
- Use pgvector/pgvector:pg17 image so the vector extension is available - Add platform: linux/amd64 for py-std-worker to suppress emulation warning - Remove ivfflat index on document_chunks (vector(3072) exceeds 2000-dim limit) Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
3 tasks
3 tasks
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Add this suggestion to a batch that can be applied as a single commit.This suggestion is invalid because no changes were made to the code.Suggestions cannot be applied while the pull request is closed.Suggestions cannot be applied while viewing a subset of changes.Only one suggestion per line can be applied in a batch.Add this suggestion to a batch that can be applied as a single commit.Applying suggestions on deleted lines is not supported.You must change the existing code in this line in order to create a valid suggestion.Outdated suggestions cannot be applied.This suggestion has been applied or marked resolved.Suggestions cannot be applied from pending reviews.Suggestions cannot be applied on multi-line comments.Suggestions cannot be applied while the pull request is queued to merge.Suggestion cannot be applied right now. Please check back later.
Summary
image: pgvector/pgvector:pg17to the examples postgres service so thevectorextension is available on first startupivfflatindex ondocument_chunks.embedding—vector(3072)(OpenAItext-embedding-3-large) exceeds pgvector's 2000-dimension limit for IVFFlat/HNSW indexes; sequential scan is sufficient for demo datasetsplatform: linux/amd64frompy-std-worker— it was harmful (see Known Gap below)Problem
On arm64 (Apple Silicon), the examples stack failed on startup because the base postgres image does not include the
vectorextension required by the RAG examples (06b, 06c).Known Gap:
kruxia/kruxiaflow-py-stdis not a multi-arch imagekruxia/kruxiaflow-py-std:latestcurrently only publishes alinux/arm64manifest — there is nolinux/amd64variant. The previousplatform: linux/amd64override was added to suppress an emulation warning on Apple Silicon, but it was actively wrong: it forced amd64 emulation on the one platform where a native image already exists, and would fail or emulate on amd64 hosts.The correct fix is to rebuild and publish the image as a proper multi-arch manifest:
This requires the
py/Dockerfileto exist in this repo (planned perpython-sdk-architecture.mdas part of the first-party worker image family). Until the multi-arch image is published:Tracking issue needed: build and publish
kruxia/kruxiaflow-py-stdaslinux/amd64,linux/arm64.Test plan
docker-compose -f docker-compose.yml -f docker-compose.examples.yml upstarts cleanly on arm64vectorextension available inkruxiaflow_examplesdatabase🤖 Generated with Claude Code