
Conversation

@franciscojavierarceo (Collaborator) commented Aug 2, 2025

What does this PR do?

This PR adds support for Chunks in the new OpenAI Vector Stores API. In particular, it adds the following APIs:

  • @webmethod(route="/openai/v1/vector_stores/{vector_store_id}/files/{file_id}/chunks", method="GET")
  • @webmethod(route="/openai/v1/vector_stores/{vector_store_id}/files/{file_id}/chunks/{chunk_id}", method="GET")
  • @webmethod(route="/openai/v1/vector_stores/{vector_store_id}/files/{file_id}/chunks/{chunk_id}", method="POST")
  • @webmethod(route="/openai/v1/vector_stores/{vector_store_id}/files/{file_id}/chunks/{chunk_id}", method="DELETE")

It's worth noting that these APIs aren't part of OpenAI's public API, but they are consistent with its conventions and are likely similar to what OpenAI exposes internally (that's just speculation on my part).

As mentioned in this issue, this is needed for supporting the ingestion of precomputed embeddings, similar to what's available with VectorIO today.

See this example here that is in use at Red Hat: https://github.com/opendatahub-io/rag/tree/main/demos/kfp/docling/pdf-conversion

I enabled the ingestion of precomputed embeddings in #2317, which has been used by a number of our customers via VectorIO.insert(). This would give us feature parity and be consistent with OpenAI's naming conventions.
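For context, a precomputed-embedding chunk in this workflow might look roughly like the following. The field names here are assumptions modeled loosely on what `VectorIO.insert()` accepts, not the exact llama-stack schema:

```python
# Hypothetical chunk payload with a precomputed embedding; field names are
# assumptions, not the authoritative llama-stack schema.
def make_chunk(chunk_id: str, content: str, embedding: list[float]) -> dict:
    """Build a chunk dict carrying an embedding computed offline
    (e.g. by a docling pipeline, as in the Red Hat example above)."""
    if not embedding:
        raise ValueError("precomputed embedding must be non-empty")
    return {
        "chunk_id": chunk_id,
        "content": content,
        "embedding": embedding,  # supplied by the user, not computed server-side
        "metadata": {"source": "precomputed"},
    }
```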

Other thoughts

I also have a PoC of how this can be exposed in the UI. A screenshot is available below:

Screenshot 2025-08-01 at 11 24 56 PM

Closes #3021

Test Plan

Unit tests added.

Signed-off-by: Francisco Javier Arceo <farceo@redhat.com>
@meta-cla meta-cla bot added the CLA Signed This label is managed by the Meta Open Source bot. label Aug 2, 2025
franciscojavierarceo and others added 2 commits August 1, 2025 23:22
@mattf (Collaborator) commented Aug 2, 2025

discussion of this is happening on #2981

@franciscojavierarceo (Collaborator, Author) commented:
@mattf it sounds like we're open to this approach now? I'd need to update it a bit to handle the extras with the openai client and add some tests. If so, let me know and I'll do so.

@mattf (Collaborator) commented Sep 20, 2025

> @mattf it sounds like we're open to this approach now? I'd need to update it a bit to handle the extras with the openai client and add some tests. If so, let me know and I'll do so.

we should have api extensions. my concern is about the design of this extension.

there are two use cases here -

  1. (vector-stores) let llama stack be intelligent about how uploaded files are chunked, embedded and queried
  2. (vector-dbs + vector-io) use llama stack as a consistent, portable interface to a vector db, where the user chunks and embeds their files

(2) seems useful when -

  • llama stack does a poor job at (1)
  • the user is doing research on rag pipelines

the risk of error seems very high when a single vector store is used both ways at once, e.g. the user mixes type (1) processing with type (2) chunks & embeddings.

blending these two use cases under one endpoint increases that risk.

instead of merging vector-dbs and vector-io into vector-stores, what if vector-dbs & vector-io get merged and placed at an endpoint that differentiates its use case from the vector-stores case? in the process, the embedding_model/dimension fields on vector-dbs should be removed, and the embedding should be made required on insert.
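The proposal above can be sketched as an insert signature for the merged vector-dbs + vector-io endpoint: no `embedding_model`/`dimension` on the store, and an embedding required on every chunk. All names below are illustrative, not an actual llama-stack interface:

```python
# Sketch of the proposed merged endpoint's insert contract (names are
# illustrative): the store carries no embedding_model/dimension, and the
# embedding is required per chunk because the user embeds their own files.
from dataclasses import dataclass

@dataclass
class RawChunk:
    chunk_id: str
    content: str
    embedding: list[float]  # required: supplied by the caller

def insert(chunks: list[RawChunk]) -> int:
    """Validate that every chunk carries an embedding; return the count
    accepted. The actual write to the backing vector db is elided."""
    for c in chunks:
        if not c.embedding:
            raise ValueError(f"chunk {c.chunk_id} is missing its embedding")
    return len(chunks)
```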

Successfully merging this pull request may close these issues.

Create GET (+list), UPDATE, and DELETE APIs for Chunks in Vector Stores API