Auto embedding #100

richard-epsilla · 2023-12-14T05:34:00Z

Automatically embed the attributes

When starting the DB, provide the embedding service base URL:

./build/vectordb -e http://localhost:8889

Define embedding indices:

{
    "name": "MyTable124",
    "returnTableId": true,
    "fields": [
        {
            "name": "ID2",
            "dataType": "FLOAT"
        },
        {
            "name": "Document",
            "dataType": "STRING"
        },
        {
            "name": "Document2",
            "dataType": "STRING"
        },
        {
            "name": "ID1",
            "dataType": "BIGINT"
        },
        {
            "name": "Embedding",
            "dataType": "VECTOR_FLOAT",
            "dimensions": 4,
            "metricType": "COSINE"
        }
    ],
    "indices": [
        {
            "name": "MyIndex",
            "field": "Document",
            "model": "BAAI/bge-base-en-v1.5"
        },
        {
            "name": "MyIndex2",
            "field": "Document2",
            "model": "BAAI/bge-small-en-v1.5"
        }
    ]
}

When records are inserted, the indexed fields are embedded automatically.

Supported embedding models:
BAAI/bge-small-en
BAAI/bge-small-en-v1.5
BAAI/bge-small-zh-v1.5
BAAI/bge-base-en
BAAI/bge-base-en-v1.5
sentence-transformers/all-MiniLM-L6-v2
openai/text-embedding-ada-002
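
A minimal sketch of a client-side sanity check for the "indices" section of a table definition, run before sending it to the server. The function name check_indices is hypothetical, and the rule that an indexed field must be a STRING field is an assumption drawn from the example above, not something this PR states. Because the commit list mentions a default embedding model, a missing "model" is not treated as an error here.

```python
# Models listed as supported in this PR description.
SUPPORTED_MODELS = {
    "BAAI/bge-small-en",
    "BAAI/bge-small-en-v1.5",
    "BAAI/bge-small-zh-v1.5",
    "BAAI/bge-base-en",
    "BAAI/bge-base-en-v1.5",
    "sentence-transformers/all-MiniLM-L6-v2",
    "openai/text-embedding-ada-002",
}


def check_indices(schema):
    """Return a list of human-readable problems found in schema["indices"]."""
    # Assumption: only STRING fields can be embedded.
    string_fields = {
        field["name"]
        for field in schema.get("fields", [])
        if field.get("dataType") == "STRING"
    }
    problems = []
    for index in schema.get("indices", []):
        name = index.get("name", "<unnamed>")
        if index.get("field") not in string_fields:
            problems.append(
                f"index {name}: field {index.get('field')!r} is not a STRING field"
            )
        model = index.get("model")
        # A missing model is allowed (the server may apply a default).
        if model is not None and model not in SUPPORTED_MODELS:
            problems.append(f"index {name}: unsupported model {model!r}")
    return problems
```

An empty list means the definition passed these checks; the server remains the authority on what is actually accepted.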

When a table uses an OpenAI embedding model, pass the X-OpenAI-API-Key header in requests to the insert, query, and loaddb APIs.

Queries can target an index. Provide either a query vector or a query string; a query string is embedded automatically using the model defined in the index.

POST http://localhost:8888/api/<DB>/data/query
Option 1: with a query string
{
    "table": "VideoData",
    "query": "What's the best way to code xxx?",
    "queryIndex": "Embedding",
    "limit": 5
}
If the table has only one index, queryIndex can be omitted:
{
    "table": "VideoData",
    "query": "What's the best way to code xxx?",
    "limit": 5
}

Option 2: with a query vector
{
    "table": "VideoData",
    "queryVector": [ 0.06929776072502136,
                0.49731335043907166,
                0.6196035146713257,
                0.6032981276512146 ...],
    "queryIndex": "Embedding",
    "limit": 5
}
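
The two query options can be sketched as a small Python helper that assembles the request for the /api/<DB>/data/query endpoint shown above. The names build_query and send are hypothetical, the base URL and database name are placeholders, and the assumption that the server replies with a JSON body is not confirmed by this PR. The X-OpenAI-API-Key header is added only when a key is supplied, for tables indexed with an OpenAI model.

```python
import json
import urllib.request


def build_query(base_url, db, table, *, query=None, query_vector=None,
                query_index=None, limit=5, openai_api_key=None):
    """Return (url, headers, payload) for a query by string or by vector."""
    # Exactly one of the two query forms must be given.
    if (query is None) == (query_vector is None):
        raise ValueError("provide exactly one of query or query_vector")

    body = {"table": table, "limit": limit}
    if query is not None:
        body["query"] = query  # embedded server-side with the index's model
    else:
        body["queryVector"] = list(query_vector)
    if query_index is not None:
        body["queryIndex"] = query_index  # may be omitted with a single index

    headers = {"Content-Type": "application/json"}
    if openai_api_key is not None:
        headers["X-OpenAI-API-Key"] = openai_api_key

    url = f"{base_url}/api/{db}/data/query"
    return url, headers, json.dumps(body).encode("utf-8")


def send(url, headers, payload):
    """POST the prepared request; assumes the response body is JSON."""
    request = urllib.request.Request(url, data=payload, headers=headers,
                                     method="POST")
    with urllib.request.urlopen(request) as response:
        return json.loads(response.read().decode("utf-8"))
```

For example, send(*build_query("http://localhost:8888", "MyDB", "VideoData", query="What's the best way to code xxx?")) would issue the single-index form of Option 1 against a locally running server, where "MyDB" is a placeholder database name.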

richard-epsilla requested review from TopKeyboard, eric-epsilla and ricki-epsilla December 14, 2023 05:34

richard-epsilla marked this pull request as draft December 14, 2023 05:34

eric-epsilla approved these changes Dec 14, 2023

richard-epsilla and others added 17 commits December 15, 2023 15:19

Embedding service skeleton

b18c4d9

Fix API

0d365a0

Inject embedding service to meta and db server

8b60afc

Catalog management with index

1a5612d

Inject embedding service all the way down to table segment

64ae9b1

Partial insert handle

e2381ea

Fix embedding service to use constructor passing

715eb68

Basic embedding support

421257b

Add retry logic for embedding

73c1da1

Hide index fields implicitly

432892b

Create build-embedding.yml

0e00d50

Fix oatpp-curl install

308d25e

try link oatpp with oatpp-curl

83e36db

Query with content

5a7525d

Set default embedding model

61b4a80

Pass openai embedding during query and db load

ef9f3f3

add missing file and merge

7305990

richard-epsilla force-pushed the auto-embedding branch from c7b4150 to 7305990 Compare December 15, 2023 20:20

Fix pybinding interface

36c95bc

richard-epsilla marked this pull request as ready for review December 15, 2023 21:21

richard-epsilla added 2 commits December 15, 2023 16:53

Change back version

78dfcaf

Remove the comment

ff599a4

richard-epsilla merged commit 989060e into main Dec 16, 2023
1 check passed

richard-epsilla deleted the auto-embedding branch February 12, 2024 14:53
