add embedding && index embedding #86

LittleLittleCloud · 2023-05-16T00:38:41Z

Purpose

This PR enables Azure Document vector search solution. It makes two major change

during indexing, it uses text-embedding-ada model to generate embeddings for all sections
when retrieving relavent doc, it uses the same embedding model to generate embedding for query

TODO

replace text-embedding-ada with local embedding model
~~update .bicep to create embedding model during provision.~~

fix issue

Update Azure Search component to use Vector DB solution #48

Does this introduce a breaking change?

[x] Yes
[ ] No

Pull Request Type

What kind of change does this Pull Request introduce?

[ ] Bugfix
[ ] Feature
[ ] Code style update (formatting, local variables)
[ ] Refactoring (no functional changes, no api changes)
[ ] Documentation content changes
[ ] Other... Please describe:

How to Test

Get the code

git clone [repo-address]
cd [repo-name]
git checkout [branch-name]
npm install

Test the code

What to Check

Verify that the following are valid

...

Other Information

scripts/prepdocs.ps1

luisquintanilla

Looks great @LittleLittleCloud. Added minor nit. Do we know if the OpenAI endpoint that's deployed by azd/bicep contains an embedding model or is this something we need to account for?

app/prepdocs/PrepareDocs/Program.Options.cs

…leLittleCloud/azure-search-openai-demo-csharp into u/xiaoyun/azureDocEmbedding

luisquintanilla · 2023-05-17T20:43:11Z

@LittleLittleCloud let's do this for this PR. We won't merge into main. Let's create a separate branch for it and update the READMEs to point to that branch if they're interested in trying out the vector support.

app/backend/Extensions/SearchClientExtensions.cs

pablocastro · 2023-05-18T00:56:42Z

app/backend/Extensions/SearchClientExtensions.cs

@@ -6,7 +6,8 @@ internal static class SearchClientExtensions
 {
    internal static async Task<string> QueryDocumentsAsync(
        this SearchClient searchClient,
-        string query,
+        string? query = null,


Since now query can be null (e.g. pure vector search), you need to account for this below when creating search options that use semantic ranking. If query == null then it's an error to enable reranking (SearchQueryType.Semantic) since we need to have a text query for that.

pablocastro · 2023-05-18T00:58:38Z

app/backend/Services/ReadRetrieveReadChatService.cs

@@ -75,7 +75,7 @@ public class ReadRetrieveReadChatService

        // step 2
        // use query to search related docs
-        var documentContents = await _searchClient.QueryDocumentsAsync(query.Result, overrides, cancellationToken);
+        var documentContents = await _searchClient.QueryDocumentsAsync(query.Result, embedding: null, overrides: overrides, cancellationToken);


Why no vectors for the chat case? Seems like it's the most popular scenario, isn't it?

pablocastro · 2023-05-18T01:02:28Z

app/backend/Services/RetrieveThenReadApproachService.cs

+            InputType = "query",
+        }, cancellationToken);
+        var embedding = questionEmbeddingResponse.Value.Data.First().Embedding.ToArray();
+        var text = await _searchClient.QueryDocumentsAsync(query: null, embedding: embedding, overrides: overrides, cancellationToken: cancellationToken);


Why null for query? We could do hybrid search using both text and vector, it'll yield better results.

Even better, though we may not have time for this right now, we could have an option in the "developer settings" part of the UX that let's the user choose between:

Pure vector search

Pure text search

Hybrid search

For 2 and 3, option to enable semantic

Separately, if the overrides enable semantic search, you can't have the text query == null.

app/backend/Services/Skills/RetrieveRelatedDocumentSkill.cs

pablocastro · 2023-05-18T05:37:02Z

infra/main.bicep

@@ -50,6 +50,8 @@ param gptDeploymentName string = 'davinci'
 param gptModelName string = 'text-davinci-003'
 param chatGptDeploymentName string = 'chat'
 param chatGptModelName string = 'gpt-35-turbo'
+param embeddingModelName string = 'text-embedding-ada-002'
+param embeddingDeploymentName string = 'embedding'


I think you might have forgotten to wire up this name to the webapp settings in line ~140 below.

Those values are saved in the key-vault now. So most of the parameters here are probably unnecessary to set.

add embedding && index embedding

783e611

LittleLittleCloud commented May 16, 2023

View reviewed changes

scripts/prepdocs.ps1 Outdated Show resolved Hide resolved

Update KeyVaultConfigurationBuilderExtensions.cs

7f7edac

luisquintanilla reviewed May 16, 2023

View reviewed changes

app/prepdocs/PrepareDocs/Program.Options.cs Outdated Show resolved Hide resolved

LittleLittleCloud added 6 commits May 15, 2023 17:53

hook up bicep

d54f90a

Merge branch 'u/xiaoyun/azureDocEmbedding' of https://github.com/Litt…

8a0a7ab

…leLittleCloud/azure-search-openai-demo-csharp into u/xiaoyun/azureDocEmbedding

add nugetconfig

792f19e

update build.yml

aacb760

fix spell error

c232917

use global namespace

075df39

pablocastro reviewed May 18, 2023

View reviewed changes

LittleLittleCloud added 4 commits May 18, 2023 11:47

fix comments

7691767

mv nuget.config

f79b530

Merge branch 'main' into u/xiaoyun/azureDocEmbedding

bbb9741

fix build

e21d14b

LittleLittleCloud changed the base branch from main to feature/embeddingSearch May 22, 2023 20:21

luisquintanilla mentioned this pull request May 22, 2023

Update Azure Search component to use Vector DB solution #48

Closed

LittleLittleCloud merged commit 0a52a89 into Azure-Samples:feature/embeddingSearch Jun 20, 2023
2 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

add embedding && index embedding #86

add embedding && index embedding #86

LittleLittleCloud commented May 16, 2023 •

edited

Loading

luisquintanilla left a comment

luisquintanilla commented May 17, 2023

pablocastro May 18, 2023

pablocastro May 18, 2023

pablocastro May 18, 2023

pablocastro May 18, 2023

LittleLittleCloud May 18, 2023

add embedding && index embedding #86

add embedding && index embedding #86

Conversation

LittleLittleCloud commented May 16, 2023 • edited Loading

Purpose

TODO

Does this introduce a breaking change?

Pull Request Type

How to Test

What to Check

Other Information

luisquintanilla left a comment

Choose a reason for hiding this comment

luisquintanilla commented May 17, 2023

pablocastro May 18, 2023

Choose a reason for hiding this comment

pablocastro May 18, 2023

Choose a reason for hiding this comment

pablocastro May 18, 2023

Choose a reason for hiding this comment

pablocastro May 18, 2023

Choose a reason for hiding this comment

LittleLittleCloud May 18, 2023

Choose a reason for hiding this comment

LittleLittleCloud commented May 16, 2023 •

edited

Loading