[A/B] Changes to the RAG functionality #630

wpfl-dbt · 2024-06-21T11:23:05Z

Context

We want to split test two paths for the RAG functionality:

@search is the standard
@supasearch implements a higher k, similarity threshold and hybrid search with a boosted similarity weight

Note I also change the keywords we use to make them a bit nicer for users, and remove the enum for keywords in the frontend.

Note I haven't put in a unit test because this is about lightweight experimentation.

Changes proposed in this pull request

Adds @supasearch route
Rename @summarisation to @summarise to enable people to say things like "@summarise the people in these docs"
Rename @retrieval to @search to enable people to say things like "@search which type of energy saw the largest spike in prices"
Removes enum for routes in the frontend

Guidance to review

Check you're happy with my changes

Relevant links

Things to check

I have added any new ENV vars in all deployed environments
I have tested any code added or changed
I have run integration tests

gecBurton · 2024-06-24T06:38:38Z

core_api/src/dependencies.py


        return {
-            "size": env.ai.rag_k,
+            "size": params["size"],


Suggested change

"size": params["size"],

"size": env.ai.rag_k or params["size"],

or i f we arent using env.ai we should remove it (but i think its probably a good idea)

As discussed, these are added in the default params further down the function, so definitely still used and I think we should keep them for now.

gecBurton · 2024-06-24T06:39:04Z

core_api/src/dependencies.py

                        {
                            "knn": {
                                "field": "embedding",
                                "query_vector": vector,
-                                "num_candidates": env.ai.rag_num_candidates,
-                                "filter": knn_filter,
+                                "num_candidates": params["num_candidates"],


gecBurton · 2024-06-24T06:39:17Z

core_api/src/dependencies.py

+                        {
+                            "match": {
+                                "text": {
+                                    "query": query["question"],


gecBurton

LGTM

not for now but we should think about aliases for summary/summarise etc

brunns · 2024-06-24T11:03:27Z

core_api/src/semantic_routes.py

@@ -147,7 +147,19 @@ def get_routable_chains(
            ChatRoute.ability: build_static_response_chain(ABILITY_RESPONSE, ChatRoute.ability),
            ChatRoute.coach: build_static_response_chain(COACH_RESPONSE, ChatRoute.coach),
            ChatRoute.gratitude: build_static_response_chain("You're welcome!", ChatRoute.gratitude),
-            ChatRoute.retrieval: retrieval_chain,
-            ChatRoute.summarisation: summary_chain,
+            ChatRoute.search: retrieval_chain,


These changes will need to be reflected in the Django app, in the ChatRoute enum.

I'd like to remove this enum from the frontend and just hold it as a string. An advantage of keywords is we can quickly add and remove them to test stuff, and doing this as an enum in the front will mean a db migration every time this happens because keywords and routes are currently bound together.

We'll rue this one day, you mark my words. But if you must, you must.

Looks like the migration has been put in to handle route as a string in Django? Is this thread resolved?

brunns

Can you rebase and run the integration tests? The looks good to me.

gecBurton · 2024-06-25T08:38:05Z

Can you rebase and run the integration tests? The looks good to me.

https://github.com/i-dot-ai/redbox-copilot/actions/runs/9659121995

this is working locally for me

Will Langdale added 3 commits June 21, 2024 12:03

Initial supasearch functionality

ecf70f9

Fixed streaming unit test

570d12f

Merged main

ca48fcd

wpfl-dbt requested a review from jamesrichards4 June 21, 2024 11:51

Added dict logging

f3d0e6b

gecBurton reviewed Jun 24, 2024

View reviewed changes

core_api/src/dependencies.py

{

"match": {

"text": {

"query": query["question"],

Copy link

Collaborator

gecBurton Jun 24, 2024

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

👍

gecBurton approved these changes Jun 24, 2024

View reviewed changes

brunns suggested changes Jun 24, 2024

View reviewed changes

Will Langdale added 6 commits June 24, 2024 12:40

Added quickupload notebook

e1f77d0

Merged from main

58c5178

Merged main

33001ec

Minor bugfixes

a36ee0d

Merge branch 'main' into test/k2

309dc70

Removed enum for ChatRoute to neable quicker iteration

9c83eb8

jamesrichards4 approved these changes Jun 25, 2024

View reviewed changes

brunns approved these changes Jun 25, 2024

View reviewed changes

George Burton and others added 3 commits June 25, 2024 09:35

poetry update

f956d01

Merge branch 'main' into test/k2

e53f1bc

poetry update (again)

0eefcd6

jamesrichards4 mentioned this pull request Jun 25, 2024

Feature/redbox 411 documents refactor #644

Merged

3 tasks

brunns added 2 commits June 25, 2024 09:44

Update the journey test for new route name.

9509ade

Post merge poetry lock update.

1f3fe74

gecBurton merged commit c66d8aa into main Jun 25, 2024
4 checks passed

gecBurton deleted the test/k2 branch June 25, 2024 09:04

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[A/B] Changes to the RAG functionality #630

[A/B] Changes to the RAG functionality #630

wpfl-dbt commented Jun 21, 2024 •

edited

Loading

gecBurton Jun 24, 2024

wpfl-dbt Jun 24, 2024

gecBurton Jun 24, 2024

wpfl-dbt Jun 24, 2024

gecBurton Jun 24, 2024

gecBurton left a comment

brunns Jun 24, 2024

wpfl-dbt Jun 24, 2024 •

edited

Loading

brunns Jun 25, 2024

jamesrichards4 Jun 25, 2024

brunns left a comment

gecBurton commented Jun 25, 2024

	"size": params["size"],
	"size": env.ai.rag_k or params["size"],

[A/B] Changes to the RAG functionality #630

[A/B] Changes to the RAG functionality #630

Conversation

wpfl-dbt commented Jun 21, 2024 • edited Loading

Context

Changes proposed in this pull request

Guidance to review

Relevant links

Things to check

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

gecBurton left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

wpfl-dbt Jun 24, 2024 • edited Loading

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

brunns left a comment

Choose a reason for hiding this comment

gecBurton commented Jun 25, 2024

wpfl-dbt commented Jun 21, 2024 •

edited

Loading

wpfl-dbt Jun 24, 2024 •

edited

Loading