Upgrade/Change Weaviate Schema to Use `text-embedding-3-small` #297

davidgxue · 2024-02-13T19:45:43Z

Description

Technical Changes

Only schema.json changes. This switches the vectorizer used during both ingestion and retrieval.
- On ingestion, a new index is created if it doesn't already exist. During retrieval, this index's vectorizer is used to vectorize the user query.
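For illustration, a minimal sketch of what a class definition using the new vectorizer might look like. The PR does not show schema.json itself, so the class name `Docs` and the `content` property are hypothetical; the `text2vec-openai` module settings are assumed to follow the layout Weaviate documents for the v3 OpenAI embedding models.

```python
import json


def build_class_schema(class_name: str, model: str = "text-embedding-3-small") -> dict:
    """Build one Weaviate class definition that vectorizes with an OpenAI model.

    The class name and property list are placeholders, not the project's
    real schema.
    """
    return {
        "class": class_name,
        # The same vectorizer is applied to objects at ingestion time and to
        # the user query at retrieval time.
        "vectorizer": "text2vec-openai",
        "moduleConfig": {
            "text2vec-openai": {
                "model": model,
                "type": "text",
            }
        },
        "properties": [
            {"name": "content", "dataType": ["text"]},  # hypothetical property
        ],
    }


schema = {"classes": [build_class_schema("Docs")]}
print(json.dumps(schema, indent=2))
```

Because the vectorizer is set per class, objects embedded with the old model would need to be re-ingested into an index created with the new setting; the two models' vectors are not comparable.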

Tests & Evaluation

No significant difference. Small quality improvements for some questions. No quality degradation.
The model is much cheaper than the original V2 ada model.
See details in this comment: #297 (comment)

Related Issues

closes #286
partially completes #295

sunank200

I would love to know more about the test and evaluation for this change.

davidgxue · 2024-02-22T17:35:50Z

Evaluation is on hold because the cloud cluster (not local) runs an older version of Weaviate that does not support the new model. I am trying to get access to the remote Weaviate cluster from IT and will be blocked until then.

cloudflare-pages · 2024-02-28T01:39:15Z

Deploying with Cloudflare Pages

Latest commit:	`e3727ab`
Status:	✅ Deploy successful!
Preview URL:	https://af944af7.ask-astro.pages.dev
Branch Preview URL:	https://upgrade-text-embedding-model.ask-astro.pages.dev

davidgxue · 2024-02-28T20:02:53Z

new_embed_model_comparison.csv
I upgraded the Weaviate cluster version and ran some tests using the new embedding model. Very few questions in this quick test set had different links/documents retrieved. For the vast majority of questions, the retrieved documents had either the same links or highly similar ones. Where the results did change, the differences were very small and document relevancy generally improved. This is likely because we use a hybrid search approach, followed by a reranker and an LLM prompted to filter at the end, so changing the embedding model, which is only one part of the retrieval process, did not have a significant impact.

Since this new model generally performs better according to OpenAI's own metrics as well as other researchers, and costs significantly less than the older V2 ada embedding model, I will go ahead and upgrade to the newer version.
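A rough sketch of how the per-question link comparison described in this comment could be quantified. This is not the script used to produce new_embed_model_comparison.csv; the function names, the input shape (question mapped to a list of retrieved links), and the sample URLs are all assumptions made for illustration.

```python
def jaccard(a, b):
    """Jaccard overlap between two collections of links (1.0 means identical sets)."""
    a, b = set(a), set(b)
    if not a and not b:
        return 1.0
    return len(a & b) / len(a | b)


def compare_retrievals(old, new):
    """Compare retrieved links per question for two embedding models.

    `old` and `new` map each question to the list of links retrieved with
    the old and new model respectively (hypothetical input format).
    """
    rows = []
    for question, old_links in old.items():
        if question not in new:
            continue  # skip questions that were not run against both models
        new_links = new[question]
        rows.append({
            "question": question,
            "overlap": jaccard(old_links, new_links),
            "changed": set(old_links) != set(new_links),
        })
    return rows


# Made-up sample data, not taken from the attached CSV.
old_results = {
    "q1": ["https://example.com/a", "https://example.com/b"],
    "q2": ["https://example.com/a", "https://example.com/b"],
}
new_results = {
    "q1": ["https://example.com/b", "https://example.com/a"],
    "q2": ["https://example.com/a", "https://example.com/d"],
}

report = compare_retrievals(old_results, new_results)
changed = [row["question"] for row in report if row["changed"]]
print(f"{len(changed)} of {len(report)} questions retrieved different links: {changed}")
```

Set overlap ignores ranking order, which may be acceptable here since a reranker runs after retrieval; a rank-aware metric would be needed to study ordering changes.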

Change Weaviate to use text-embedding-3-small

cdd461e

davidgxue self-assigned this Feb 13, 2024

davidgxue marked this pull request as ready for review February 16, 2024 06:10

davidgxue requested review from Lee-W, pankajastro and sunank200 as code owners February 16, 2024 06:10

davidgxue added this to the 0.3.0 milestone Feb 16, 2024

sunank200 approved these changes Feb 21, 2024

Lee-W approved these changes Feb 22, 2024

change docker file for weaviate version update

e3727ab

davidgxue requested a review from jlaneve as a code owner February 28, 2024 01:38

davidgxue merged commit 6fbe2d4 into main Feb 28, 2024
8 checks passed

davidgxue deleted the upgrade-text-embedding-model-to-v3 branch February 28, 2024 20:04
