Adjust how text is split depending on input type #1238

timothycarambat · 2024-04-30T17:11:26Z

resolves #1230

Fixes issue where prompt would be split erroneously by the embedder during vector search resulting in worse semantic similarity.

Important

We need to also ensure the prompt given (or chunks of prompts) are not longer than the embedder model's max length or prompt search will crash

resolves #1230

Adjust how text is split depending on input type

5f66e2a

resolves #1230

timothycarambat merged commit bf435b2 into master Apr 30, 2024

timothycarambat deleted the 1230-text-input-embed-chunking-bug branch April 30, 2024 17:11

timothycarambat mentioned this pull request Apr 30, 2024

[CHORE]: Embedder embedText length check #1239

Open

Provide feedback