
Chunks too Large #410

Closed
Patrick-Davis-MSFT opened this issue Dec 19, 2023 · 5 comments · Fixed by #478

@Patrick-Davis-MSFT
Collaborator

Occurs in Delta Release

Describe the bug
When uploading some documents, we receive the following error while using the Azure embeddings model ADA.

openai.error.InvalidRequestError: This model's maximum context length is 8191 tokens, however you requested 22089 tokens (22089 in your prompt; 0 for the completion). Please reduce your prompt; or completion length.

The embeddings model has a capacity of 352K TPM.
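Note that the 352K TPM quota is a per-minute rate limit, while the 8191-token figure in the error is a hard per-request context limit for text-embedding-ada-002, so extra capacity does not avoid this failure. A minimal pre-flight check is sketched below; tiktoken and the cl100k_base encoding are assumptions here, not necessarily what the pipeline uses:

```python
# Hypothetical pre-flight check: count tokens before calling the API so an
# oversized chunk fails fast instead of round-tripping an error from Azure.
import tiktoken

ADA_MAX_TOKENS = 8191  # per-request context limit cited in the error above

def fits_in_context(text: str) -> bool:
    # cl100k_base is the encoding used by text-embedding-ada-002
    encoding = tiktoken.get_encoding("cl100k_base")
    return len(encoding.encode(text)) <= ADA_MAX_TOKENS
```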

To Reproduce
Steps to reproduce the behavior:

  1. Install as normal
  2. Upload a file
  3. View the error in Cosmos DB

Expected behavior
The embeddings should resolve as normal; if a chunk is too big, it should be split into smaller chunks.
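A minimal sketch of that splitting, assuming tiktoken (this is not the fix that landed in #478): decode fixed-size windows of tokens back into text so each piece fits the model's context.

```python
import tiktoken

ADA_MAX_TOKENS = 8191

def split_chunk(text: str, max_tokens: int = ADA_MAX_TOKENS) -> list[str]:
    """Split text into pieces that each fit the embedding model's context."""
    encoding = tiktoken.get_encoding("cl100k_base")
    tokens = encoding.encode(text)
    # Decode fixed-size token windows back into text; each piece can then be
    # embedded separately while staying under the 8191-token cap.
    return [
        encoding.decode(tokens[i : i + max_tokens])
        for i in range(0, len(tokens), max_tokens)
    ]
```

A real fix would likely split on sentence or section boundaries and overlap windows to preserve retrieval quality; this only shows the token-window mechanics.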

Screenshots
The full error from 2023-12-19 18:52:43 is below:

"Traceback (most recent call last):  File \"/opt/python/3.10.13/lib/python3.10/threading.py\", line 973, in _bootstrap
self._bootstrap_inner()
  File \"/opt/python/3.10.13/lib/python3.10/threading.py\", line 1016, in _bootstrap_inner
    self.run()
  File \"/tmp/8dc00b6931bbb9f/antenv/lib/python3.10/site-packages/anyio/_backends/_asyncio.py\", line 807, in run
    result = context.run(func, *args)
  File \"/tmp/8dc00b6931bbb9f/app.py\", line 405, in poll_queue
    statusLog.upsert_document(blob_path, f'Message requed to embeddings queue, attempt {str(requeue_count)}. Visible in {str(backoff)} seconds. Error: {str(error)}.',
  File \"/tmp/8dc00b6931bbb9f/antenv/lib/python3.10/site-packages/tenacity/__init__.py\", line 382, in __call__
    result = fn(*args, **kwargs)
  File \"/tmp/8dc00b6931bbb9f/app.py\", line 85, in encode
    response = openai.Embedding.create(
  File \"/tmp/8dc00b6931bbb9f/antenv/lib/python3.10/site-packages/openai/api_resources/embedding.py\", line 33, in create
    response = super().create(*args, **kwargs)
  File \"/tmp/8dc00b6931bbb9f/antenv/lib/python3.10/site-packages/openai/api_resources/abstract/engine_api_resource.py\", line 153, in create
    response, _, api_key = requestor.request(
  File \"/tmp/8dc00b6931bbb9f/antenv/lib/python3.10/site-packages/openai/api_requestor.py\", line 226, in request
    resp, got_stream = self._interpret_response(result, stream)
  File \"/tmp/8dc00b6931bbb9f/antenv/lib/python3.10/site-packages/openai/api_requestor.py\", line 619, in _interpret_response
    self._interpret_response_line(
  File \"/tmp/8dc00b6931bbb9f/antenv/lib/python3.10/site-packages/openai/api_requestor.py\", line 679, in _interpret_response_line
    raise self.handle_error_response(
openai.error.InvalidRequestError: This model's maximum context length is 8191 tokens, however you requested 22089 tokens (22089 in your prompt; 0 for the completion). Please reduce your prompt; or completion length.

The above exception was the direct cause of the following exception:

Traceback (most recent call last):
  File \"/tmp/8dc00b6931bbb9f/app.py\", line 228, in embed_texts
    embeddings = model_obj.encode(texts)
  File \"/tmp/8dc00b6931bbb9f/antenv/lib/python3.10/site-packages/tenacity/__init__.py\", line 289, in wrapped_f
    return self(f, *args, **kw)
  File \"/tmp/8dc00b6931bbb9f/antenv/lib/python3.10/site-packages/tenacity/__init__.py\", line 379, in __call__
    do = self.iter(retry_state=retry_state)
  File \"/tmp/8dc00b6931bbb9f/antenv/lib/python3.10/site-packages/tenacity/__init__.py\", line 326, in iter
    raise retry_exc from fut.exception()
tenacity.RetryError: RetryError[<Future at 0x770bd92d7640 state=finished raised InvalidRequestError>]

The above exception was the direct cause of the following exception:

Traceback (most recent call last):
  File \"/tmp/8dc00b6931bbb9f/app.py\", line 348, in poll_queue
    embedding = embed_texts(target_embeddings_model, [text])
  File \"/tmp/8dc00b6931bbb9f/app.py\", line 242, in embed_texts
    raise HTTPException(status_code=500, detail=f\"Failed to embed: {str(error)}\") from error
fastapi.exceptions.HTTPException
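One detail visible in the trace: tenacity retries the call even though an InvalidRequestError is deterministic (the prompt is simply too long), so every retry fails identically before the RetryError surfaces. A sketch of excluding it from retries follows; the decorator settings are assumptions, not the repo's actual configuration:

```python
import openai
from tenacity import retry, retry_if_not_exception_type, stop_after_attempt, wait_exponential

@retry(
    # Retry transient failures, but give up immediately on a request that
    # can never succeed because the input exceeds the context limit.
    retry=retry_if_not_exception_type(openai.error.InvalidRequestError),
    stop=stop_after_attempt(5),
    wait=wait_exponential(multiplier=1, max=30),
)
def encode(texts):
    # openai<1.0 style call, matching the traceback; `engine` is the Azure
    # deployment name and is an assumption here.
    response = openai.Embedding.create(input=texts, engine="text-embedding-ada-002")
    return [item["embedding"] for item in response["data"]]
```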

Desktop (please complete the following information):

  • OS: Windows
  • Browser: Edge
  • Version: Delta 0.4 (Main Branch)

Alpha version details

  • GitHub branch: Main
  • Latest commit: Dec 15, 2023

Additional context
A commit for the autoscaler was pushed while I was writing this ticket. I will retry with the latest and update once done.

@Patrick-Davis-MSFT
Collaborator Author

This is confirmed to exist on the Main Branch as of 12/19. Reach out for the file.

@Patrick-Davis-MSFT
Collaborator Author

On the 12/19 build of Main: it does work with the BAAI embeddings.

@Patrick-Davis-MSFT
Collaborator Author

Recreated with the attached file: What would be better.pdf

@dayland dayland added the bug Something isn't working label Jan 4, 2024
@dayland dayland added this to the 1.0 milestone Jan 4, 2024
@dayland
Contributor

dayland commented Jan 10, 2024

@Patrick-Davis-MSFT, as you can see, George and the team have begun working on this issue. The fix is a bit too deep and complex to make it into the v1.0 release this close to the release date, so we have begun the work but are targeting a hotfix after the v1.0 release. Just wanted to give you an update on the plan.

@dayland
Contributor

dayland commented Feb 6, 2024

@Patrick-Davis-MSFT, this hotfix has been applied to main in #478. Closing this issue.

@dayland dayland closed this as completed Feb 6, 2024