
AzureOpenAI InvalidRequestError: Too many inputs. The max number of inputs is 1. #4575

Closed
Aspyryan opened this issue May 12, 2023 · 28 comments · Fixed by #10707

Comments

@Aspyryan

Aspyryan commented May 12, 2023

System Info

Langchain version == 0.0.166
Embeddings = OpenAIEmbeddings - model: text-embedding-ada-002 version 2
LLM = AzureOpenAI

Who can help?

@hwchase17
@agola11

Information

  • The official example notebooks/scripts
  • My own modified scripts

Related Components

  • LLMs/Chat Models
  • Embedding Models
  • Prompts / Prompt Templates / Prompt Selectors
  • Output Parsers
  • Document Loaders
  • Vector Stores / Retrievers
  • Memory
  • Agents / Agent Executors
  • Tools / Toolkits
  • Chains
  • Callbacks/Tracing
  • Async

Reproduction

Steps to reproduce:

  1. Set up Azure OpenAI embeddings by providing the key, version, etc.
  2. Load a document with a loader.
  3. Set up a text splitter so you get more than 2 documents.
  4. Add them to Chroma with .add_documents(List<Document>).

This is some example code:

from langchain.document_loaders import PyPDFLoader
from langchain.text_splitter import CharacterTextSplitter

pdf = PyPDFLoader(url)  # url: path or URL of the PDF
documents = pdf.load()

text_splitter = CharacterTextSplitter(chunk_size=1000, chunk_overlap=0)
texts = text_splitter.split_documents(documents)
vectordb.add_documents(texts)  # vectordb: a Chroma store backed by the Azure OpenAI embeddings
vectordb.persist()

Expected behavior

Embeddings should be added to the database. Instead, the call returns the error openai.error.InvalidRequestError: Too many inputs. The max number of inputs is 1. We hope to increase the number of inputs per request soon. Please contact us through an Azure support request at: https://go.microsoft.com/fwlink/?linkid=2213926 for further questions.
This happens because Microsoft only allows one embedding input per request, while the script tries to embed all the documents at once.
I believe the issue comes from this code: https://github.com/hwchase17/langchain/blob/258c3198559da5844be3f78680f42b2930e5b64b/langchain/embeddings/openai.py#L205-L214
The input should be a one-dimensional array (a single text), not a multi-dimensional one.
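
To illustrate the request shape (a hedged sketch against the pre-1.0 openai Python SDK that LangChain wrapped at the time; the engine/deployment name is an assumption):

import openai

# Azure rejects a batched input list but accepts a single-item one:
openai.Embedding.create(input=["text one", "text two"], engine="embeddings")  # InvalidRequestError on Azure
openai.Embedding.create(input=["text one"], engine="embeddings")              # works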

@Aspyryan
Copy link
Author

I might have mitigated the issue by adding the chunk size to the embeddings:
embedding = OpenAIEmbeddings(deployment="embeddings", model="text-embedding-ada-002", chunk_size=1)

@fastsyrup

In the JavaScript version of langchain, the parameter chunk_size is named batchSize.

@lingfengchencn

Yes, on Azure you have to embed one item at a time.

Correct example:

["id1"], ["meta1"], ["doc1"]

Wrong example:

["id1", "id2"], ["meta1", "meta2"], ["doc1", "doc2"]

With the second form you will get:
Too many inputs. The max number of inputs is 1. We hope to increase the number of inputs per request soon....... 😂
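
A minimal sketch of that one-at-a-time pattern, assuming a chromadb collection object named collection and parallel lists ids/metas/docs (hypothetical names):

# One item per .add() call keeps each embedding request at a single input.
for doc_id, meta, doc in zip(ids, metas, docs):
    collection.add(ids=[doc_id], metadatas=[meta], documents=[doc])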

@tushaar9027

Can anyone tell me how to fix this issue?

@lucasandre22

lucasandre22 commented Jun 13, 2023

I might have mitigated the issue by adding the chunk size to the embeddings: embedding = OpenAIEmbeddings(deployment="embeddings", model="text-embedding-ada-002", chunk_size=1)

Why did you choose chunk_size = 1? And can you explain why it does not work the same way as the chunk_size used in the text_splitter?
Thanks in advance.

@jeremiah-dibble

I might have mitigated the issue by adding the chunk size to the embeddings: embedding = OpenAIEmbeddings(deployment="embeddings", model="text-embedding-ada-002", chunk_size=1)

Why did you choose chunk_size = 1? And can you explain why it does not work the same way as the chunk_size used in the text_splitter? Thanks in advance.

I am not positive, but from my understanding, Azure only allows you to embed one string at a time. It will give you the error described above if you try to send more than one, so we must limit our chunk_size to one if we are using Azure and have not had our limit increased.

@lucasandre22

Thank you. I now understand the difference between the chunk_size in the embeddings and the one in the text_splitter: they work differently, since in the embeddings it refers to the number of chunks sent per batch.
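
A short sketch of that distinction, using values quoted earlier in this thread:

from langchain.embeddings import OpenAIEmbeddings
from langchain.text_splitter import CharacterTextSplitter

text_splitter = CharacterTextSplitter(chunk_size=1000, chunk_overlap=0)  # chunk_size: characters per text chunk
embeddings = OpenAIEmbeddings(deployment="embeddings", chunk_size=1)     # chunk_size: texts per embedding request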

@PRAJINPRAKASH

After changing the chunk size to 1, I am now hitting a rate limit error.
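
One possible mitigation, hedged (max_retries is the same parameter @marctorsoc uses later in this thread): let the client back off and retry on rate limit errors.

embeddings = OpenAIEmbeddings(deployment="embeddings", model="text-embedding-ada-002", chunk_size=1, max_retries=10)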

@makoto-soracom

How about the following code?

pdf = PyPDFLoader(url)
documents = pdf.load()

text_splitter = CharacterTextSplitter(chunk_size=1000, chunk_overlap=0)
texts = text_splitter.split_documents(documents)

for text in texts:
    vectordb.add_documents([text])  # one document per call = one input per Azure request
vectordb.persist()

@vbonluk

vbonluk commented Jun 24, 2023

@makoto-soracom it works! But I don't think it's ideal, because we call the vector DB and the Azure API once per document. Just to work around the Azure restriction, it wastes calls to the vector DB.

@aiakubovich

With chunk_size = 1, I am getting an error when doing big embedding runs.

@lucasandre22

I got several warnings about the chunk size being bigger than 1, but even with those warnings I was able to load and use the chunked documents.

@aiakubovich

@lucasandre22 do you use Azure OpenAI APIs?

@lucasandre22

yep, no issues besides those warnings.

@kylooh

kylooh commented Jul 4, 2023

I'm getting this error too; it seems Azure does not support multiple inputs per request, so I switched to OpenAI's API, and now it works fine. Azure covers this in their FAQ: https://learn.microsoft.com/en-us/azure/cognitive-services/openai/faq#i-am-trying-to-use-embeddings-and-received-the-error--invalidrequesterror--too-many-inputs--the-max-number-of-inputs-is-1---how-do-i-fix-this-

@sunyq1995

chunk_size = 1

With chunk_size = 1, I am getting an error when doing big embedding runs.

Me too.

@lucasandre22

Which error are you getting?

@sunyq1995

Which error are you getting?

Sorry for the wrong reply. What I meant to say is: after changing the chunk size to 1, a rate limit error happens.

I solved it by building the embeddings one by one, and building the vector store with FAISS.from_embeddings() instead of FAISS.from_texts().

@marctorsoc

Not sure if @sunyq1995 means this, but this worked for me, and I think it was faster than doing from_texts:

embeddings = OpenAIEmbeddings(
    deployment=embedding_deployment_id,
    model=embedding_model_name,
    chunk_size=1,        # one text per request, per the Azure limit
    max_retries=10,      # back off and retry on rate limit errors
    show_progress_bar=True,
)

loader = DataFrameLoader(
    data_df,
    page_content_column="text",
)
text_splitter = TokenTextSplitter(chunk_size=2_000, chunk_overlap=5)
documents = text_splitter.split_documents(loader.load())

returned_embeddings = embeddings.embed_documents(
    [doc.page_content for doc in documents],
)

docsearch = FAISS.from_embeddings(
    text_embeddings=[
        (doc.page_content, embedding)
        for doc, embedding in zip(documents, returned_embeddings)
    ],
    embedding=embeddings,
    metadatas=[doc.metadata for doc in documents],
)

@sunyq1995

sunyq1995 commented Jul 17, 2023

(quoting @marctorsoc's snippet above)

Thanks @marctorsoc for clarifying, that's exactly what I meant. One question: do you get any rate limit warnings when you run this part?

returned_embeddings = embeddings.embed_documents(
    [doc.page_content for doc in documents],
)
@ThorstenHans

ThorstenHans commented Jul 21, 2023

It looks like the team increased the limit; a chunk_size of 16 works for me (with a deployed text-embedding-ada-002).

I've deployed my Azure OpenAI instance to eastus (quotas may differ per Azure region).
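
For reference, a minimal sketch of that setting (the deployment name is an assumption):

embeddings = OpenAIEmbeddings(
    deployment="embeddings",
    model="text-embedding-ada-002",
    chunk_size=16,  # matches the raised Azure per-request input limit
)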

@kadereub

kadereub commented Jul 26, 2023

I was able to pass in 16 documents at a time too without the max-number-of-inputs error; however, I had quite a few documents. A for loop worked for me, but I had to add time.sleep(2), otherwise I got a rate limit error:

openai.error.RateLimitError: Requests to the Get a vector representation of a given input that can be easily consumed by machine learning models and algorithms. Operation under Azure OpenAI API version 2023-03-15

see this thread.

example code:

batch = 16
total_docs = len(all_docs)
vectordb = Chroma(persist_directory=persist_directory, embedding_function=embeddings_model)

for i in range(0, total_docs, batch):
    sample_docs = all_docs[i:i + batch]  # slice the same list we measured above
    vectordb.add_documents(sample_docs)
    time.sleep(2)  # embarrassing but works
vectordb.persist()

@huislaw

huislaw commented Aug 4, 2023

I get the same error when using VectorstoreIndexCreator with Azure OpenAI.
How can I set the max number of inputs?

Too many inputs. The max number of inputs is 16. We hope to increase the number of inputs per request soon. Please contact us through an Azure support...


from langchain.document_loaders import CSVLoader
from langchain.indexes import VectorstoreIndexCreator
from langchain.vectorstores import DocArrayInMemorySearch

file = 'OutdoorClothingCatalog_1000.csv'
loader = CSVLoader(file_path=file)

index = VectorstoreIndexCreator(
    vectorstore_cls=DocArrayInMemorySearch,
    embedding=embeddings,  # an OpenAIEmbeddings instance configured for Azure
).from_loaders([loader])
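
Based on the rest of this thread, one likely fix (an assumption, not verified against this exact snippet) is to cap the batch size on the embeddings object that gets passed in:

embeddings = OpenAIEmbeddings(deployment="embeddings", chunk_size=16)  # stay within the Azure limit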

@alan890104

@huislaw here is my solution: use the chunkify function below to customize the max number of inputs (max = 16).

import logging
import os
from typing import Sequence

from langchain.document_loaders import WebBaseLoader
from langchain.embeddings import OpenAIEmbeddings
from langchain.chat_models import AzureChatOpenAI
from langchain.text_splitter import RecursiveCharacterTextSplitter
from langchain.vectorstores import Chroma
from langchain.retrievers.multi_query import MultiQueryRetriever
from langchain.chains import RetrievalQA

logging.basicConfig()
logging.getLogger("langchain.retrievers.multi_query").setLevel(logging.INFO)


def chunkify(arr: Sequence, size: int = 8):
    # Yield successive batches of at most `size` items (keep size <= 16 for Azure).
    for i in range(0, len(arr), size):
        yield arr[i : i + size]


embedder = OpenAIEmbeddings(
    openai_api_key=os.getenv("OPENAI_EMBEDDING_API_KEY"),
    openai_api_base=os.getenv("OPENAI_EMBEDDING_API_BASE"),
    openai_api_version=os.getenv("OPENAI_EMBEDDING_API_VERSION"),
    openai_api_type=os.getenv("OPENAI_EMBEDDING_API_TYPE"),
    deployment=os.getenv("OPENAI_EMBEDDING_API_MODEL"),
)


chatllm = AzureChatOpenAI(
    openai_api_key=os.getenv("OPENAI_CHAT_API_KEY"),
    openai_api_base=os.getenv("OPENAI_CHAT_API_BASE"),
    openai_api_version=os.getenv("OPENAI_CHAT_API_VERSION"),
    openai_api_type=os.getenv("OPENAI_CHAT_API_TYPE"),
    deployment_name=os.getenv("OPENAI_CHAT_API_MODEL"),
    temperature=0,
)

with open("document_urls.txt", "r") as F:
    urls = F.read().split("\n")


loader = WebBaseLoader(web_path=urls)
data = loader.load()

text_splitter = RecursiveCharacterTextSplitter(chunk_size=200, chunk_overlap=0)
all_splits = text_splitter.split_documents(data)

vectorstore = Chroma(embedding_function=embedder)
for chunk in chunkify(all_splits):
    vectorstore.add_documents(chunk)

retriever_from_llm = MultiQueryRetriever.from_llm(
    retriever=vectorstore.as_retriever(), llm=chatllm
)

qa_chain = RetrievalQA.from_chain_type(chatllm, retriever=retriever_from_llm)
result = qa_chain({"query": "How many versions are there in AAVE"})
print(result)

@johnjensenish

johnjensenish commented Sep 5, 2023

As of 8/5/23, the easiest fix is to pass chunk_size=16 when creating OpenAIEmbeddings for an Azure deployment. Some of the other solutions here are more complicated than using this built-in functionality. As others have noted, the limit has been increased from 1 to 16.

Confusingly, this value is distinct from the chunk size used for text splitting. Here, the setting tells the OpenAIEmbeddings object to create 16 embeddings at a time, which conforms to the Azure limit. In the TypeScript version of langchain, the name of this configuration is batchSize.
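
A minimal sketch of this fix applied to the original reproduction (the deployment name and persist directory are assumptions):

embeddings = OpenAIEmbeddings(deployment="embeddings", chunk_size=16)  # 16 texts per Azure request
vectordb = Chroma(persist_directory="db", embedding_function=embeddings)
vectordb.add_documents(texts)  # no more "Too many inputs" error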

@mspronesti
Contributor

mspronesti commented Sep 19, 2023

Can you guys try the patch in #10707?

baskaryan pushed a commit that referenced this issue Sep 20, 2023

This PR addresses the limitation of Azure OpenAI embeddings, which can
handle at most 16 texts per batch. This can be solved by setting
`chunk_size=16`. However, I'd love to have this automated, so the user
doesn't have to figure out where the issue comes from and how to solve it.

Closes #4575. 

@baskaryan

---------

Co-authored-by: Harrison Chase <hw.chase.17@gmail.com>
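
Roughly, the idea the PR describes, sketched as a hint (this is not the actual patch code, and the bare variable names are assumptions):

# Cap the embedding batch size automatically when the API type is Azure.
if openai_api_type == "azure" and chunk_size > 16:
    chunk_size = 16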
@shruti-z

chunk_size here in the Azure OpenAIEmbeddings() refers to the number of embeddings it creates per request, as opposed to the langchain text splitter's chunk_size, which controls the size of each chunk. chunk_size = 16 worked at the time of writing.

@jdeepak-4u

Adding chunk_size = 1 while creating the embeddings worked for me when using an Azure OpenAI API key.

embeddings = AzureOpenAIEmbeddings(
    deployment=AZURE_OPENAI_DEPLOYMENT,
    openai_api_version=AZURE_OPENAI_VERSION,
    chunk_size=1,
)
