
openai.error.InvalidRequestError: This model's maximum context length is 4097 tokens, however you requested 11836 tokens (11580 in your prompt; 256 for the completion). Please reduce your prompt; or completion length. #2333

Closed
wen020 opened this issue Apr 3, 2023 · 11 comments


wen020 commented Apr 3, 2023

This is my code for hooking up an LLM to answer questions over a database (a remote Postgres instance):

[screenshot of the code]
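The screenshot was not transcribed into this thread. As a stand-in, here is a minimal sketch of what such code typically looked like with LangChain at the time, assuming a SQLDatabaseChain over a remote Postgres connection (the connection string and question are hypothetical):

from langchain import OpenAI, SQLDatabase, SQLDatabaseChain

# Hypothetical connection string for the remote Postgres database
db = SQLDatabase.from_uri("postgresql+psycopg2://user:password@remote-host:5432/mydb")
llm = OpenAI(temperature=0)
db_chain = SQLDatabaseChain(llm=llm, database=db, verbose=True)
db_chain.run("How many rows are in the orders table?")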

But when I run it I get this error:

[screenshot of the error quoted in the issue title]

Can anyone give me some advice on how to solve this problem?

sergerdn (Contributor) commented Apr 3, 2023

Take a look at #2133 (comment).

wen020 (Author) commented Apr 3, 2023

> Take a look at #2133 (comment).

That looks like a good approach, but I don't know how to set reduce_k_below_max_tokens=True. Can you give me an example?

FahriKhalid commented

Same issue here, have you solved it?

Laisky commented May 6, 2023

> Take a look at #2133 (comment).
>
> That looks like a good approach, but I don't know how to set reduce_k_below_max_tokens=True. Can you give me an example?

Here is an example:

from langchain.chains import VectorDBQAWithSourcesChain
from langchain.chat_models import ChatOpenAI

chain_type_kwargs = {"prompt": prompt}  # prompt is defined elsewhere
llm = ChatOpenAI(
    model_name="gpt-3.5-turbo",
    temperature=0,
    max_tokens=1000)  # cap the completion length
chain = VectorDBQAWithSourcesChain.from_chain_type(
    llm=llm,
    vectorstore=index.store,
    return_source_documents=True,
    chain_type_kwargs=chain_type_kwargs,
    reduce_k_below_max_tokens=True,  # drop retrieved docs until the prompt fits
)

https://github.com/Laisky/HelloWorld/blob/master/py3/ailangchain/security.ipynb
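For what it's worth, a hypothetical invocation of the chain above might look like this (the question string is made up; with return_source_documents=True the result dict also carries the retrieved documents):

result = chain({"question": "What does the document say about authentication?"})
print(result["answer"])            # the generated answer
print(result["sources"])           # sources cited by the model
print(result["source_documents"])  # returned because return_source_documents=True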

MarabhaTheGreat commented

> [quotes Laisky's example above in full]

God Bless You

shuvracse03 commented May 21, 2023

If you are getting this error: "openai.error.InvalidRequestError: This model's maximum context length is 4097 tokens, however you requested 4638 tokens (4382 in your prompt; 256 for the completion). Please reduce your prompt; or completion length." The solution is below (change your host and port accordingly). The trick is to include only the tables you want. Here is an actual code snippet:

from langchain import OpenAI, SQLDatabase, SQLDatabaseChain
from sqlalchemy import create_engine
import os

os.environ["OPENAI_API_KEY"] = 'XXXXXX'

engine = create_engine('mysql+pymysql://admin:admin@localhost:3307/wordpress1')
include_tables = ['wp_greetings']

# Only the listed tables are described in the prompt, which keeps it small
db = SQLDatabase(engine, include_tables=include_tables)
llm = OpenAI(temperature=0, verbose=True, max_tokens=1000)
db_chain = SQLDatabaseChain(llm=llm, database=db, verbose=True)

db_chain.run("Describe wp_greetings table")

davidfan168 commented Jun 1, 2023

I tried to limit the context length by using the following code:

from langchain.agents import create_sql_agent
from langchain.agents.agent_toolkits import SQLDatabaseToolkit
from langchain.llms import OpenAI

llm = OpenAI(temperature=0, verbose=True, max_tokens=2000)

toolkit = SQLDatabaseToolkit(db=db, llm=llm, max_tokens=2000)

agent_executor = create_sql_agent(
    llm=OpenAI(temperature=0),
    toolkit=toolkit,
    verbose=True,
    reduce_k_below_max_tokens=True,
    max_tokens=2000,
)

But that didn't seem to work. I still got the following error:

InvalidRequestError: This model's maximum context length is 4097 tokens, however you requested 4151 tokens (3895 in your prompt; 256 for the completion). Please reduce your prompt; or completion length.

Is something wrong with the way I'm using the agent, or is this a bug in langchain? I am using the SQL agent on a large database, so the problem may be with how I'm using it.

shuvracse03 commented

David, you can try including only the necessary tables, as I showed above. This will definitely decrease the number of tokens.

davidfan168 commented

> David, you can try including only the necessary tables, as I showed above. This will definitely decrease the number of tokens.

Thanks for the suggestion!

I only have two tables in my database, but the tables have a ton of columns in them. In this case, should I try to build my own agent that can summarize tables?
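One option worth trying for wide tables (not suggested in this thread, so treat it as an assumption about your setup) is SQLDatabase's custom_table_info parameter, which replaces the auto-generated schema dump for a table with a short hand-written description, together with sample_rows_in_table_info=0 to drop the sample rows from the prompt. A minimal sketch with hypothetical table and column names:

from langchain import SQLDatabase
from sqlalchemy import create_engine

engine = create_engine('mysql+pymysql://admin:admin@localhost:3307/wordpress1')

# Hand-written, abbreviated schema instead of the full auto-generated one
custom_table_info = {
    "wp_greetings": "wp_greetings(id INT, message TEXT)  -- greetings sent by users",
}
db = SQLDatabase(
    engine,
    custom_table_info=custom_table_info,
    sample_rows_in_table_info=0,  # skip sample rows to save tokens
)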

foshesss commented Jun 9, 2023

Hi! I'm encountering the same issue as David. Is there a method to track token usage as the program runs? It would be very beneficial to monitor what actually counts toward the token total and potentially find reductions that way.
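One approach that LangChain supported at the time is the get_openai_callback context manager, which accumulates token counts across every OpenAI call made inside the with block. A minimal sketch, reusing db_chain as defined in the earlier comments:

from langchain.callbacks import get_openai_callback

with get_openai_callback() as cb:
    db_chain.run("Describe wp_greetings table")

# Totals accumulated across all OpenAI calls inside the block
print(cb.prompt_tokens, cb.completion_tokens, cb.total_tokens)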

dosubot bot commented Sep 22, 2023

Hi, @wen020! I'm Dosu, and I'm helping the LangChain team manage their backlog. I wanted to let you know that we are marking this issue as stale.

From what I understand, the issue is about the maximum context length for the model being 4097 tokens, but you are requesting 11836 tokens. In the comments, there are suggestions and examples provided by users on how to solve this problem. Some users suggest setting reduce_k_below_max_tokens=True to reduce the token length, while others suggest including only necessary tables to decrease the number of tokens. Additionally, there is a question about whether there is a method to track token usage during program execution.

Before we close this issue, we wanted to check with you if it is still relevant to the latest version of the LangChain repository. If it is, please let us know by commenting on the issue. Otherwise, feel free to close the issue yourself or it will be automatically closed in 7 days.

Thank you for your understanding and contribution to the LangChain project!

dosubot bot marked this issue as stale on Sep 22, 2023, and closed it as not planned on Sep 29, 2023.