fixed json parsing for multiple generated JSON code segments, plus co… #5037

rickbraddy-pharma · 2023-05-20T15:23:27Z

Fixes showstopper issue with parsing LLM-generated code that returns one or multiple code segments in JSON format

Fixes # (issue)

Before submitting

Who can review?

Community members can review the PR once tests pass. Tag maintainers/contributors who might be interested:

…mments

Kav-K · 2023-05-21T05:03:02Z

Can someone approve this workflow at least? This is an incredibly important fix. @hwchase17

0ptim · 2023-05-21T12:03:09Z

I also have many failed runs like #2679 .

My agent sometimes returns:

```\n{\n    \"action\": \"Final Answer\",\n    \"action_input\": \"The latest block count is 2950616.\"\n}\n```

Instead of:

\n{\n    \"action\": \"Final Answer\",\n    \"action_input\": \"The latest block count is 2950616.\"\n}\n

schinto · 2023-05-22T12:41:31Z

I guess the issue with the LLM-generated code is not fixed yet.
Using langchain version 0.0.176 the following error occurs when code is returned in triple backticks:

2023-05-22 14:16:40.713 Uncaught app exception
Traceback (most recent call last):
File "/home/user1/scratch/conda/envs/lcpy39/lib/python3.9/site-packages/langchain/agents/conversational_chat/output_parser.py", line 21, in parse
cleaned_output, _ = cleaned_output.split("```")
ValueError: too many values to unpack (expected 2)

The above exception was the direct cause of the following exception:

Traceback (most recent call last):
File "/home/user1/scratch/conda/envs/lcpy39/lib/python3.9/site-packages/streamlit/runtime/scriptrunner/script_runner.py", line 565, in _run_script
exec(code, module.dict)
File "/home/user1/langchain-tools/chatbot_streamlit_chat.py", line 116, in
output, total_tokens, prompt_tokens, completion_tokens = generate_response(
File "/home/user1/langchain-tools/chatbot_streamlit_chat.py", line 96, in generate_response
response = chatbot.agent({"input": query})
File "/home/user1/scratch/conda/envs/lcpy39/lib/python3.9/site-packages/langchain/chains/base.py", line 140, in call
raise e
File "/home/user1/scratch/conda/envs/lcpy39/lib/python3.9/site-packages/langchain/chains/base.py", line 134, in call
self._call(inputs, run_manager=run_manager)
File "/home/user1/scratch/conda/envs/lcpy39/lib/python3.9/site-packages/langchain/agents/agent.py", line 947, in _call
next_step_output = self._take_next_step(
File "/home/user1/scratch/conda/envs/lcpy39/lib/python3.9/site-packages/langchain/agents/agent.py", line 773, in _take_next_step
raise e
File "/home/user1/scratch/conda/envs/lcpy39/lib/python3.9/site-packages/langchain/agents/agent.py", line 762, in _take_next_step
output = self.agent.plan(
File "/home/user1/scratch/conda/envs/lcpy39/lib/python3.9/site-packages/langchain/agents/agent.py", line 444, in plan
return self.output_parser.parse(full_output)
File "/home/user1/scratch/conda/envs/lcpy39/lib/python3.9/site-packages/langchain/agents/conversational_chat/output_parser.py", line 36, in parse
raise OutputParserException(f"Could not parse LLM output: {text}") from e
langchain.schema.OutputParserException: Could not parse LLM output: ```json
{
"action": "Final Answer",
"action_input": "```python\ndef fibonacci(n):\n if n <= 0:\n return []\n elif n == 1:\n return [0]\n elif n == 2:\n return [0, 1]\n else:\n fib_seq = [0, 1]\n for i in range(2, n):\n fib_seq.append(fib_seq[-1] + fib_seq[-2])\n return fib_seq\n\ndef main():\n fib_numbers = fibonacci(10)\n print(fib_numbers)\n\nif __name__ == '__main__':\n main()\n```\n\nThe algorithm used in this script is an iterative approach to calculate the first 10 Fibonacci numbers. It starts with the base cases of n=1 and n=2, returning [0] and [0, 1] respectively. For n > 2, the script initializes the Fibonacci sequence with the first two numbers (0 and 1) and iterates from the third number to the nth number, appending the sum of the previous two numbers in the sequence. The main() function calls the fibonacci() function with the input 10 and prints the resulting list of Fibonacci numbers."
}
```

schinto · 2023-05-22T12:49:50Z

Sorry, I just saw that the fix is not in release 0.0.176 yet.

rickbraddy-pharma · 2023-05-22T13:35:39Z

Can we please approve the workflow - or is there something else needed to proceed? (this is my first pull request, so happy to adjust as appropriate).

dev2049

thanks - lets adds some tests for things that this can handle so we don't have any regressions in the future

rickbraddy-pharma · 2023-05-22T20:24:45Z

Resolved Black code formatting issue.

If additional testers are needed, please advise specifics (e.g., which one to modify or a good example to copy/modify)

rickbraddy-pharma · 2023-05-22T20:27:52Z

thanks - lets adds some tests for things that this can handle so we don't have any regressions in the future

Happy to do so. Can you point me to existing tester you'd suggest that I update?

dev2049 · 2023-05-22T21:36:22Z

thanks - lets adds some tests for things that this can handle so we don't have any regressions in the future

Happy to do so. Can you point me to existing tester you'd suggest that I update?

don't think any unit tests exist for this class yet, would make a new file at tests/unit_tests/agents/conversational_chat/test_output_parser.py and add there

rickbraddy-pharma · 2023-05-23T15:21:22Z

I have submitted the fix again, this time addressing the "ruff" code format issues.

I have no idea where to start to develop the unit tester...

rickbraddy-pharma · 2023-05-23T18:58:09Z

Figured it out. Will deliver unit tester soon...

…sing

rickbraddy-pharma · 2023-05-23T19:55:00Z

Okay. Unit tester is now available.

rickbraddy-pharma · 2023-05-23T20:00:08Z

How should we handle OPEN API keys?

_ ERROR collecting tests/unit_tests/agents/conversational_chat/test_output_parser.py _
tests/unit_tests/agents/conversational_chat/test_output_parser.py:11: in
llm_chatgpt = ChatOpenAI(temperature=0.3, model_name="gpt-3.5-turbo")
pydantic/main.py:341: in pydantic.main.BaseModel.init
???
E pydantic.error_wrappers.ValidationError: 1 validation error for ChatOpenAI
E root
E Did not find openai_api_key, please add an environment variable OPENAI_API_KEY which contains it, or pass openai_api_key as a named parameter. (type=value_error)

tests/unit_tests/agents/conversational_chat/test_output_parser.py

rickbraddy-pharma · 2023-05-24T16:51:05Z

The unit tester has been rewritten in accordance with the recommendations. Please re-run the workflow.

rickbraddy-pharma · 2023-05-24T19:01:28Z

Not sure why the latest commit is a conflict. The entire file was rewritten to replace the prior commit.

schinto · 2023-05-25T12:57:42Z

Do the tests cover cases with nested triple backticks ?
As shown below:

langchain.schema.OutputParserException: Could not parse LLM output: ```json
{
"action": "Final Answer",
"action_input": "Here is example code\n```python\nprint('Hello World')\n```\n"
}
```

I wonder how the function parse_json_markdown will work in this case?

json_string = json_string.replace("```json", "").replace("```", "")

removes all triple backticks, whereas the backticks in action_input should be preserved as they are part of the markdown.

rickbraddy-pharma · 2023-05-25T13:54:34Z

Yes, it covers the case where code gets returned within the JSON string, starting with triple backticks and followed by type of code... as shown in the working unit tester.

rickbraddy-pharma · 2023-05-25T13:55:05Z

Who should resolve the conflicts that have cropped up?

schinto · 2023-05-25T14:03:42Z

Yes, it covers the case where code gets returned within the JSON string, starting with triple backticks and followed by type of code... as shown in the working unit tester.

Sorry, I could not see this nested case in the unit tester.

rickbraddy-pharma · 2023-05-25T14:56:31Z

It was this nested case, where '''python or '''javascript get returned when code is generated that caused the original issues with the output parser. This is what was fixed and why the unit tester was developed to test the code/no-code return cases.

I actually added logging to the output parser, captured actual returned code and non-code parser examples, then included those in the unit tester, to ensure they are correctly structured.

Please let me know if further refinements are needed to merge.

Thanks
Rick

schinto · 2023-05-30T07:53:30Z

It was this nested case, where '''python or '''javascript get returned when code is generated that caused the original issues with the output parser. This is what was fixed and why the unit tester was developed to test the code/no-code return cases.

I just tested with langchain version 0.0.184 and the parsing of code blocks including triple backticks fails, see error below:

File "/home/user1/scratch/conda/envs/lcpy39/lib/python3.9/site-packages/langchain/agents/conversational_chat/output_parser.py", line 24, in parse
raise OutputParserException(f"Could not parse LLM output: {text}") from e
langchain.schema.OutputParserException: Could not parse LLM output: ```json
{
"action": "Final Answer",
"action_input": "Here's a Python script to remove triple backticks at the start and end of a string, allowing spaces:\n\n```python\ndef remove_triple_backticks(s):\n return s.strip().lstrip('```').rstrip('```')\n\ninput_string = ' ```example text``` '\noutput_string = remove_triple_backticks(input_string)\nprint(output_string)\n```\n\nThis script defines a function `remove_triple_backticks` that takes a string as input, removes any leading and trailing spaces using `strip()`, and then removes triple backticks from the start and end of the string using `lstrip()` and `rstrip()`. The example input string is processed and the result is printed."
}
```

rickbraddy-pharma · 2023-05-30T15:52:43Z

Clearly this needs more rework...

fixed json parsing for multiple generated JSON code segments, plus co…

c0c1182

…mments

rickbraddy-pharma mentioned this pull request May 20, 2023

Chat agent does not parse properly when model returns a code block #2679

Closed

0ptim mentioned this pull request May 21, 2023

Agent runs failing sometimes due to output parser exception 0ptim/JellyChat#46

Closed

dev2049 reviewed May 22, 2023

View reviewed changes

rickbraddy-pharma added 4 commits May 22, 2023 19:49

resolved lint issues and reverified parsing of code is correct

6486e9e

resolved lint issues and reverified parsing of code is correct

e4b604f

Removed the Backups folder

bcffca6

Reformatted code with Black

02cca63

resolved ruff scan issues

272b87f

rickbraddy-pharma added 2 commits May 23, 2023 19:05

new unit tester for conversational_chat output_processor for code par…

c085fe7

…sing

new unit tester for conversational_chat parser for code

f316f77

eyurtsev reviewed May 23, 2023

View reviewed changes

tests/unit_tests/agents/conversational_chat/test_output_parser.py Outdated Show resolved Hide resolved

eyurtsev reviewed May 23, 2023

View reviewed changes

tests/unit_tests/agents/conversational_chat/test_output_parser.py Outdated Show resolved Hide resolved

rewrote tester for conversational chat output parser

7aac7a1

rickbraddy-pharma closed this May 30, 2023

rickbraddy-pharma deleted the fix-conversational-chat-json-parser branch May 30, 2023 15:52

dosubot bot mentioned this pull request Jul 27, 2023

LLM output parsing error for answers including code sections. #8357

Closed

14 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

fixed json parsing for multiple generated JSON code segments, plus co… #5037

fixed json parsing for multiple generated JSON code segments, plus co… #5037

rickbraddy-pharma commented May 20, 2023

Kav-K commented May 21, 2023

0ptim commented May 21, 2023

schinto commented May 22, 2023

schinto commented May 22, 2023 •

edited

Loading

rickbraddy-pharma commented May 22, 2023

dev2049 left a comment

rickbraddy-pharma commented May 22, 2023

rickbraddy-pharma commented May 22, 2023

dev2049 commented May 22, 2023

rickbraddy-pharma commented May 23, 2023

rickbraddy-pharma commented May 23, 2023

rickbraddy-pharma commented May 23, 2023

rickbraddy-pharma commented May 23, 2023

rickbraddy-pharma commented May 24, 2023

rickbraddy-pharma commented May 24, 2023

schinto commented May 25, 2023 •

edited

Loading

rickbraddy-pharma commented May 25, 2023

rickbraddy-pharma commented May 25, 2023

schinto commented May 25, 2023

rickbraddy-pharma commented May 25, 2023

schinto commented May 30, 2023

rickbraddy-pharma commented May 30, 2023

fixed json parsing for multiple generated JSON code segments, plus co… #5037

fixed json parsing for multiple generated JSON code segments, plus co… #5037

Conversation

rickbraddy-pharma commented May 20, 2023

Fixes showstopper issue with parsing LLM-generated code that returns one or multiple code segments in JSON format

Before submitting

Who can review?

Kav-K commented May 21, 2023

0ptim commented May 21, 2023

schinto commented May 22, 2023

schinto commented May 22, 2023 • edited Loading

rickbraddy-pharma commented May 22, 2023

dev2049 left a comment

Choose a reason for hiding this comment

rickbraddy-pharma commented May 22, 2023

rickbraddy-pharma commented May 22, 2023

dev2049 commented May 22, 2023

rickbraddy-pharma commented May 23, 2023

rickbraddy-pharma commented May 23, 2023

rickbraddy-pharma commented May 23, 2023

rickbraddy-pharma commented May 23, 2023

rickbraddy-pharma commented May 24, 2023

rickbraddy-pharma commented May 24, 2023

schinto commented May 25, 2023 • edited Loading

rickbraddy-pharma commented May 25, 2023

rickbraddy-pharma commented May 25, 2023

schinto commented May 25, 2023

rickbraddy-pharma commented May 25, 2023

schinto commented May 30, 2023

rickbraddy-pharma commented May 30, 2023

schinto commented May 22, 2023 •

edited

Loading

schinto commented May 25, 2023 •

edited

Loading