Python: Filter only json content from basic planner execution input #1496

awharrison-28 · 2023-06-14T23:53:30Z

Motivation and Context

There are scenarios where the basic_planner prompt results in a valid plan surrounded by additional text. This renders the plan unable to be json deserialized. This PR addresses this scenario. Note that it does NOT address malformed json.

Description

added a regex in basic_planner.execute_plan_async that grabs only json blocks. See screenshot below of it ignoring additional, non-json text.
Added dependency on regex pip package (re package does not support regex recursion)

Contribution Checklist

The code builds clean without any errors or warnings
The PR follows SK Contribution Guidelines (https://github.com/microsoft/semantic-kernel/blob/main/CONTRIBUTING.md)
The code follows the .NET coding conventions (https://learn.microsoft.com/dotnet/csharp/fundamentals/coding-style/coding-conventions) verified with dotnet format
All unit tests pass, and I have added new tests where possible
I didn't break anyone 😄

…ble text is ignored

dluc · 2023-06-15T00:35:12Z

python/semantic_kernel/planning/basic_planner.py

-        generated_plan = json.loads(plan.generated_plan.result)
+
+        # Filter out good JSON from the result in case additional text is present
+        json_regex = r"\{(?:[^{}]|(?R))*\}"


couple of notes:

is the regex correct? I'm getting parse errors (using online tools), e.g. at ?R (the code works with python though)

could you add a comment explaining the regex?

maybe add unit tests?

is the regex safe (redos)?

Here's a breakdown of the regex pattern {(?:[^{}]|(?R))*}:

\{: This part of the pattern matches the opening curly brace {.

(?:: This part is the beginning of a non-capturing group. Non-capturing groups allow you to apply quantifiers to part of your regex, without capturing the matched text.

[^{}]: This part of the pattern is a character class that matches any character except the curly braces { and }.

|: This is the alternation (OR) operator, which means the pattern can match either the left side (any character except curly braces) or the right side (the next part of the pattern).

(?R): This part of the pattern is a regex recursion. It refers to the entire regular expression pattern, allowing the pattern to be applied recursively. This is useful for matching nested structures like JSON.

)*: This part of the pattern is a quantifier that means "match zero or more of the preceding pattern." In this case, the preceding pattern is the non-capturing group that contains the character class and the recursion.

\}: This part of the pattern matches the closing curly brace }.

Overall, the regex pattern can be described as follows:
Match an opening curly brace.
Match any number of characters that are not curly braces or recursively apply the entire regex pattern (to match nested JSON objects).
Match a closing curly brace.

…icrosoft#1496) ### Motivation and Context Closes microsoft#1469 There are scenarios where the basic_planner prompt results in a valid plan surrounded by additional text. This renders the plan unable to be json deserialized. This PR addresses this scenario. Note that it does NOT address malformed json. ### Description - added a regex in basic_planner.execute_plan_async that grabs only json blocks. See screenshot below of it ignoring additional, non-json text. - Added dependency on `regex` pip package (`re` package does not support regex recursion) Co-authored-by: Shawn Callegari <36091529+shawncal@users.noreply.github.com>

### Motivation and Context  This PR is a first draft for implementation of ActionPlanner in Python. I try to follow the C# implementation as much as possible. Fixes: #1917 ### Description  - I have implemented ActionPlanner in the `python/semantic_kernel/planning/action_planner` directory. The implementation is in a file called `action_planner.py` and the prompt used by the planner (a replica of the prompt used in C#) is found in `skprompt.txt` in the same directory. - The functions `good_examples`, `edge_case_examples` and `list_of_functions` are used by the prompt to populate examples and list of available functions in the kernel. - After creating the plan, I parse the JSON generated by the planner by using regex matching to extract out pure JSON as implemented in #1496 - Based on whether or not the planner was able to identify a function, I create a Plan instance and return it. ### Here is some example code for using the planner ```python import logging import semantic_kernel as sk from semantic_kernel.planning import ActionPlanner from semantic_kernel.connectors.ai.open_ai import ( OpenAIChatCompletion, OpenAITextCompletion ) from semantic_kernel.core_skills import ( MathSkill, FileIOSkill, TimeSkill, TextSkill ) async def main(): kernel = sk.Kernel() api_key, org_id = sk.openai_settings_from_dot_env() kernel.add_text_completion_service("dv", OpenAITextCompletion("text-davinci-003", api_key, org_id)) kernel.import_skill(MathSkill(), "math") kernel.import_skill(FileIOSkill(), "fileIO") kernel.import_skill(TimeSkill(), "time") kernel.import_skill(TextSkill(), "text") planner = ActionPlanner(kernel) ask = "What is the sum of 110 and 990?" plan = await planner.create_plan_async(goal=ask) result = await plan.invoke_async() print(result.result) if __name__ == "__main__": import asyncio asyncio.run(main()) ``` ### Queries - In the C# implementation, when populating the function parameters in the plan object, the `plan.parameters` variable is updated with the function parameters. When I did the same in python, these parameters weren't being passed onto the function in `invoke_async` method of the plan instance. The parameters are only being passed when I update them in the `plan.state` variable. This code is in `line 121` of `action_planner.py`. What is the right way to do this? - Some skills / plugins require one of the inputs to be passed as the `input` variable and not in the context. How should this be handled in the planner? ![image](https://github.com/microsoft/semantic-kernel/assets/69807105/967ea4b7-914e-46ac-89fb-1eb81c4ea9b0) **OR** ![image](https://github.com/microsoft/semantic-kernel/assets/69807105/13a21556-8028-4f54-b40a-e4ebd1c5e654) ### Issues that I faced - The `text-davinci-002` text-completion model regularly output incorrectly formatted JSON with commas missing. For example ```json {""plan"":{ ""rationale"": ""The list contains a function that allows for writing to files, so the plan is to use that function to write the desired content to the file 'output.txt'."", ""function"": ""fileIO.writeAsync"", ""parameters"": { ""input"": null, ""content"": ""A brief history of computer science"" ""path"": ""output.txt"" }}} ``` Pending Items - [x] Create unit tests - [x] Send the function parameters to the planner in list of available functions with/without input parameter. ### Contribution Checklist  - [x] The code builds clean without any errors or warnings - [x] The PR follows SK Contribution Guidelines (https://github.com/microsoft/semantic-kernel/blob/main/CONTRIBUTING.md) - [x] The code follows the .NET coding conventions (https://learn.microsoft.com/dotnet/csharp/fundamentals/coding-style/coding-conventions) verified with `dotnet format` - [x] All unit tests pass, and I have added new tests where possible - [x] I didn't break anyone 😄 --------- Co-authored-by: Abby Harrison <abby.harrison@microsoft.com> Co-authored-by: Abby Harrison <54643756+awharrison-28@users.noreply.github.com> Co-authored-by: Shawn Callegari <36091529+shawncal@users.noreply.github.com>

### Motivation and Context  This PR is a first draft for implementation of ActionPlanner in Python. I try to follow the C# implementation as much as possible. Fixes: microsoft#1917 ### Description  - I have implemented ActionPlanner in the `python/semantic_kernel/planning/action_planner` directory. The implementation is in a file called `action_planner.py` and the prompt used by the planner (a replica of the prompt used in C#) is found in `skprompt.txt` in the same directory. - The functions `good_examples`, `edge_case_examples` and `list_of_functions` are used by the prompt to populate examples and list of available functions in the kernel. - After creating the plan, I parse the JSON generated by the planner by using regex matching to extract out pure JSON as implemented in microsoft#1496 - Based on whether or not the planner was able to identify a function, I create a Plan instance and return it. ### Here is some example code for using the planner ```python import logging import semantic_kernel as sk from semantic_kernel.planning import ActionPlanner from semantic_kernel.connectors.ai.open_ai import ( OpenAIChatCompletion, OpenAITextCompletion ) from semantic_kernel.core_skills import ( MathSkill, FileIOSkill, TimeSkill, TextSkill ) async def main(): kernel = sk.Kernel() api_key, org_id = sk.openai_settings_from_dot_env() kernel.add_text_completion_service("dv", OpenAITextCompletion("text-davinci-003", api_key, org_id)) kernel.import_skill(MathSkill(), "math") kernel.import_skill(FileIOSkill(), "fileIO") kernel.import_skill(TimeSkill(), "time") kernel.import_skill(TextSkill(), "text") planner = ActionPlanner(kernel) ask = "What is the sum of 110 and 990?" plan = await planner.create_plan_async(goal=ask) result = await plan.invoke_async() print(result.result) if __name__ == "__main__": import asyncio asyncio.run(main()) ``` ### Queries - In the C# implementation, when populating the function parameters in the plan object, the `plan.parameters` variable is updated with the function parameters. When I did the same in python, these parameters weren't being passed onto the function in `invoke_async` method of the plan instance. The parameters are only being passed when I update them in the `plan.state` variable. This code is in `line 121` of `action_planner.py`. What is the right way to do this? - Some skills / plugins require one of the inputs to be passed as the `input` variable and not in the context. How should this be handled in the planner? ![image](https://github.com/microsoft/semantic-kernel/assets/69807105/967ea4b7-914e-46ac-89fb-1eb81c4ea9b0) **OR** ![image](https://github.com/microsoft/semantic-kernel/assets/69807105/13a21556-8028-4f54-b40a-e4ebd1c5e654) ### Issues that I faced - The `text-davinci-002` text-completion model regularly output incorrectly formatted JSON with commas missing. For example ```json {""plan"":{ ""rationale"": ""The list contains a function that allows for writing to files, so the plan is to use that function to write the desired content to the file 'output.txt'."", ""function"": ""fileIO.writeAsync"", ""parameters"": { ""input"": null, ""content"": ""A brief history of computer science"" ""path"": ""output.txt"" }}} ``` Pending Items - [x] Create unit tests - [x] Send the function parameters to the planner in list of available functions with/without input parameter. ### Contribution Checklist  - [x] The code builds clean without any errors or warnings - [x] The PR follows SK Contribution Guidelines (https://github.com/microsoft/semantic-kernel/blob/main/CONTRIBUTING.md) - [x] The code follows the .NET coding conventions (https://learn.microsoft.com/dotnet/csharp/fundamentals/coding-style/coding-conventions) verified with `dotnet format` - [x] All unit tests pass, and I have added new tests where possible - [x] I didn't break anyone 😄 --------- Co-authored-by: Abby Harrison <abby.harrison@microsoft.com> Co-authored-by: Abby Harrison <54643756+awharrison-28@users.noreply.github.com> Co-authored-by: Shawn Callegari <36091529+shawncal@users.noreply.github.com>

added a regex to execute_plan to ensure that any non-json deserializa…

f8a21e6

…ble text is ignored

awharrison-28 added the bug Something isn't working label Jun 14, 2023

awharrison-28 requested review from mkarle and alexchaomander June 14, 2023 23:53

github-actions bot added the python Pull requests for the Python Semantic Kernel label Jun 14, 2023

merge conflicts with main

1505aad

shawncal approved these changes Jun 15, 2023

View reviewed changes

Merge branch 'main' into python/address_non_deserializable_plans

a4564e5

shawncal merged commit 7b3dc2c into microsoft:main Jun 15, 2023
26 checks passed

dluc reviewed Jun 15, 2023

View reviewed changes

lemillermicrosoft mentioned this pull request Jun 27, 2023

ActionPlanner should be more forgiving when parsing out plan results. #1726

Closed

evchaki added this to the Sprint 33 milestone Jun 30, 2023

Sailesh-2209 mentioned this pull request Jul 11, 2023

Python: Action Planner implementation #1948

Merged

7 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Python: Filter only json content from basic planner execution input #1496

Python: Filter only json content from basic planner execution input #1496

awharrison-28 commented Jun 14, 2023

dluc Jun 15, 2023 •

edited

Loading

awharrison-28 Jun 15, 2023

Python: Filter only json content from basic planner execution input #1496

Python: Filter only json content from basic planner execution input #1496

Conversation

awharrison-28 commented Jun 14, 2023

Motivation and Context

Description

Contribution Checklist

dluc Jun 15, 2023 • edited Loading

Choose a reason for hiding this comment

awharrison-28 Jun 15, 2023

Choose a reason for hiding this comment

dluc Jun 15, 2023 •

edited

Loading