Expanded speaker name matching during speaker selection #2222

marklysze · 2024-03-31T12:16:33Z

Why are these changes needed?

During the speaker selection process, the LLM returns the name of the next speaker. This is fairly reliable with OpenAI's models, but open-source/open-weight models often struggle to return just the single agent name and to stick to the format of the name.

Two issues in particular are:

If the name has underscores, some models (particularly Mistral AI's Mistral/Mixtral models) will escape them, e.g. 'Software_Developer' will be returned as 'Software\_Developer'.
If the name has underscores, some models will replace them with spaces, e.g. 'Software_Developer' will be returned as 'Software Developer'.

With the proposed changes to GroupChat's _mentioned_agents function, agent names will now match under any of the following conditions (all continue to be case-sensitive):

[Unchanged] Exact name match
[New] If the agent name has underscores, it will also match with spaces in place of the underscores (e.g. 'Story_writer' == 'Story writer')
[New] If the agent name has underscores, it will also match with \_ (note: single backslash) in place of _ (e.g. 'Story_writer' == 'Story\_writer')
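
The three matching conditions above can be sketched roughly as follows. This is a minimal illustration, not the PR's actual code: `count_mentions` is a hypothetical stand-in for GroupChat's _mentioned_agents, it takes plain name strings rather than agent objects, and the non-word-character boundary check is an assumption.

```python
import re
from typing import Dict, List


def count_mentions(message: str, agent_names: List[str]) -> Dict[str, int]:
    """Count mentions of each agent name, accepting the variants listed above.

    Hypothetical helper for illustration only; matching stays case-sensitive.
    """
    mentions: Dict[str, int] = {}
    for name in agent_names:
        variants = [
            name,                      # exact name match
            name.replace("_", " "),    # underscores returned as spaces
            name.replace("_", r"\_"),  # underscores returned escaped
        ]
        # Assumption: a mention must be surrounded by non-word characters,
        # so 'Story_writer' does not match inside 'Story_writers'.
        alternatives = "|".join(re.escape(v) for v in variants)
        pattern = r"(?<=\W)(" + alternatives + r")(?=\W)"
        # Pad with spaces so a name at the start or end of the message matches.
        count = len(re.findall(pattern, f" {message} "))
        if count > 0:
            mentions[name] = count
    return mentions
```

Under these assumptions, `count_mentions(r"Next: Software\_Developer", ["Software_Developer"])` counts one mention of 'Software_Developer', while a lowercase 'software_developer' in the message counts none.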

I've added a test function to test_groupchat.py.

If we proceed with this, I'd like to add to the tips for non-OpenAI models documentation as part of this PR.

Please see #1746, which outlines a number of issues with open-source/open-weight models; this PR is part of addressing them. I don't believe this will affect the speaker selection process when using OpenAI's models.

My plan is to continue to create PRs to address these shortcomings and then close off #1746. If it is preferred that I consolidate them into one PR, please let me know.

Related issue number

Based on shortcomings identified in #1746.

Checks

I've included any doc changes needed for https://microsoft.github.io/autogen/. See https://microsoft.github.io/autogen/docs/Contribute#documentation to build and test documentation locally.
I've added tests (if relevant) corresponding to the changes introduced in this PR.
I've made sure all auto checks have passed.

codecov-commenter · 2024-03-31T12:18:01Z

Codecov Report

All modified and coverable lines are covered by tests ✅

Project coverage is 49.71%. Comparing base (989c182) to head (ce6505e).

Additional details and impacted files

@@             Coverage Diff             @@
##             main    #2222       +/-   ##
===========================================
+ Coverage   37.83%   49.71%   +11.88%     
===========================================
  Files          77       77               
  Lines        7766     7766               
  Branches     1663     1800      +137     
===========================================
+ Hits         2938     3861      +923     
+ Misses       4579     3583      -996     
- Partials      249      322       +73

Flag	Coverage Δ
unittest	`14.35% <ø> (?)`
unittests	`48.67% <ø> (+10.85%)`	⬆️

Flags with carried forward coverage won't be shown.

afourney

LGTM

marklysze · 2024-04-02T01:41:50Z

Let me know if you want me to create a new PR with a branch in the AutoGen repository for testing, rather than my own.

sonichi · 2024-04-03T12:45:34Z

> Let me know if you want me to create a new PR with a branch in the AutoGen repository for testing, rather than my own.

Yes, please do that and resolve the conflict.

marklysze · 2024-04-03T19:45:11Z

> > Let me know if you want me to create a new PR with a branch in the AutoGen repository for testing, rather than my own.
>
> Yes, please do that and resolve the conflict.

No problem, all done and this can be closed.

New PR: #2267

marklysze and others added 3 commits March 31, 2024 10:00

Updated agent name matching in speaker selection to accommodate spaces and escaping underscores (f09658a)

Merge branch 'microsoft:main' into select_speaker_name_handling (19ed4a7)

Added testing of the name matching function and further description to function (b29a2ed)

Merge branch 'microsoft:main' into select_speaker_name_handling (ce6505e)

sonichi requested review from ekzhu, afourney, LittleLittleCloud and yiranwu0 April 1, 2024 04:44

sonichi added the labels "group chat" (group-chat-related issues) and "alt-models" (pertains to using alternate, non-GPT models, e.g., local models, llama, etc.) Apr 1, 2024

sonichi requested a review from olgavrou April 1, 2024 04:45

afourney approved these changes Apr 1, 2024

olgavrou approved these changes Apr 2, 2024

marklysze mentioned this pull request Apr 3, 2024

Re-commit of #2222: Expanded speaker name matching during speaker selection #2267

Merged

3 tasks

ekzhu closed this Apr 4, 2024

marklysze had a problem deploying to openai1 April 30, 2024 20:48 — with GitHub Actions Failure
