Allow VirtualRecords to have multiple calls to the same component. #988

piotrm0 · 2024-03-11T21:27:02Z

Allow VirtualRecords to have multiple calls to the same component.
Added updates to the virtual_example with how that is done:

# The same method selector can indicate multiple invocations by mapping to a
# list of Dicts instead of a single Dict:

rec2 = VirtualRecord(
    main_input="Where is Germany?",
    main_output="Poland is in Europe",
    calls=
        {
            context_method: 
                [dict(
                    args=["Where is Germany?"],
                    rets=["Poland is a country located in Europe."]
                ), dict(
                    args=["Where is Germany?"],
                    rets=["Germany is a country located in Europe."]
                )
            ] 
        }
    )

Followed by feedback function variants for this:

# Select context to be used in feedback. We select the return values of the
# virtual `get_context` call in the virtual `retriever` component. Names are
# arbitrary except for `rets`.  If there are multiple calls to this method
# recorded, the first one is used by default though a warning will be issued.
context = context_method.rets[:]
# Same as context = context_method[0].rets[:]

# Alternatively, all of the contexts can be retrieved for use in feedback.
context_all_calls = context_method[:].rets[:]

Added combinations field to Feedback and argument to Feedback.aggregate to specify how to build argument dictionaries for feedback functions if selectors generate more than one thing. The default and existing mode is "product" but also added "zip" as an option as specified here:

class FeedbackCombinations(str, Enum):
    """How to collect arguments for feedback function calls.
    
    Note that this applies only to cases where selectors pick out more than one
    thing for feedback function arguments. This option is used for the field
    `combinations` of
    [FeedbackDefinition][trulens_eval.schema.FeedbackDefinition] and can be
    specified with
    [Feedback.aggregate][trulens_eval.feedback.feedback.Feedback.aggregate].
    """

    ZIP = "zip"
    """Match argument values per position in produced values. 
    
    Example:
        If the selector for `arg1` generates values `0, 1, 2` and one for `arg2`
        generates values `"a", "b", "c"`, the feedback function will be called 3
        times with kwargs:

        - `{'arg1': 0, arg2: "a"}`,
        - `{'arg1': 1, arg2: "b"}`, 
        - `{'arg1': 2, arg2: "c"}`

    If the quantities of items in the various generators do not match, the
    result will have only as many combinations as the generator with the
    fewest items as per python [zip][zip] (strict mode is not used).

    Note that selectors can use
    [Lens][trulens_eval.utils.serial.Lens] `collect()` to name a single (list)
    value instead of multiple values.
    """

    PRODUCT = "product"
    """Evaluate feedback on all combinations of feedback function arguments.

    Example:
        If the selector for `arg1` generates values `0, 1` and the one for
        `arg2` generates values `"a", "b"`, the feedback function will be called
        4 times with kwargs:

        - `{'arg1': 0, arg2: "a"}`,
        - `{'arg1': 0, arg2: "b"}`,
        - `{'arg1': 1, arg2: "a"}`,
        - `{'arg1': 1, arg2: "b"}`

    See [itertools.product][itertools.product] for more.

    Note that selectors can use
    [Lens][trulens_eval.utils.serial.Lens] `collect()` to name a single (list)
    value instead of multiple values.
    """

Added FeedbackStatus.SKIPPED to indicate that an eval was skipped and should not be ran again. Fixed runner to take this into account.
Fixed OpenAI provider to take in rpm/pace and use it for controlling rate of endpoint invocations.

review-notebook-app · 2024-03-11T21:27:08Z

Check out this pull request on

See visual diffs & provide feedback on Jupyter Notebooks.

Powered by ReviewNB

joshreini1 · 2024-03-11T21:59:48Z

I think we're going for something slightly different in that the repeated call may not be directly in sequence:

rec2 = VirtualRecord(
    main_input="Where is Germany?",
    main_output="Poland is in Europe",
    calls=
        {
            context_method: 
                dict(
                    args=["Where is Germany?"],
                    rets=["Poland is a country located in Europe."]
                ),
            some_other_method: 
                dict(
                    args=["Where is Germany?"],
                    rets=["Poland is a country located in Europe."]
                ),
            context_method: 
                dict(
                    args=["Where is Germany?"],
                    rets=["Germany is a country located in Europe."]
                ),
        }
    )

…le_calls_to_same_path

…to ifexists

…iple_calls_to_same_path' into piotrm/virtual_mutliple_calls_to_same_path

example and updates to virtual records

a87f1c8

dosubot bot added the size:L This PR changes 100-499 lines, ignoring generated files. label Mar 11, 2024

dosubot bot added the documentation Improvements or additions to documentation label Mar 11, 2024

adjust note

2c3371e

piotrm0 requested a review from joshreini1 March 11, 2024 21:28

joshreini1 and others added 19 commits March 11, 2024 18:01

Merge branch 'main' into piotrm/virtual_mutliple_calls_to_same_path

98ef54a

Merge remote-tracking branch 'origin/main' into piotrm/virtual_mutlip…

9d10807

…le_calls_to_same_path

add combinations option to Feedback.aggregate

c2ca585

disabled selector prechecks on tru virtual

d7fe1c4

nit

70920f5

add env listing to eval pipeline after testing packages are installed

7151c9a

use attr instead of str to check combination parameters

3af84d5

remove unneeded pass

94009f0

add verbose to pip list

c2cef34

add verbose to pip installs in pipeline

09173ca

remove alias use in main __init__

2ba9f23

one more

49b39a8

rename schema in import

d3ce5db

adjust llama index version test

1c624cf

pydantic issue debugging

99e4533

add notes about weird import bug

2781da6

Merge branch 'main' into piotrm/virtual_mutliple_calls_to_same_path

d051d63

added skipped feedback status and used to mark feedbacks skipped due …

7d4a8a3

…to ifexists

Merge remote-tracking branch 'refs/remotes/origin/piotrm/virtual_mutl…

9d39b88

…iple_calls_to_same_path' into piotrm/virtual_mutliple_calls_to_same_path

joshreini1 approved these changes Mar 12, 2024

View reviewed changes

dosubot bot added the lgtm This PR has been approved by a maintainer label Mar 12, 2024

piotrm0 merged commit 83223c4 into main Mar 12, 2024
8 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Allow VirtualRecords to have multiple calls to the same component. #988

Allow VirtualRecords to have multiple calls to the same component. #988

piotrm0 commented Mar 11, 2024 •

edited

review-notebook-app bot commented Mar 11, 2024

joshreini1 commented Mar 11, 2024

Allow VirtualRecords to have multiple calls to the same component. #988

Allow VirtualRecords to have multiple calls to the same component. #988

Conversation

piotrm0 commented Mar 11, 2024 • edited

review-notebook-app bot commented Mar 11, 2024

joshreini1 commented Mar 11, 2024

piotrm0 commented Mar 11, 2024 •

edited