
Conversation

@MilesHolland MilesHolland commented Nov 15, 2024

Initial PR to add 3 tests (5 with parameterization) that account for ALL e2e runs of evaluators. It currently disables 3 evaluators due to consistent problems with playback tests.

Next steps will be to disable the tests made redundant by these, and to re-enable the 3 currently untouched evaluators.

Also contains an unrelated docstring nit fix.
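To make the grouping idea concrete, here is a minimal, hypothetical sketch (not the test code added by this PR) of how a single parameterized test can exercise several evaluators in one e2e-style pass; the evaluator classes and keyword arguments are assumed to match the azure-ai-evaluation 1.x public API:

```python
# Hypothetical sketch, not the test code added by this PR: one parameterized
# test covering several evaluators in a single e2e-style pass.
import pytest

# Evaluator classes assumed to match the azure-ai-evaluation 1.x public API.
from azure.ai.evaluation import (
    BleuScoreEvaluator,
    F1ScoreEvaluator,
    RougeScoreEvaluator,
    RougeType,
)


@pytest.mark.parametrize(
    "evaluator",
    [
        F1ScoreEvaluator(),
        BleuScoreEvaluator(),
        RougeScoreEvaluator(rouge_type=RougeType.ROUGE_L),
    ],
)
def test_e2e_evaluator_run(evaluator):
    # Keyword names (response / ground_truth) are assumed from the 1.x API;
    # the real e2e tests would run through evaluate() against recorded
    # (playback) or live service responses.
    result = evaluator(
        response="Tokyo is the capital of Japan.",
        ground_truth="The capital of Japan is Tokyo.",
    )
    assert isinstance(result, dict) and len(result) > 0
```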

@MilesHolland MilesHolland requested a review from a team as a code owner November 15, 2024 18:42
@github-actions github-actions bot added the Evaluation label (Issues related to the client library for Azure AI Evaluation) Nov 15, 2024
@azure-sdk

API change check

API changes are not detected in this pull request.

@MilesHolland MilesHolland merged commit 0a9b02d into main Dec 2, 2024
20 checks passed
@MilesHolland MilesHolland deleted the eval/testing/grouped-eval-testing branch December 2, 2024 17:58
def test_evaluate_multimodal(self, multi_modal_input_type, multimodal_input_selector, azure_cred, project_scope):
# Content safety is removed due to being unstable in playback mode
evaluators = {
# "content_safety" : ContentSafetyMultimodalEvaluator(credential=azure_cred, azure_ai_project=project_scope),

What happens when you enable it? Does the test fail or get skipped?
Do you have any theory or known root cause for why this might be happening?
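
As context for the question above, here is a minimal sketch of the skip-in-playback option, assuming pytest and a locally defined is_live() helper keyed off the AZURE_TEST_RUN_LIVE environment variable:

```python
# Hypothetical sketch: skip the content-safety case only in playback mode
# instead of letting unstable recordings fail the grouped test.
import os

import pytest


def is_live() -> bool:
    # Assumed convention: AZURE_TEST_RUN_LIVE=true means tests hit the live
    # service; anything else means recorded (playback) mode.
    return os.environ.get("AZURE_TEST_RUN_LIVE", "false").lower() == "true"


@pytest.mark.skipif(
    not is_live(),
    reason="Content safety evaluator is unstable in playback mode",
)
def test_content_safety_multimodal(azure_cred, project_scope):
    # azure_cred and project_scope are fixtures assumed from the test suite.
    ...
```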

l0lawrence pushed a commit to l0lawrence/azure-sdk-for-python that referenced this pull request Feb 19, 2025
* new tests
* remove gleu
* run black
* skip
* test ci: restore multimodal test
* test ci: restore conversation test
* change skip type
* remove skips because they don't work
* test only convo in CI
* only skip convo test
* just run multi
* 2 tests
* fix param placement
* skip multi
* just singlton
* remove cs from first test
* remove rai evals
* remove prompty evals
* all but 1 eval
* update recordings
* 2 evals
* just cs
* 2 rai service
* prompty only
* disable pf proxy
* delay
* more fixture tweaks
* more os setting
* skip windows in CI
* re-enabled all tests
* remove env setting
* re-reacord
* disable 1 test
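
The "skip windows in CI" step in the commit log above corresponds to a platform-conditional skip; a minimal sketch of what that can look like, assuming pytest and the TF_BUILD variable that Azure Pipelines sets:

```python
# Hypothetical sketch of the kind of platform-conditional skip the commit
# log mentions ("skip windows in CI").
import os
import sys

import pytest

# TF_BUILD is set by Azure Pipelines; used here as the "running in CI" signal.
IN_CI = bool(os.environ.get("TF_BUILD"))


@pytest.mark.skipif(
    IN_CI and sys.platform.startswith("win"),
    reason="Flaky on Windows CI agents",
)
def test_evaluate_multimodal_grouped():
    ...
```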