
Improve xfails by allowing specifying expected errors | test(torchlib) #799

Closed · wants to merge 9 commits

Conversation

justinchuby (Collaborator) commented Jun 20, 2023

  • Allow specifying expected errors in xfails
  • Raise RuntimeError instead of AssertionError when ORT raises an error in a test, for more accurate xfails.
  • Specify the expected errors in some xfails (as a first pass)

Fixes #794
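
As a rough illustration of the direction (simplified names, not the exact signatures in this PR): an xfail entry can carry the exception type it expects, which is forwarded to pytest.mark.xfail(raises=...) so that an unrelated failure still surfaces as a real failure.

    import pytest

    def xfail(op_name: str, *, reason: str, raises=None):
        """Hypothetical, simplified helper: build the xfail decorator for one op.

        op_name mirrors the real helper's signature; it is unused in this sketch.
        """
        if raises is None:
            return pytest.mark.xfail(reason=reason)
        # Only count the failure as expected when it is of the given type.
        return pytest.mark.xfail(reason=reason, raises=raises)

    # Hypothetical usage: expect ORT to raise a RuntimeError for this op.
    mark = xfail("aten::some_op", reason="ORT does not implement it", raises=RuntimeError)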

justinchuby changed the title from "Improve xfails by allowing specifying expected errors" to "Improve xfails by allowing specifying expected errors | test(torchlib)" on Jun 20, 2023
justinchuby added the "topic: torch_lib" (Related to the torch/aten function lib in development) and "topic: testing" labels on Jun 20, 2023
BowenBao (Contributor)

Thanks @justinchuby, this is great! Can we guard on the error message too? The exception type feels a bit broad; for example, I may want to flag a change in behavior when ORT starts to complain about a mismatching type instead of the type not being implemented.

justinchuby (Collaborator, Author) commented Jun 20, 2023

The error message is harder because there isn't a pytest decorator for it. I could implement one but that will take longer. Do you have any suggestions? Thanks!
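
For context, the gap in plain pytest looks like this (a generic sketch, not code from this PR): the xfail decorator can filter on the exception type via raises=, but only the pytest.raises context manager can match the error message.

    import pytest

    # Decorator form: restricts the xfail to an exception type, but there is
    # no parameter for matching the error message.
    @pytest.mark.xfail(raises=RuntimeError, reason="ORT kernel not implemented")
    def test_whole_method():
        raise RuntimeError("op not implemented for float16")

    # Context-manager form: matches the message with a regex, but it is a hard
    # expectation rather than an xfail; the test fails if the exception does
    # not occur or the message does not match.
    def test_message_match():
        with pytest.raises(RuntimeError, match="not implemented"):
            raise RuntimeError("op not implemented for float16")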

Review comment on:

    return DecorateMeta(
        op_name=op_name,
        variant_name=variant_name,
        decorator=unittest.expectedFailure,
Contributor:

For my own understanding, why do we seem to decorate xfail in two places? One here, and the other down at

def normal_xfail_skip_test_behaviors(
    ...
        if test_behavior == "xfail":
            pytest.xfail(reason=reason)

justinchuby (Collaborator, Author):

There is the decorator that decorates the test method, and there are context managers that handle the subtests. I haven't found a good way to unify the two. It may be possible for us to create our own decorators, but I am not sure how that would be supported by pytest?
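
Roughly, the two mechanisms in play look like this (a simplified sketch with a hypothetical xfail_on_error helper, not the repo's actual code):

    import contextlib
    import pytest

    # (1) Decorator: marks the whole test method as an expected failure
    #     before it runs.
    @pytest.mark.xfail(raises=RuntimeError, reason="ORT rejects this op")
    def test_whole_method():
        raise RuntimeError("ORT error")

    # (2) Context manager: decides at runtime, once the error actually happens
    #     while running a particular (sub)test body.
    @contextlib.contextmanager
    def xfail_on_error(reason: str, raises=Exception):
        try:
            yield
        except raises:
            # pytest.xfail() raises an internal control-flow exception, so the
            # test stops here and is reported as xfailed.
            pytest.xfail(reason=reason)

    def test_runtime_xfail():
        with xfail_on_error("ORT rejects this input", raises=RuntimeError):
            raise RuntimeError("ORT error")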

Contributor:

It's not supported..

Review comment on:

    if raises is None:
        decorator = pytest.mark.xfail(reason=reason)
    else:
        decorator = pytest.mark.xfail(reason=reason, raises=raises)
justinchuby (Collaborator, Author):

When strict=True, any passing subtests will fail the whole test method. We have cases where some subtests fail and we may still want to run the test. Any suggestions?
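
For reference, this is standard pytest behavior rather than anything repo-specific: with strict=True, an unexpected pass is reported as a failure, which is too aggressive when only some subtests inside a method are expected to fail.

    import pytest

    # Default strict=False: if the test unexpectedly passes, it is reported
    # as XPASS and the run stays green.
    @pytest.mark.xfail(reason="flaky ORT failure")
    def test_may_pass():
        assert True

    # strict=True: an unexpected pass is reported as FAILED (XPASS(strict)).
    @pytest.mark.xfail(reason="known ORT failure", strict=True)
    def test_must_fail():
        assert True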

Review comment on:

    if test_behavior == "xfail":
        pytest.xfail(reason=reason)
    if raises is not None:
        with pytest.raises(raises):
Contributor:

If we pass in the unittest instance, can we do

        if test_behavior == "xfail":
            if raises is not None:
                with test_case.assertRaisesRegex(raises, expected_error_message_regex):
                    raise
            pytest.xfail(reason=reason)

BowenBao (Contributor) commented Jun 20, 2023:

nvm, I think we need a context manager similar to this but wrapped around the subtest call.

edit: wait.. this function is already that context manager, so it should work?

justinchuby (Collaborator, Author):

It should work for any subtests, but I am not sure how to support the test methods?

titaiwangms (Contributor) commented Jun 23, 2023

> The error message is harder because there isn't a pytest decorator for it. I could implement one but that will take longer. Do you have any suggestions? Thanks!

A naive way would be to do a string match between the reason and the exception message in normal_xfail_skip_test_behaviors, and only raise xfail if they match at some point.

I guess it doesn't have to be strict, just point out where the error occurred and its type.
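
A rough sketch of that idea (a hypothetical, simplified stand-in for normal_xfail_skip_test_behaviors, not the actual implementation):

    import contextlib
    import pytest

    @contextlib.contextmanager
    def xfail_if_message_matches(reason: str, expected_fragment: str):
        """Xfail only when the error message contains the expected fragment;
        otherwise let the error propagate as a real failure."""
        try:
            yield
        except Exception as e:
            if expected_fragment in str(e):
                pytest.xfail(reason=reason)
            raise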

justinchuby (Collaborator, Author)

> A naive way would be to do a string match between the reason and the exception message in normal_xfail_skip_test_behaviors, and only raise xfail if they match at some point.
>
> I guess it doesn't have to be strict, just point out where the error occurred and its type.

That's for the subtest, I think? I am wondering if there's a way to use the decorators.

titaiwangms (Contributor)

> That's for the subtest, I think? I am wondering if there's a way to use the decorators.

Do we have more cases that need xfail on the whole test? The other dtypes?

justinchuby (Collaborator, Author)

Yes. Sometimes ORT fails for the op, so we xfail the whole test.

justinchuby marked this pull request as draft on August 9, 2023.
Labels: topic: testing, topic: torch_lib (Related to the torch/aten function lib in development)
Projects: None yet
Development: Successfully merging this pull request may close these issues: [torchlib] Improve xfail decorator to be more robust and accurate
3 participants