Improved (in)equality testing with non-boolean results #5533

nickovs · 2020-01-13T09:53:21Z

This is a version of #5479 which makes use of a new flag in the flags word of mp_obj_type_t to indicate whether various shortcuts cannot be used when performing equality and inequality tests, as per the comment in the discussion of that PR.

The semantics of the new TYPE_FLAG_NO_EQUALITY_SHORTCUTS flag are as follows. When the flag is clear (or uninitialised, in a static structure), the code can assume that the class (a) only ever returns a boolean result, (b) is reflexive, (c) implements only the __eq__ operator and not the __ne__ operator, and (d) cannot be equal to an instance of any different class that also has this flag clear. If the flag is set, at least one of these assumptions does not hold.
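As a rough illustration of why assumptions (a) and (b) matter, here is a minimal sketch in plain Python. The class names `Elementwise` and `NeverEqual` are hypothetical and are not part of this PR; they only show the kind of user class for which a boolean-only or identity-based shortcut would give the wrong answer.

```python
class Elementwise:
    """Hypothetical class whose __eq__ returns a list, not a bool (breaks assumption a)."""

    def __init__(self, *items):
        self.items = items

    def __eq__(self, other):
        # Compare element by element and return the list of results.
        return [a == b for a, b in zip(self.items, other.items)]


class NeverEqual:
    """Hypothetical class that is not equal even to itself (breaks assumption b)."""

    def __eq__(self, other):
        return False


# The == operator hands back whatever __eq__ returned, here a list.
result = Elementwise(1, 2, 3) == Elementwise(1, 0, 3)

# An "identical objects are equal" shortcut would wrongly report True here.
n = NeverEqual()
identity_shortcut_would_be_wrong = (n == n) is False
```

For classes like these the flag would need to be set, so that the comparison always goes through the class's own __eq__.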

Currently four built-in classes have the flag set: float and complex are non-reflexive (since nan != nan) while bytearray and frozenset instances can equal other built-in class instances (bytes and set respectively).
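The four cases can be checked directly in CPython. The snippet below is only a sanity check of the reference behaviour, not code from the PR.

```python
nan = float("nan")

# float and complex: NaN never compares equal, even to itself.
float_reflexive = nan == nan
complex_reflexive = complex(nan, 0) == complex(nan, 0)

# bytearray and frozenset: can equal instances of a different built-in class.
bytearray_equals_bytes = bytearray(b"abc") == b"abc"
frozenset_equals_set = frozenset({1, 2}) == {1, 2}
```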

The flag is set for any new class defined by the user. In the future, code could be added to analyse user-defined classes to see if either of the __eq__ or __ne__ special methods are defined and clear the flag if both are absent in the class and all its superclasses.
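The possible future analysis could look something like the sketch below. It is written in Python purely to model the idea; the helper name `defines_custom_equality` is hypothetical, and an actual implementation would live in the C code that builds user-defined types.

```python
def defines_custom_equality(cls):
    """Return True if cls or any base other than object defines __eq__ or __ne__."""
    for klass in cls.__mro__:
        if klass is object:
            continue  # object's default methods do not count
        if "__eq__" in vars(klass) or "__ne__" in vars(klass):
            return True
    return False


class Plain:
    pass


class Base:
    def __eq__(self, other):
        return True


class Derived(Base):
    pass


class OnlyNe:
    def __ne__(self, other):
        return False


# Only Plain would be a candidate for clearing the flag.
candidates = [c.__name__ for c in (Plain, Base, Derived, OnlyNe)
              if not defines_custom_equality(c)]
```

Note that this check is conservative: a subclass of a built-in such as int would be reported as defining custom equality, because the built-in itself provides __eq__.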

Adding the flag (and setting it for a few built-in classes) looks like it reduced the amount of code and it should now take the fast paths for most common cases, while further improving the compatibility with CPython behaviour.

The PR also expands the test cases to cover some cases that previously did not function as CPython did.

dpgeorge · 2020-01-13T13:18:26Z

Thanks for following up!

Initial tests for size difference give:

   bare-arm:  +108 +0.164% 
minimal x86:  +220 +0.149% 
   unix x64:  +112 +0.022% 
unix nanbox:   +32 +0.007% 
      stm32:   +92 +0.024% PYBV10
     cc3200:  +120 +0.065% 
    esp8266:   +64 +0.009% GENERIC
      esp32:    -8 -0.001% GENERIC[incl -32(data)]
        nrf:   +76 +0.052% pca10040
       samd:   +96 +0.095% ADAFRUIT_ITSYBITSY_M4_EXPRESS

That's very reasonable.

nickovs · 2020-01-15T05:05:24Z

@dpgeorge Something strange is happening with the test coverage. The new tests that I just added do cover the one line in objtype.c that shows as uncovered when I run make coverage_test on my Linux machine. I don't see how the tests could complete without exercising this path, and I don't see how the compiler could ever optimise this away. If you get the chance to run the coverage tests, perhaps you could look to see if line 1398 in objtype.c is being hit.

dpgeorge · 2020-01-15T09:16:08Z

> The new tests that I just added do cover the one line in objtype.c that shows as uncovered when I run make coverage_test on my Linux machine.

Yeah, that happens sometimes. It seems that gcov is not always accurate with the lines it hits. I'll double check it though.

dpgeorge · 2020-01-24T05:34:19Z

I see that the check and warning for comparison between str and bytes is now gone. There is a test for this in tests/basics/bytes_compare3.py but, because enabling the warning is a compile-time option, the test will actually pass even if the warning is not printed.

IMO the check and warning should be retained. I don't think there's an elegant way to add it; I think there need to be explicit tests in mp_obj_equal_bop() for it (similar to how it worked before).
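For reference, this is the CPython behaviour that the check mirrors: a str and a bytes object never compare equal, and CPython only emits a BytesWarning for the comparison when the interpreter is started with the -b option. The sketch below assumes the interpreter is run without that option.

```python
text = "abc"
data = b"abc"

# A str never equals a bytes object, even when the contents match.
equal = text == data
not_equal = text != data

# Decoding first gives a meaningful comparison.
decoded_equal = text == data.decode("ascii")
```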

nickovs · 2020-01-25T23:45:40Z

I've reinstated the warning for comparisons of strings and bytes.

dpgeorge · 2020-01-29T06:38:03Z

py/obj.c

    // fast path for strings
-    if (mp_obj_is_str(o1)) {
+    if (is_str_1 && is_str_2) {


Is this correct? I think it should be || not &&. Anyway, I can take it from here; I think I can make this a bit simpler.

nickovs · 2020-01-30

There's definitely something wrong either here or in the line below. Let me investigate and fix before you merge.

dpgeorge · 2020-01-30

I've already fixed it and have done a few other clean-ups. I can push it as a new PR for further review.

dpgeorge · 2020-01-30T01:21:59Z

See #5593 for a slightly reworked version of this PR.

nickovs · 2020-01-30T02:00:29Z

I only just saw your last comment after I pushed a fix with some other improvements. I'm happy to go with your version if you prefer.

dpgeorge · 2020-01-30T02:05:19Z

> I only just saw your last comment after I pushed a fix with some other improvements.

No worries. I'll see if your version of the string handling is smaller and work it in.

dpgeorge · 2020-01-30T03:54:46Z

py/obj.c

+    int pass_number = 0;
+
+    bool o2_shortcut = !(mp_obj_is_obj(o2) &&
+                         (((mp_obj_base_t*)MP_OBJ_TO_PTR(o2))->type->flags & MP_TYPE_FLAG_NO_EQUALITY_SHORTCUTS));


This doesn't work for object representations C&D where floats are not objects (they are inline values stored in the mp_obj_t itself), so it fails the nan==nan test (see nanbox failure on Travis).

dpgeorge · 2020-01-30T04:18:37Z

OK, I've merged #5593 without the final commit from here. See commits c3450ef through c96a2f6.

Thanks very much @nickovs for the hard work getting this done, it's tough making such deep changes to the core!

nickovs · 2020-01-30T04:24:56Z

Sounds good.

Thanks very much @dpgeorge for the hard work of creating MicroPython in the first place!

mcdeoliveira · 2020-01-31T06:06:08Z

Happy to see this done! Great job @nickovs and @dpgeorge.

nickovs added 4 commits January 1, 2020 09:37

Support non-boolean results for equality and inequality tests

3448e63

Implemented suggestions from dpgeorge. Added more tests.

12c4678

Refactored new equality testing to add a flag for compaison optimisation.

1f6ffbf

Merge branch 'master' into equal_non_bool_v2

bd2fcc1

dpgeorge added the py-core label Jan 13, 2020

dpgeorge reviewed Jan 13, 2020

nickovs added 4 commits January 13, 2020 16:18

Moved flags for mp_obj_type_t to obj.h and added MP_ prefix their to names

255f048

Moved flags for mp_obj_type_t to obj.h and added MP_ prefix their to names

38fe10d

Merge branch 'equal_non_bool_v2' of github.com:/nickovs/micropython into equal_non_bool_v2

75854d6

Expanded test cases for equality of subclasses

608492f

nickovs added 2 commits January 25, 2020 16:41

Reinstated the optional warnings for comparisons between strings and bytes.

d3c5b74

Merge branch 'equal_non_bool_v2' of github.com:/nickovs/micropython into equal_non_bool_v2

469479e

dpgeorge reviewed Jan 29, 2020

This was referenced Jan 30, 2020

Implement non-bool results from (in)equality tests #5479

Closed

Improved (in)equality testing with non-boolean results (v3) #5593

Merged

Fixed warnings for string/bytes comparisons.

4202e06

Improved shortcut cases for strings. Cleaned up indentation.

dpgeorge reviewed Jan 30, 2020

dpgeorge closed this Jan 30, 2020

tannewt pushed a commit to tannewt/circuitpython that referenced this pull request Nov 4, 2021

Merge pull request micropython#5533 from CytronTechnologies/reverse-pr-4981

0f9448d

Reversal of PR micropython#4981
