Add auto-detection of self-implemented methods #607

hynek · 2020-01-03T13:24:16Z

This adds the feature to auto-detect existing methods so the creation doesn't have to be prevented by hand. This has been confusing our users for a while (see #324, #416). It should become True by default when the new APIs emerge (attr.auto/attrs.define – this is actually one of the last blockers for me to start working on them).

Typing requires an update to the mypy plugin to work correctly (currently it says tests/typing_example.py:193: error: Name '__init__' already defined on line 190).

There is really only one controversial open question: if a method is detected, should it just prevent the one method or all of the related ones? E.g. if I define __eq__, what about __ne__?

I have decided to take the presence of any method that belongs to a flag to set the flag to False because it's easier to reason about. I'm open to discussion tho.
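To make the flag-level rule concrete, here is a minimal standard-library sketch. It is not the attrs implementation: `FLAG_METHODS` and `infer_flags` are hypothetical names made up for illustration, and the sketch only looks at the class's own namespace.

```python
# Hypothetical mapping from a decorator flag to the methods it would generate.
# "hash" is left out on purpose: Python sets __hash__ = None on any class that
# defines __eq__ without __hash__, so it would need separate handling.
FLAG_METHODS = {
    "init": ("__init__",),
    "repr": ("__repr__",),
    "eq": ("__eq__", "__ne__"),
    "order": ("__lt__", "__le__", "__gt__", "__ge__"),
}


def infer_flags(cls):
    """Return {flag: bool}; a flag is False if the class body defines any of its methods."""
    own = vars(cls)  # only names defined on this class, not inherited ones
    return {
        flag: not any(name in own for name in methods)
        for flag, methods in FLAG_METHODS.items()
    }


class Base:
    def __repr__(self):
        return "Base()"


class C(Base):
    def __eq__(self, other):
        return NotImplemented


flags = infer_flags(C)
```

In this sketch, defining only `__eq__` on `C` turns the whole `eq` flag off, so neither `__eq__` nor `__ne__` would be generated, while the `__repr__` inherited from `Base` does not affect the `repr` flag.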

euresti · 2020-01-03T15:29:41Z

Is the concept with __init__ that it replaces the auto-created init, or that it gets turned into post_attr_init_?

hynek · 2020-01-03T19:46:13Z

Replaces. In the case of init we could consider special-casing it to optionally create an __attrs_init__ that the user can call whenever, to fix the “how can I call super()” problem forever.
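A rough sketch of how that idea could look from the user's side, using only the standard library. The `with_attrs_init` decorator below is entirely hypothetical and merely stands in for a generated `__attrs_init__`; it is not how attrs works.

```python
def with_attrs_init(*field_names):
    """Hypothetical decorator: attach a generated __attrs_init__, leave __init__ alone."""

    def wrap(cls):
        def __attrs_init__(self, *args):
            if len(args) != len(field_names):
                raise TypeError(
                    f"expected {len(field_names)} arguments, got {len(args)}"
                )
            # Assign each positional argument to its field, in order.
            for name, value in zip(field_names, args):
                setattr(self, name, value)

        cls.__attrs_init__ = __attrs_init__
        return cls

    return wrap


class Base:
    def __init__(self):
        self.base_ready = True


@with_attrs_init("x", "y")
class Point(Base):
    def __init__(self, x, y):
        super().__init__()         # the user controls when the base class runs
        self.__attrs_init__(x, y)  # then delegates the field assignment


p = Point(1, 2)
```

The user-written `__init__` stays in charge of calling `super().__init__()`, and the generated method only handles the boilerplate assignments.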

wsanchez · 2020-01-07T19:30:05Z

There is really only one controversial open question: if a method is detected, should it just prevent the one method or all of the related ones? E.g. if I define __eq__, what about __ne__?

As a user, I would expect it to work like functools.total_ordering.

hynek · 2020-01-20T08:58:42Z

There is really only one controversial open question: if a method is detected, should it just prevent the one method or all of the related ones? E.g. if I define __eq__, what about __ne__?

As a user, I would expect it to work like functools.total_ordering.

Hm that makes it a lot more complex to both implement and reason about. 🤔 The current approach is about detecting implied settings, yours would be very different and then the option should be called do_no_overwrite_own or something?

wsanchez · 2020-01-26T19:12:09Z

There is really only one controversial open question: if a method is detected, should it just prevent the one method or all of the related ones? E.g. if I define __eq__, what about __ne__?

As a user, I would expect it to work like functools.total_ordering.

Hm that makes it a lot more complex to both implement and reason about. 🤔 The current approach is about detecting implied settings, yours would be very different and then the option should be called do_no_overwrite_own or something?

Yeah, that's harder, but I think it does fall solidly into the "remove boilerplate" goal of attrs.

That said, if we want to keep things simple, then I'd definitely favor disabling all related methods. My reasoning there is that I suspect that the alternative is complicated in a worse way than total_ordering:

If, for example, you only see __eq__ in a class and you only disable __eq__ and leave the attrs-generated __ne__ in place, then we'd have to document the implementation of __ne__ (e.g. NotImplemented if a different class, or inverse of __eq__) and call that part of the API. That seems straightforward enough, I guess (though committing to an implementation, even a very simple one, makes me nervous).

But then what happens if you instead find a class that only defines __ne__? Does __eq__ simply invert that or do what it currently does?
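For reference, the "inverse of `__eq__`, or `NotImplemented` for a different class" variant mentioned above could be sketched as follows. `Money` is a made-up class, and nothing in this thread commits to this being the generated method.

```python
class Money:
    def __init__(self, cents):
        self.cents = cents

    def __eq__(self, other):
        if not isinstance(other, Money):
            return NotImplemented
        return self.cents == other.cents

    def __ne__(self, other):
        # Delegate to __eq__ and invert, passing NotImplemented through untouched.
        result = self.__eq__(other)
        if result is NotImplemented:
            return NotImplemented
        return not result


same = Money(100) != Money(100)        # False
different = Money(100) != Money(250)   # True
foreign = Money(100).__ne__("100")     # NotImplemented
```

Once such a method is left in place next to a user-written `__eq__`, its exact behavior becomes something users can observe and rely on, which is the API commitment being questioned here.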

hynek · 2020-01-27T07:26:21Z

Yeah this is exactly why I’m apprehensive of the granular version. I’m already seeing the issues and questions from confused people.

Couldn’t people actually use total_ordering?

wsanchez · 2020-01-27T15:36:54Z

Yeah this is exactly why I’m apprehensive of the granular version. I’m already seeing the issues and questions from confused people.

For sure. What I mean is that for your "should it just prevent the one method or all of the related ones?" question, I would opt for the latter. Otherwise, you either have to explain how what is left is implemented, or do something like total_ordering.

Couldn’t people actually use total_ordering?

Sure. Couldn't auto_detect actually use total_ordering? :-)

hynek · 2020-01-29T19:30:27Z

Couldn’t people actually use total_ordering?

Sure. Couldn't auto_detect actually use total_ordering? :-)

It could but it would introduce an unprecedented amount of magic. I really prefer to add one line and be clear about what's happening here.

Unless I'm missing something, I think I've made up my mind here. I should add a functional test that verifies that people actually can use total_ordering.

hynek · 2020-02-08T09:50:39Z

I have added an example for @total_ordering in 2cd16a2.
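As a reminder of what `functools.total_ordering` supplies on its own, here is a small sketch. The `Version` class is made up, and the attrs decorator is deliberately left out so the snippet runs with the standard library alone.

```python
from functools import total_ordering


@total_ordering
class Version:
    def __init__(self, major, minor):
        self.major = major
        self.minor = minor

    def _key(self):
        return (self.major, self.minor)

    def __eq__(self, other):
        if not isinstance(other, Version):
            return NotImplemented
        return self._key() == other._key()

    def __lt__(self, other):
        if not isinstance(other, Version):
            return NotImplemented
        return self._key() < other._key()


# __le__, __gt__ and __ge__ were filled in by total_ordering.
older, newer = Version(1, 2), Version(1, 3)
results = (older <= newer, newer > older, newer >= older)
```

The user writes `__eq__` plus one ordering method and adds one decorator line; the remaining comparisons are derived from those two.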

pganssle

My initial intuition here is to agree with @wsanchez. As a user, having attrs use roughly the same semantics as functools.total_ordering would be the kind of thing I'd use to pitch the library to people: look at how little boilerplate you need for this! Look how seamlessly it scales up! Truly they are as gods among men who have crafted this masterpiece!

That said, I suspect that @hynek's reluctance here is at least partially born of having tried to do it that way in the first place and realizing all the edge cases that would be super hard to address. I agree that the two options are basically "do it like functools.total_ordering" and "implement everything or nothing".

One possibility that I think might allow us to kick the can down the road a bit: what if we made the ambiguous condition (the relevant attrs attribute is unset but you've only defined one of the dunder methods) a hard failure, with a voluminous error message explaining other ways to accomplish the goal? Assuming @functools.total_ordering actually defines all the methods, that should allow people to choose whether they want to just decorate with @functools.total_ordering or to explicitly disable the relevant method auto-generation.

If we know that it's a hard failure, a later release adding functools.total_ordering-like (or really any) semantics to the library would be fully backwards compatible.

pganssle · 2020-02-11T12:57:22Z

changelog.d/607.change.rst

@@ -0,0 +1,5 @@
+``attrs`` can now automatically detect your own implementations and infer ``init=False``, ``repr=False``, ``eq=False``, ``order=False``, and ``hash=False`` if you set ``@attr.s(auto_detect=True)``.
+``attrs`` will ignore inherited methods.
+If the argument implies more than one methods (e.g. ``eq=True`` creates both ``__eq__`` and ``__ne__``), it's enough for *one* of them to exist and ``attrs`` will create *neither*.


Suggested change

-If the argument implies more than one methods (e.g. ``eq=True`` creates both ``__eq__`` and ``__ne__``), it's enough for *one* of them to exist and ``attrs`` will create *neither*.
+If the argument implies more than one method (e.g. ``eq=True`` creates both ``__eq__`` and ``__ne__``), it's enough for *one* of them to exist and ``attrs`` will create *neither*.

hynek · 2020-02-12T07:13:59Z

Honestly to me this is much less about the complexity to implement it (that's a one-time deal mostly) and much more about the clarity and obviousness when looking at attrs-using code.

I don't think expecting people to add one line to their code is too much to ask, if it makes the whole thing more explicit and clear about what happens. Also once we implement #602 (which after this lands might be a very good opportunity), the users can directly deduce attrs' behavior from the settings.

attrs was always about saving boilerplate but it also was always about being explicit and clear. Here we have to choose what is more important and my gut feeling is telling me that clarity wins.

pganssle · 2020-02-12T16:25:54Z

Well, my contention is that there are two very intuitive options here and I think either option would be surprising to some people because the semantics of this are simply not obvious.

One problem I have with the "specifying any one method flips the default for all methods" version is that it seems very unlikely that I would want that behavior, so I'd want to have to be explicit about it. If you make it so that specifying __lt__ silently flips ord=True to ord=False, I think you'll get a lot of users confused as to why x >= y is not working as they expected.

I think some people will want to explicitly define a partial ordering, but those people will know that they want it, and they'll either implement all the methods or they can explicitly provide ord=False.
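To make the concern concrete, here is what plain Python does when a class defines only `__lt__` and nothing else is generated. `Priority` and `raises_type_error` are made-up names for this sketch.

```python
class Priority:
    """Made-up class that defines only __lt__; no other ordering method exists."""

    def __init__(self, level):
        self.level = level

    def __lt__(self, other):
        if not isinstance(other, Priority):
            return NotImplemented
        return self.level < other.level


def raises_type_error(compare):
    """Return True if calling `compare` raises TypeError."""
    try:
        compare()
    except TypeError:
        return True
    return False


low, high = Priority(1), Priority(2)

lt_works = low < high   # uses __lt__ directly
gt_works = high > low   # Python retries with the reflected operand: low < high
le_fails = raises_type_error(lambda: low <= high)
ge_fails = raises_type_error(lambda: high >= low)
```

The reflected-operand fallback keeps `>` working between two instances, but `<=` and `>=` raise `TypeError`, so a class like this only fails once one of those operators is actually exercised.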

Given the asymmetry in expectations, I would think that the intuitive options are:

1. Specifying a partial ordering / partial equality with auto detection and eq/ord not explicitly set raises an exception (so anyone who thinks it works like the second option is put on notice).
2. Specifying a partial ordering / partial equality with auto detection defines a total ordering in terms of the user-defined methods.
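
The exception-raising option could be sketched roughly like this. `require_all_or_none` is a hypothetical helper, not an attrs function, and the `order` flag name follows the changelog entry rather than the `ord` shorthand used in this comment.

```python
from functools import total_ordering

ORDER_METHODS = ("__lt__", "__le__", "__gt__", "__ge__")


def require_all_or_none(cls, methods, flag):
    """Hypothetical check: raise if the class body defines only some of `methods`."""
    own = [name for name in methods if name in vars(cls)]
    if own and len(own) != len(methods):
        missing = [name for name in methods if name not in own]
        raise TypeError(
            f"{cls.__name__} defines {own} but not {missing}; "
            f"set {flag}=False explicitly or define the remaining methods"
        )
    return cls


class Partial:
    def __lt__(self, other):
        return NotImplemented


try:
    require_all_or_none(Partial, ORDER_METHODS, "order")
    partial_rejected = False
except TypeError:
    partial_rejected = True

# A class completed by total_ordering has all four methods in its own namespace.
Complete = require_all_or_none(total_ordering(Partial), ORDER_METHODS, "order")
```

Because the ambiguous case fails loudly, later giving it a meaning (for example, deriving the missing methods) would not change the behavior of any class that currently works.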

I suspect that if you go with the first option, you'll get a bunch of people who say, "Why raise an exception, there's not really another thing you'd want to do here!" Of course, people thoughtlessly complain about everything with non-obvious semantics as if the semantics should be obvious, so that's not necessarily a lot to go on.

That said, maybe I'm completely missing some edge cases where this could lead to major footguns, so I'm totally willing to be convinced.

As I said before, if we go with option 1 (throwing in the ambiguous case), we should have the option to switch to option 2 in the future in a backwards-compatible way if it turns out I'm right about the asymmetry in use cases and intuition.

wsanchez · 2020-02-12T21:13:59Z

I've come around to agreeing with @hynek about users just using total_ordering explicitly if that's what they want.

I'm sympathetic to the argument that if ord=True (or unspecified, since it's normally the default), raising an exception when you add an __lt__ method may be more correct, in a sense, than altering the default behavior. But attrs changes the defaults in other similar cases, so I don't think that we should be inconsistent about that in this one.

pganssle · 2020-02-12T21:58:58Z

Hm, I feel like I must be missing something, because my most recent post started out conceding the point and as I started trying to write out my reasoning for that I ended up going in exactly the opposite direction.

For me the issue comes down to what behavior users would want and what they would expect. I would think that most people who define a partial ordering want a total ordering. I can think of very few scenarios where that isn't the case, and in all of those scenarios, I'd be trying to craft specific semantics and I would test those semantics explicitly.

To me it seems like the fraction of people who don't want total order extrapolation, who expect that defining a partial order will set ord=False, and who don't carefully test their implementation is very low. On the other hand, in addition to the prior of "the user wants total ordering" being much higher, I think the fraction of those people who don't carefully test the ordering semantics is higher as well, so I'd expect a lot of broken classes being deployed unless ord=None + partial ordering = total ordering OR ord=None + partial ordering = exception.

But maybe my priors are way off. Do you disagree with my premises, my reasoning or am I maybe missing another line of reasoning here?

(Note: I will admit that the "should you generate __eq__ when only __ne__ is defined" question almost had me change my mind here, since generating __eq__ from __ne__ seems weird, but having eq do something other than ord would be very bad. In the end, I decided that these practical concerns override my squeamishness about generating an __ne__.)

hynek · 2020-02-13T06:46:15Z

As Paul wrote correctly: no matter what way we go, someone is going to be confused. I think this is the premise that everything starts from.

There has been a lot of text and I'm a bit lost, but it seems to me, that Paul is arguing for an implicit total ordering?

As I've laid out before, I don't like it because it's too much implicit behavior. I like being able to look at an attrs class and know what's happening without considering all kinds of implications and I think asking the user to add one line @functools.total_ordering is a good compromise here.

Making the existence of methods switch off a switch is a very clear semantic that can be easily communicated: "if you implement __eq__ then eq=False". This is not true if we just skip implementing certain methods based on their presence. The value of eq suddenly means nothing anymore. There's no way we can make the behavior introspectable.

This feels harder to understand and harder to reason about – especially if we magically start applying a total ordering to the class.

But maybe I'm just misunderstanding? We def – at least partially – argue about different things here.

Random nit: in future APIs ord will be False by default I think. Most people don't need it so it's unnecessary baggage.

pganssle · 2020-02-13T17:45:04Z

There has been a lot of text and I'm a bit lost, but it seems to me, that Paul is arguing for an implicit total ordering?

I like implicit total ordering as the behavior of ord=True (and the default behavior if ord defaults to True) when a partial ordering is defined, and I am not sure I totally grok the arguments against it. Yes, it is a bit magical, but in the sense that it will magically do what almost everyone wants and probably won't bother the people who don't want that behavior. I'm definitely willing to concede that you and @wsanchez understand the ways the semantics can get hairy better than me, so my position is "I'm probably wrong about this."

The thing I am more confident about is that if you don't go with total ordering, then unless ord is explicitly set to False, it should be an exception to define a partial ordering, because I think almost no one actually wants a partial ordering, and it's better to loudly fail and say, "If you really want a partial ordering, set ord=False; if you want a total ordering, use functools.total_ordering".

Making the existence of methods switch off a switch is a very clear semantic that can be easily communicated: "if you implement __eq__ then eq=False". This is not true if we just skip implementing certain methods based on their presence. The value of eq suddenly means nothing anymore. There's no way we can make the behavior introspectable.

This is very fair, and I think I'm starting to understand the concern here. I was thinking that eq=True would mean "this class is guaranteed to define both __eq__ and __ne__" and ord=True would mean "this class is guaranteed to define a total ordering" and not necessarily "if x is True/False, attrs generates these methods". I think that's fairly easy to grok, but if you think that what people care about is what methods attr.s will generate or if exposing that information in a public interface is a critical design goal, then I concede the point about total ordering (though I'm still not quite convinced on the point of silent vs. exception).

hynek · 2020-03-05T15:21:18Z

Paul, to sum it up (and to stop me procrastinating on this): you would be (begrudgingly) OK with the current implementation, if we raise a Warning if the user forgets to implement certain methods.

Am I understanding you correctly?

pganssle · 2020-03-06T14:39:57Z

@hynek Yeah, that sounds about right.

hynek · 2020-03-07T13:54:43Z

So I've implemented the warning and realized that there's a problem: we'd force the decorator order on the users.

@attr.s(auto_detect=True)
@total_ordering
class C:
    ...

would be fine while

@total_ordering
@attr.s(auto_detect=True)
class C:
    ...

would annoy the users.

I'm not really sure what to do here?
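
The ordering problem can be reproduced with the standard library alone. In the sketch below, `warn_if_partial_order` is a hypothetical stand-in for the check inside the attrs decorator (it is not attrs code), and `build` is a made-up helper that applies two decorators in a given order.

```python
import warnings
from functools import total_ordering

ORDER_METHODS = ("__lt__", "__le__", "__gt__", "__ge__")


def warn_if_partial_order(cls):
    """Hypothetical stand-in for the decorator's check: warn on a partial ordering."""
    own = [name for name in ORDER_METHODS if name in vars(cls)]
    if own and len(own) < len(ORDER_METHODS):
        warnings.warn(f"{cls.__name__} defines only {own}", stacklevel=2)
    return cls


def build(outer, inner):
    """Define a class as `@outer @inner class C` and return the warnings recorded."""
    with warnings.catch_warnings(record=True) as caught:
        warnings.simplefilter("always")

        @outer
        @inner
        class C:
            def __init__(self, x):
                self.x = x

            def __eq__(self, other):
                return self.x == other.x

            def __lt__(self, other):
                return self.x < other.x

    return caught


# Decorators run bottom-up, so the inner one sees the class first.
check_outside = build(warn_if_partial_order, total_ordering)  # check sees all four methods
check_inside = build(total_ordering, warn_if_partial_order)   # check sees only __lt__
```

With the check on the outside, `total_ordering` has already filled in the missing methods and nothing is reported; with the check on the inside, the warning fires even though the finished class ends up with a complete ordering.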

pganssle · 2020-03-07T21:38:13Z

Hm... That is indeed annoying.

I think it really depends on something I don't have a huge amount of information about, which is the baseline rates of this kind of confusion. If a lot of people are going to be silently getting broken behavior from this because they don't know they need to use functools.total_ordering with auto_detect=True if they want to define a custom comparison, then it's probably justified to warn people who do the decorator in the wrong order (possibly with an additional line like, "Make sure the @attr.s decorator is on top to avoid this warning").

If almost everyone who writes one knows to use functools.total_ordering but tends to put it outside the attr.s decorator, then it's just annoying people for no reason.

Luckily, I don't think either decision is irreversible (at least when deciding between warning / nothing rather than deciding between exception / warning / nothing). You can always start raising a warning later if you find people are having this problem a lot, or you can start with the warning and remove it if people complain a lot.

I think you probably have a better handle than me on what attrs users will want, so I'm confident that whatever you choose is right. Sorry if all I've managed to add to this thread was delays and extra noise.

hynek · 2020-03-08T09:13:04Z

You can always start raising a warning later if you find people are having this problem a lot, or you can start with the warning and remove it if people complain a lot.

Yeah I think we'll go without for now – with a fat warning (that nobody will read).

I think you probably have a better handle than me on what attrs users will want, so I'm confident that whatever you choose is right.

With 26 million downloads per month, I don't think there's such a thing as an “attrs user”. :) I guess the best thing I can do is to build something I want to use.

Sorry if all I've managed to add to this thread was delays and extra noise.

Heh delays caused by food for thought are always very welcome.

hynek · 2020-03-08T09:39:23Z

Since I believe we have agreement on the design/API, I hope someone will find the time to give the implementation a review now. ❤️🐶


hynek · 2020-03-16T12:03:03Z

OK I don't think waiting any further will make anyone come forward so I'm just YOLOing it in. Otherwise I'll never get Operation import attrs done.

hynek force-pushed the auto-detect branch from 2018a3e to 46fd2a3 Compare January 3, 2020 15:04

hynek force-pushed the auto-detect branch 2 times, most recently from 4e9db0e to 86589a1 Compare January 6, 2020 11:46

wsanchez added the Feature label Jan 7, 2020

hynek force-pushed the auto-detect branch from 77c784f to e62921c Compare February 8, 2020 08:57

hynek force-pushed the auto-detect branch from e32959b to dc25251 Compare February 10, 2020 16:29

pganssle reviewed Feb 11, 2020


hynek force-pushed the master branch from 55d5ef4 to 9fcfe34 Compare March 6, 2020 09:35

hynek force-pushed the auto-detect branch from 3dedd12 to 2396ecc Compare March 7, 2020 10:36

hynek force-pushed the auto-detect branch from 2396ecc to 09bf52d Compare March 8, 2020 09:24

hynek added this to the 20.1.0 milestone Mar 8, 2020

hynek force-pushed the auto-detect branch from 09bf52d to 38c1c8c Compare March 13, 2020 09:04

hynek added 6 commits March 13, 2020 11:07

Implement auto_detect

1246c43

Fixes #324

Add test demonstrating total_ordering

bdd9759

Ensure the order of applying total_ordering does not matter

5126357

Warn if a method is missing

cb58e91

Revert "Warn if a method is missing"

85f36fc

This reverts commit 590ef43.

Add stern warning that nobody will read

379f284

hynek force-pushed the auto-detect branch from 38c1c8c to 379f284 Compare March 13, 2020 10:11

hynek added 2 commits March 13, 2020 11:48

Merge branch 'master' into auto-detect

f4e2ef0

Merge branch 'master' into auto-detect

783f2c6

hynek merged commit 196d948 into master Mar 16, 2020

hynek deleted the auto-detect branch March 16, 2020 12:03

hynek mentioned this pull request Mar 16, 2020

Support custom __getstate__, __setstate__ for slotted classes #513

Closed

10 tasks

hynek mentioned this pull request May 14, 2020

[RFC] Inconvenient defaults? #487

Closed
