Join GitHub today
GitHub is home to over 28 million developers working together to host and review code, manage projects, and build software together.Sign up
Set operation does not handle duplicate rows #333
Forgive me if this is the intended result, but it appears as though the set operation (complement(), intersection(), et. al) does not take into consideration the occurrence of duplicate rows.
Using an updated example from the documentation:
I would have thought that both occurrences of
Sincere apologies for leaving this hanging for so many months. For the record, I made a deliberate change way back to the behaviour of complement() with duplicate rows in response to #2 (originally discussed in #1). This was to make the diff() function behave as expected by users. So the current behaviour is intended.
It would be possible to add an option to complement to make it a strict set-like operation. I think I will do this to make the default behaviour clear for users.