DISTINCT is ignored with IN restrictions #2837
When a query uses IN restrictions, DISTINCT is not applied to the returned result set.
The problematic part of the query (and of query processing) is not the DISTINCT notation but the list of non-unique values in the "IN" clause. Duplicate values trigger multiple executors on the same partition, each of which returns the same result set, so the combined result set simply contains duplicates. I will submit a patch that makes the values list unique before execution.
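
To illustrate the behavior described above, here is a minimal Python sketch. It is not the project's code: the in-memory `TABLE` and the functions `read_partition`, `select_in`, `unique_in_order` and `select_in_deduplicated` are all hypothetical names made up for this example. It assumes one read per IN value and that keeping first-occurrence order is acceptable; the actual patch may order the values differently.

```python
# Hypothetical in-memory "table": partition key -> rows in that partition.
TABLE = {
    1: [{"pk": 1}],
    2: [{"pk": 2}],
}


def read_partition(table, key):
    """Stand-in for one executor reading a single partition."""
    return list(table.get(key, []))


def select_in(table, keys):
    """Run one read per IN value and concatenate the results.

    A repeated key means the same partition is read more than once,
    so its rows show up more than once in the combined result.
    """
    rows = []
    for key in keys:
        rows.extend(read_partition(table, key))
    return rows


def unique_in_order(keys):
    """Drop repeated IN values, keeping first-occurrence order (an assumption)."""
    return list(dict.fromkeys(keys))


def select_in_deduplicated(table, keys):
    """Same as select_in, but the values list is made unique before execution."""
    return select_in(table, unique_in_order(keys))


if __name__ == "__main__":
    in_values = [1, 1, 2]  # the IN list contains a duplicate
    print(select_in(TABLE, in_values))
    print(select_in_deduplicated(TABLE, in_values))
```

With the duplicate value in the list, `select_in` returns the row for partition 1 twice, while `select_in_deduplicated` returns each partition's rows once.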