Query: Identifying columns in the case of distinct #15873

smitpatel · 2019-05-31T00:47:14Z

No description provided.

smitpatel · 2019-11-12T18:49:56Z

If a collection applies Distinct and does not project the required identifying columns, we should throw an exception saying that collection.Distinct() is not allowed.
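The failure mode can be modeled outside EF Core. The following is a minimal Python sketch, not EF Core code: `bucket_rows`, the column names, and the sample rows are all hypothetical. It assumes the materializer needs a parent identifier and a child identifier on every flat row in order to assign rows to collection elements.

```python
from collections import defaultdict


def bucket_rows(rows, parent_key, child_key):
    """Group flat join rows into {parent identifier: [child rows]}.

    parent_key and child_key are tuples of column names (hypothetical
    "identifying columns"). A row lacking any of them cannot be bucketed.
    """
    buckets = defaultdict(dict)
    for row in rows:
        missing = [c for c in (*parent_key, *child_key) if c not in row]
        if missing:
            raise ValueError(
                f"cannot bucket row, missing identifying columns: {missing}"
            )
        parent_id = tuple(row[c] for c in parent_key)
        child_id = tuple(row[c] for c in child_key)
        # Rows sharing a child identifier describe the same collection element.
        buckets[parent_id].setdefault(child_id, row)
    return {p: list(children.values()) for p, children in buckets.items()}


# Full projection: both identifiers are present, so bucketing works.
full = [
    {"customer_id": 1, "order_id": 10, "city": "Oslo"},
    {"customer_id": 1, "order_id": 11, "city": "Oslo"},
]
grouped = bucket_rows(full, ("customer_id",), ("order_id",))

# Distinct over a subset of columns drops order_id from the projection.
distinct_subset = [{"customer_id": 1, "city": "Oslo"}]
try:
    bucket_rows(distinct_subset, ("customer_id",), ("order_id",))
    distinct_error = None
except ValueError as exc:
    distinct_error = str(exc)
```

In this model the second call fails because the child identifier is no longer part of the projection, which is the situation the comment proposes to reject with an exception.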

smitpatel · 2019-11-22T03:14:58Z

Consider the scenario in #11178.

smitpatel · 2020-03-04T04:34:46Z

Also consider table-valued functions (TVFs).

maumar · 2020-06-08T19:49:45Z

example: #20505

smitpatel · 2020-07-14T21:27:45Z

Example: a GroupBy aggregate in the projection that does not project the correlation key column.
See the test Select_nested_collection_with_groupby.
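A rough illustration of why the correlation key matters, written as a Python model rather than EF Core code. The `orders` rows, the `group_count` helper, and the column names are hypothetical. It assumes the grouped subquery is joined back to its parent, so each result row must carry the key that ties it to a parent.

```python
from collections import Counter

# Hypothetical child rows; customer_id plays the role of the correlation key.
orders = [
    {"customer_id": 1, "status": "open"},
    {"customer_id": 1, "status": "open"},
    {"customer_id": 2, "status": "open"},
]


def group_count(rows, keys):
    """Group rows by the given key columns and project keys plus a count."""
    counts = Counter(tuple(r[k] for k in keys) for r in rows)
    return [dict(zip(keys, key), count=n) for key, n in counts.items()]


# Grouping key without the correlation key: one row, no owner.
without_key = group_count(orders, ["status"])

# Grouping key including the correlation key: one row per customer and status.
with_key = group_count(orders, ["customer_id", "status"])
```

Here `without_key` collapses both customers into a single row that cannot be assigned to either of them, while `with_key` keeps `customer_id` in every row so the results can be bucketed per customer.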

Added a validation step to AddCollectionJoin that checks, when the subquery contains Distinct or GroupBy, that the projection contains all identifying columns needed to correctly bucket the results during materialization. Fixes #15873

Added a validation step to AddCollectionJoin that checks, when the subquery contains Distinct or GroupBy, that the projection contains all identifying columns needed to correctly bucket the results during materialization. Also made sure that identifying columns are correctly propagated during pushdown and joining. If they cannot be propagated, we mark them as such (by removing the identifying columns altogether) so that we can throw an exception when these columns are actually needed. Fixes #15873

Added a validation step to AddCollectionJoin that checks, when the subquery contains Distinct or GroupBy, that the projection contains all identifying columns needed to correctly bucket the results during materialization. Also made sure that identifying columns are correctly propagated during pushdown and joining. If they cannot be propagated, we mark them as such (by removing the identifying columns altogether) so that we can throw an exception when these columns are actually needed. Fixes #15873 Fixes #20184

ajcvickers · 2020-08-31T21:50:09Z

@maumar @smitpatel This issue is marked closed_fixed in rc1, but is still open and doesn't seem to be tracked by a PR.

…21990) Added a validation step to AddCollectionJoin that checks, when the subquery contains Distinct or GroupBy, that the projection contains all identifying columns needed to correctly bucket the results during materialization. Also made sure that identifying columns are correctly propagated during pushdown and joining. If they cannot be propagated, we mark them as such (by removing the identifying columns altogether) so that we can throw an exception when these columns are actually needed. Fixes #15873 Fixes #20184
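The validation described in these commit messages could be sketched roughly as follows. This is a Python model under stated assumptions, not the actual AddCollectionJoin code: the function, the exception class, and the parameters are hypothetical names. An empty identifier list stands in for identifiers that were removed because they could not be propagated. The sketch also assumes a plain subquery (no Distinct or GroupBy) always passes, because missing identifying columns can simply be added to its projection.

```python
class InsufficientIdentifiersError(Exception):
    """Raised when collection rows cannot be identified (hypothetical name)."""


def validate_collection_subquery(projection, identifying_columns,
                                 is_distinct=False, has_group_by=False):
    """Reject subqueries whose rows cannot be bucketed during materialization."""
    # An empty list models identifiers that were dropped because they could
    # not be propagated through pushdown or joining.
    if not identifying_columns:
        raise InsufficientIdentifiersError(
            "identifying columns are not available"
        )
    # Distinct and GroupBy fix the shape of the projection, so every
    # identifying column must already be part of it.
    if is_distinct or has_group_by:
        missing = [c for c in identifying_columns if c not in projection]
        if missing:
            raise InsufficientIdentifiersError(
                f"projection is missing identifying columns: {missing}"
            )


def check(**kwargs):
    """Return None when validation passes, otherwise the error message."""
    try:
        validate_collection_subquery(**kwargs)
        return None
    except InsufficientIdentifiersError as exc:
        return str(exc)


ok = check(projection=["order_id", "city"],
           identifying_columns=["order_id"], is_distinct=True)
missing_id = check(projection=["city"],
                   identifying_columns=["order_id"], is_distinct=True)
dropped = check(projection=["city"], identifying_columns=[])
plain = check(projection=["city"], identifying_columns=["order_id"])
```

Only `missing_id` and `dropped` produce an error in this model: the first because Distinct removed the identifier from the projection, the second because the identifiers were marked as unavailable.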

…s when projecting a subset of column and adding distinct

As a fix to #15873 we started blocking some scenarios that used to work (by accident): cases where a subquery using Distinct or GroupBy happens not to have any duplicates. The fix is to enable those scenarios (and others) by modifying the identifier columns in the case of Distinct and GroupBy, if the original identifiers are not already present. In the case of Distinct, the entire projection becomes the unique identifier, since Distinct guarantees it to be unique. In the case of GroupBy, the grouping key becomes the identifier: since we only support the grouping key or a group aggregate in the projection, we are also guaranteed to have one row per unique grouping key.

Also a fix to #24288 - Query: add collection join tries to convert correlated collection from APPLY to JOIN for subqueries with Distinct and GroupBy, which is incorrect

We would always try to convert a subquery with GroupBy or Distinct from APPLY to JOIN; however, we can only do this if the projection already contains the join key. Otherwise, adding the join key to the projection would change the meaning of the operation in the case of Distinct and create an invalid query in the case of GroupBy (projecting a column that is not part of the grouping key or an aggregate).

Fixes #22049 Fixes #24288
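The two rules in this commit message can be summarized in a short Python sketch. It is a model of the described behavior, not the EF Core implementation: both function names and all column names are hypothetical. The fallback for a plain subquery (keep the original identifier) is an assumption.

```python
def effective_identifier(projection, original_identifier,
                         is_distinct=False, grouping_key=None):
    """Pick the columns that identify a row of the subquery result."""
    # Original identifiers still projected: keep using them.
    if all(c in projection for c in original_identifier):
        return list(original_identifier)
    # Distinct guarantees unique rows, so the whole projection identifies a row.
    if is_distinct:
        return list(projection)
    # GroupBy yields one row per grouping key, so the key identifies a row.
    if grouping_key is not None:
        return list(grouping_key)
    # Assumption: a plain subquery keeps its original identifier.
    return list(original_identifier)


def can_convert_apply_to_join(projection, join_key):
    """For a subquery with Distinct or GroupBy: conversion from APPLY to JOIN
    is only safe when the join key is already part of the projection."""
    return all(c in projection for c in join_key)


distinct_id = effective_identifier(["city"], ["order_id"], is_distinct=True)
group_id = effective_identifier(["status", "count"], ["order_id"],
                                grouping_key=["status"])
kept_id = effective_identifier(["order_id", "city"], ["order_id"],
                               is_distinct=True)
join_ok = can_convert_apply_to_join(["customer_id", "city"], ["customer_id"])
join_blocked = can_convert_apply_to_join(["city"], ["customer_id"])
```

In this model a Distinct over `city` is identified by `city` itself, a GroupBy over `status` is identified by `status`, and a subquery that does not already project `customer_id` stays as APPLY instead of being rewritten to a JOIN.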

ajcvickers added this to the 3.0.0 milestone May 31, 2019

ajcvickers added the type-enhancement label May 31, 2019

ajcvickers assigned smitpatel May 31, 2019

smitpatel added the propose-punt label Jun 26, 2019

ajcvickers modified the milestones: 3.0.0, Backlog Jun 28, 2019

ajcvickers added punted-for-3.0 and removed propose-punt labels Jun 28, 2019

smitpatel removed their assignment Aug 7, 2019

maumar mentioned this issue Oct 8, 2019

Query: query with group by in subquery produces invalid SQL #15279

Closed

smitpatel mentioned this issue Oct 31, 2019

Nested select can't use IEqualityComparer #18608

Closed

smitpatel added the area-query label Nov 19, 2019

smitpatel mentioned this issue Dec 30, 2019

NavigationExpandingExpressionVisitor #18874

Closed

smitpatel mentioned this issue Apr 14, 2020

Distinct done on too much columns for subqueries #19826

Closed

smitpatel mentioned this issue Apr 28, 2020

Invalid SQL generated with 3.1 #20505

Closed

ajcvickers modified the milestones: Backlog, 5.0.0 May 1, 2020

ajcvickers assigned maumar May 1, 2020

maumar mentioned this issue Aug 6, 2020

Fix to #15873 - Query: Identifying columns in the case of distinct #21950

Closed

maumar mentioned this issue Aug 7, 2020

Fix to #15873 - Query: Identifying columns in the case of distinct #21990

Merged

Pilchie closed this as completed in #21990 Sep 1, 2020

maumar mentioned this issue Sep 29, 2020

GroupBy Children - N+1 DB trips #10472

Closed

maumar added the breaking-change label Oct 23, 2020

ajcvickers modified the milestones: 5.0.0-rc1, 5.0.0 Nov 7, 2020

maumar mentioned this issue Feb 27, 2021

Fix to #22049 - Query: consider updating select expression identifiers when projecting a subset of column and adding distinct #24293

Merged

Emill mentioned this issue Feb 28, 2021

Translate queries where a row includes a collection to use arrays instead of being implemented through joins npgsql/efcore.pg#1691

Open

maumar mentioned this issue May 31, 2022

EF Core 6 - Unable to translate a collection subquery in a projection #28130

Closed

maumar mentioned this issue Dec 1, 2022

Union with .SelectMany is not translated #29718

Open

maumar mentioned this issue Jan 5, 2023

InsufficientInformationToIdentifyElementOfCollectionJoin error with join of join selection in queries with union #29975

Open
