SQL: Do not lift types unless explicitly asked for #209

rogeralsing · 2017-08-22T13:02:12Z

When running aggregating SQL queries over numeric fields, SC will lift the underlying type to a bigger one.

Int32 is lifted to Int64, Double is lifted to Decimal.
This is unintuitive and feels weird, especially lifting Double to Decimal which is not even the same kind of floating point mechanics.

This has implications for e.g. the Linq provider: when you do .Sum(x => x.Int32Prop), Starcounter will actually return an Int64.
We have to have special hacks that check whether the generic T returned is Int32, then run the query expecting an Int64, cast, and return.

I get that operations like Sum can overflow their type, but this still applies to whatever type we lift to; it's just a matter of volume.
I'd much rather see that we get the type of the property itself, and add the ability to specify a larger type if you need it.
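The kind of hack described above can be sketched roughly as follows. This is a minimal illustration, not the actual Starcounter.Linq code: `AggregateAdapter`, `NarrowSum`, and the `runQuery` delegate (standing in for the real query call that returns the lifted Int64) are all hypothetical names.

```csharp
using System;

public static class AggregateAdapter
{
    // runQuery is a hypothetical stand-in for the SQL call; it returns the
    // lifted Int64 that SUM over an Int32 column produces.
    public static T NarrowSum<T>(Func<long> runQuery) where T : struct
    {
        long lifted = runQuery();

        // Convert.ChangeType does a range-checked conversion and throws
        // OverflowException if the lifted value does not fit into T.
        return (T)Convert.ChangeType(lifted, typeof(T));
    }
}
```

For example, `AggregateAdapter.NarrowSum<int>(() => 7L)` yields the Int32 value 7, while a lifted value above `int.MaxValue` throws instead of silently truncating.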

rogeralsing · 2017-08-22T13:03:18Z

This is also an issue due to X6Decimal: #169 (comment)

miyconst · 2017-08-31T06:52:17Z

> Int32 is lifted to Int64.

This is not a big deal, and can be omitted if difficult to implement.

> Double is lifted to Decimal

This one is a real problem, since Double and Decimal are different data types, and when the result is not able to fit into current X6Decimal precision it crashes.
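To illustrate why lifting Double to a decimal type is lossy in range, here is a small sketch using the plain .NET `System.Decimal` (not X6Decimal, whose exact limits are not stated in this thread). The class and method names are made up for the example.

```csharp
using System;

public static class DoubleVsDecimal
{
    // Returns true when the double value can be represented as System.Decimal.
    public static bool FitsInDecimal(double value)
    {
        try
        {
            // Throws OverflowException when value is outside the decimal range
            // (decimal.MaxValue is roughly 7.9e28) or is NaN/infinity.
            Convert.ToDecimal(value);
            return true;
        }
        catch (OverflowException)
        {
            return false;
        }
    }
}
```

`FitsInDecimal(1e28)` is true while `FitsInDecimal(1e30)` is false, so any double aggregate beyond that range cannot be returned as a decimal at all. A decimal type with a narrower range than `System.Decimal` hits the same wall earlier.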

rogeralsing · 2017-09-01T07:50:27Z

> This is not a big deal, and can be omitted if difficult to implement.

It sort of is, when it comes to building generic methods on top of the API.
E.g. in Linq, for any operation like Sum, Count, etc. we need to have hacks to solve this right now, as the code expects Int32 while the SC API will give you Int64 under those conditions.

Mackiovello · 2017-10-09T20:16:11Z

From your description, it seems like the type is just lifted one level. When running aggregates in SQL queries, it seems like it lifts the type several levels:

using Starcounter;
using System.Linq;

[Database]
public class Person
{
    public int Age { get; set; }
}

class Program
{
    static void Main()
    {
        Db.Transact(() =>
        {
            new Person { Age = 1 };
            new Person { Age = 2 };
            new Person { Age = 4 };
        });

        Db.SQL("SELECT AVG(p.Age) FROM Person p").FirstOrDefault(); // ScErrCLRDecToX6DecRangeError 
    }
}

Is there any workaround for this? It would be good to document it.

un-tone · 2018-02-23T18:46:15Z

> Double is lifted to Decimal

What is the case for this? It looks like there are no issues with the Double type: AVG for it returns a double value.

un-tone · 2018-02-23T19:56:56Z

Shouldn't AVG return double type result, not decimal? Enumerable.Average works exactly so, for example. It will solve issues like in @Mackiovello's comment above.

cc @miyconst @bigwad @k-rus

un-tone · 2018-02-23T19:58:16Z

Also, I propose to leave Int64 result type for SUM operator for all integer types. It looks reasonable for me.

miyconst · 2018-02-24T15:09:57Z

> Shouldn't AVG return double type result, not decimal? Enumerable.Average works exactly so, for example. It will solve issues like in @Mackiovello's comment above.

Sounds like the right decision for me. @bigwad?

> Also, I propose to leave Int64 result type for SUM operator for all integer types. It looks reasonable for me.

Same here.

bigwad · 2018-02-26T09:40:51Z

We can prefer to adhere to the SQL standard here.

p 6.5.9 says:

    b) If SUM is specified and DT is exact numeric with scale S, then the
       data type of the result is exact numeric with implementation-defined
       precision and scale S.

    c) If AVG is specified and DT is exact numeric, then the data type of
       the result is exact numeric with implementation-defined precision
       not less than the precision of DT and implementation-defined scale
       not less than the scale of DT.

In other words:
(b) says that if we SUM Int32, we can promote the return type to Int64.
(c) says that for AVG we should never change an exact numeric type (i.e. int, decimal) to an approximate type (i.e. double, float).

bigwad · 2018-02-26T09:43:52Z

> Shouldn't AVG return double type result, not decimal? Enumerable.Average works exactly so, for example.

Seems it doesn't: https://msdn.microsoft.com/en-us/library/bb354760(v=vs.110).aspx

public static decimal Average(
	this IEnumerable<decimal> source
)

which makes perfect sense, as otherwise it would ruin decimal's accuracy.
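For reference, the result type of `Enumerable.Average` depends on the element type, which is why both readings above have a point. The sketch below only inspects the runtime type of the result; the class and method names are invented for the example.

```csharp
using System;
using System.Linq;

public static class AverageTypes
{
    // Average over int values is computed and returned as double.
    public static Type OfInt32() => new[] { 1, 2, 4 }.Average().GetType();

    // Average over decimal values stays decimal.
    public static Type OfDecimal() => new[] { 1m, 2m, 4m }.Average().GetType();

    // Average over double values stays double.
    public static Type OfDouble() => new[] { 1.0, 2.0, 4.0 }.Average().GetType();
}
```

So LINQ keeps decimal for decimal input but moves integer input to double, whereas the SQL standard text quoted earlier keeps integer input in the exact numeric family.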

un-tone · 2018-03-01T21:00:12Z

I have added tests for each aggregation operator (except COUNT) and input type combination here https://github.com/Starcounter/level1/compare/v2.3-Home209.
I also made a summary table with result types:

| Operation | Decimal | Single | Double | SByte | Int16 | Int32 | Int64 | Byte | UInt16 | UInt32 | UInt64 |
| --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- |
| SUM | Decimal | Double | Double | Int64 | Int64 | Int64 | Int64 | n/a | n/a | n/a | n/a |
| SUM changes | - | - | - | - | - | - | - | - | - | - | - |
| MAX | Decimal | Double | Double | Int64 | Int64 | Int64 | Int64 | UInt64 | UInt64 | UInt64 | UInt64 |
| MAX changes | - | Single | - | SByte | Int16 | Int32 | - | Byte | UInt16 | UInt32 | - |
| MIN | Decimal | Double | Double | Int64 | Int64 | Int64 | Int64 | UInt64 | UInt64 | UInt64 | UInt64 |
| MIN changes | - | Single | - | SByte | Int16 | Int32 | - | Byte | UInt16 | UInt32 | - |
| AVG | Decimal | Double | Double | Decimal | Decimal | Decimal | Decimal | Decimal | Decimal | Decimal | Decimal |
| AVG changes | - | - | - | Double | Double | Double | Double | Double | Double | Double | Double |

COUNT: Int64

In the "* changes" rows I wrote the result types that I think should be implemented instead of the current ones; "-" means no change.
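The observed (current) rows of the table can also be written down as a small lookup, which may be handy when writing tests against it. This is only a transcription of the table under an invented API: `AggregateTypeTable.Observed` is a hypothetical helper, and `null` stands for the "n/a" cells.

```csharp
using System;

public static class AggregateTypeTable
{
    // Returns the currently observed result type for an aggregate, per the
    // table above; null means "n/a" (SUM over unsigned integer types).
    public static Type Observed(string op, Type input)
    {
        bool isDecimal = input == typeof(decimal);
        bool isFloat = input == typeof(float) || input == typeof(double);
        bool isSigned = input == typeof(sbyte) || input == typeof(short)
                     || input == typeof(int) || input == typeof(long);
        bool isUnsigned = input == typeof(byte) || input == typeof(ushort)
                       || input == typeof(uint) || input == typeof(ulong);
        if (!isDecimal && !isFloat && !isSigned && !isUnsigned)
            throw new ArgumentException("Not a numeric type in the table", nameof(input));

        switch (op)
        {
            case "SUM":
                if (isUnsigned) return null;            // n/a in the table
                break;
            case "MIN":
            case "MAX":
                break;
            case "AVG":
                if (isSigned || isUnsigned) return typeof(decimal);
                break;
            default:
                throw new ArgumentException("Unknown aggregate", nameof(op));
        }

        // Shared rows: Decimal stays Decimal, Single/Double become Double,
        // signed integers become Int64, unsigned integers become UInt64.
        if (isDecimal) return typeof(decimal);
        if (isFloat) return typeof(double);
        return isSigned ? typeof(long) : typeof(ulong);
    }
}
```

For instance, `Observed("MAX", typeof(float))` gives `typeof(double)`, which is exactly the lifting that the "MAX changes" row proposes to remove.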

@bigwad @miyconst please correct me or approve the proposal.

un-tone · 2018-03-02T09:14:00Z

After talking with Kostiantyn, I'm going to update and fix the table with required changes.

bigwad · 2018-03-02T09:14:10Z

SUM - why it's not applicable for unsigned datatypes?
AVG - implemented in accordance with the standard. no changes needed

un-tone · 2018-03-02T09:19:47Z

> SUM - why it's not applicable for unsigned datatypes?

I was getting an exception when I ran the tests with unsigned integers. I'll re-check this and either post an update here or create a new GH issue for it.

bigwad · 2018-03-02T09:24:19Z

> AVG - implemented in accordance with the standard. no changes needed

Except the fact, that our decimal range is less than int64, then AVG should be int64 also. At least when the result exceeds decimal limit.

un-tone · 2018-03-02T09:46:04Z

@miyconst agree that SUM should not be changed.
@bigwad doesn't agree that AVG should be changed.

So, the things we should change is only MIN and MAX operators?

miyconst · 2018-03-02T09:51:40Z

> Except the fact, that our decimal range is less than int64...

Implement full .Net CLR Decimal support.

un-tone · 2018-03-02T10:27:16Z

Created an issue #385 for the error with unsigned integer types.

un-tone · 2018-03-10T10:13:30Z

> So, the things we should change is only MIN and MAX operators?

I need approval of this conclusion before starting the work.

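| Operation | Decimal | Single | Double | SByte | Int16 | Int32 | Int64 | Byte | UInt16 | UInt32 | UInt64 |
| --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- |
| MAX | Decimal | Double | Double | Int64 | Int64 | Int64 | Int64 | UInt64 | UInt64 | UInt64 | UInt64 |
| MAX changes | - | Single | - | SByte | Int16 | Int32 | - | Byte | UInt16 | UInt32 | - |
| MIN | Decimal | Double | Double | Int64 | Int64 | Int64 | Int64 | UInt64 | UInt64 | UInt64 | UInt64 |
| MIN changes | - | Single | - | SByte | Int16 | Int32 | - | Byte | UInt16 | UInt32 | - |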

miyconst · 2018-03-12T06:54:06Z

@bigwad what do you say? I think the final table looks correct, but we need an option to cast smaller data types to bigger ones, in case of the overflow exception.

bigwad · 2018-03-12T13:08:49Z

> I think the final table looks correct, but we need an option to cast smaller data types to bigger ones, in case of the overflow exception.

Don't get your point on overflow. Are we discussing the latest table with MIN and MAX functions?

miyconst · 2018-03-12T13:13:29Z

> Don't get your point on overflow.

Apparently I was not fully awake, MIN/MAX will never overflow.
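The difference between SUM and MIN/MAX with respect to overflow can be shown with plain LINQ over an in-memory array (this is not Starcounter SQL, just an analogy; the names are invented for the example). MAX always returns one of the inputs, so it fits the input type, while SUM can leave it.

```csharp
using System;
using System.Linq;

public static class OverflowDemo
{
    public static readonly int[] Values = { int.MaxValue, 1 };

    // The maximum is always one of the input values, so it fits in Int32.
    public static int Max() => Values.Max();

    // Enumerable.Sum over int is range-checked and throws on overflow.
    public static bool SumOverflows()
    {
        try
        {
            Values.Sum();
            return false;
        }
        catch (OverflowException)
        {
            return true;
        }
    }

    // Widening each element to Int64 before summing avoids the overflow here.
    public static long WidenedSum() => Values.Sum(v => (long)v);
}
```

Here `Max()` is `int.MaxValue`, `SumOverflows()` is true, and `WidenedSum()` is 2147483648, which is the case for keeping a lifted result type for SUM but not for MIN/MAX.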

bigwad · 2018-05-03T21:40:42Z

Spent some time on this upon @un-tone's request to figure out the best way to refactor Anton's latest changes. My conclusion is simple: let's keep away from this, unless we have the guts to redesign half of the query processor, including the Prolog parser.

Why is that?
First, this PR solves only part of the typing problems, all of which have the same roots: incomplete type inference in Prolog and limited ability to process typed expressions in the C# part. Because of that, given a class such as:

    [Database]
    public class Bar
    {
        public float f { get; set; }
        public int i { get; set; }
        public Foo b { get; set; }
    }

The following queries will also return double instead of float, and they are not fixed by Anton's PR:
select MAX(b.b.f) from Bar b
select b.b.f from Bar b
select b.f from Bar b group by b.f having max(b.f)=1.0

Second, this PR has broken some queries that worked before:
select MAX(b.b) from Bar b group by i
select b.f from Bar b group by b.f having max(b.f)=1.0

So, ad hoc fixes don't work as they just add a mess to the query processor and miss the whole picture. And I'm sure I've missed a lot of corner cases myself.

un-tone · 2018-05-04T07:16:44Z

Thank you, @bigwad, for the investigation. Agree with your conclusion.

bigwad · 2018-05-04T08:12:45Z

We still shouldn't forget about the issue reported by @Mackiovello earlier in this thread: #209 (comment)

Though it is not about inappropriate type conversion: it's about us not respecting our own decimal limitations when performing calculations. I think we should handle it in a separate issue.

Also, timebomb icon can be removed I think, as the most severe issue (converting decimal to double) is not confirmed.

Also2, while working on this I found out we don't have tests for HAVING clause in SqlTest.

un-tone · 2018-05-04T17:59:23Z

> We still shouldn't forget about the issue reported by @Mackiovello earlier in this thread

https://github.com/Starcounter/level1/issues/4631

> we don't have tests for HAVING clause in SqlTest

https://github.com/Starcounter/level1/issues/4630

un-tone · 2018-07-12T16:08:55Z

@miyconst can we specify the status of the issue? Should we move it to another project board or a release? Maybe even close it?

miyconst · 2018-07-12T16:12:12Z

@un-tone I'm moving it to later, but let's keep it around for now.

miyconst assigned bigwad Aug 31, 2017

miyconst added bug db db-typesystem desired labels Aug 31, 2017

bigwad added _Old SQL and removed db db-typesystem labels Aug 31, 2017

bigwad assigned bigwad and miyconst and unassigned bigwad Aug 31, 2017

miyconst added qp qp-old and removed _Old SQL labels Aug 31, 2017

rogeralsing mentioned this issue Sep 3, 2017

Aggregates support Starcounter/Starcounter.Linq#3

Merged

miyconst changed the title ~~SQL: Do not lift types unless explicitly asked for~~ ⏰ SQL: Do not lift types unless explicitly asked for Oct 9, 2017

un-tone assigned un-tone and unassigned miyconst Mar 1, 2018

miyconst added the in progress label Mar 8, 2018

miyconst changed the title ~~⏰ SQL: Do not lift types unless explicitly asked for~~ SQL: Do not lift types unless explicitly asked for May 7, 2018

miyconst added later and removed in progress labels Jul 12, 2018

un-tone removed their assignment Apr 30, 2019
