Adopt changes from JNI for casting from float to decimal #10917

ttnghia · 2024-05-28T05:35:56Z

TBA

Depends on:

Implement kernel for casting float to decimal spark-rapids-jni#2078

Closes #9682, and closes #10809.

Signed-off-by: Nghia Truong <nghiat@nvidia.com>

thirtiseven · 2024-05-28T07:54:11Z

The results look really close.

The @approximate_float in integration test can even be removed under my tests.

CastOpSuite can pass with eps of 1e-13.

Related Spark UT still fails, also close:

- casting to fixed-precision decimals *** FAILED ***
  Incorrect evaluation: cast(10.03 as decimal(38,18)), actual: 10.030000000000002048, expected: 10.03 (RapidsTestsTrait.scala:350)

Performance test to cast 5000000 floats to 10 kinds of decimal types (in ms):

Type	CPU	24.08	#10909	This PR
Double	146,524	468.33	660.33	370.66
Float	82,691	412.33	480.67	275.33

ttnghia · 2024-05-28T17:16:12Z

Should that case be acceptable as the relative error is just 1.7e-16?
For 10.3, it is stored like 9.30000000000000071054 and it is very difficult to remove the trailing bonus digits if we are going to take 18 decimal digits.

ttnghia · 2024-05-29T01:03:53Z

I've just updated my JNI code in NVIDIA/spark-rapids-jni#2078, adding some utilities functions by @pmattione-nvidia and it seems to fix all of our failed tests. @thirtiseven please run tests again on your machine.

thirtiseven · 2024-05-29T03:11:21Z

It passed Spark UT, personally I think it is good enough for this issue.

If set eps = 0 in com.nvidia.spark.rapids.CastOpSuite#cast float/double to decimal, we can still see some mismatch:
Some cases in float test:

cpu: -6.963900469355742E15	gpu: -6.963900469355743E15
cpu: 4.005061702373036E19	gpu: 4.005061702373037E19
cpu: 7.1726783043136778E18	gpu: 7.1726783043136788E18
cpu: 4.9245108190671776E17	gpu: 4.9245108190671782E17
cpu: -6.6190078985307568E16	gpu: -6.6190078985307576E16
cpu: -8.286135559736182E15	gpu: -8.286135559736183E15
cpu: 9.999999933815812E18	gpu: 9.999999933815814E18
cpu: 9.2205076355788718E18	gpu: 9.2205076355788728E18
cpu: 1.00000004091847872E17	gpu: 1.00000004091847888E17

Only diffs in last few digits.

Some cases in double test:

cpu: -1.0E16	                gpu: -1.0000000000000002E16
cpu: -3.953408452257507E34	gpu: -3.9534084522575073E34
cpu: -1.0E16	                gpu: -1.0000000000000002E16
cpu: 7.1726781626632929E18	gpu: 7.172678162663294E18
cpu: 4.9245106955378758E17	gpu: 4.9245106955378765E17
cpu: 1.0E29                     gpu: 1.0000000000000001E29
cpu: -6.619008101723092E16	gpu: -6.6190081017230928E16
cpu: 9.999999999999999E29	gpu: 1.0E30
cpu: -6.200692612954865E31	gpu: -6.200692612954866E31
cpu: -8.286135303307476E15	gpu: -8.286135303307477E15
cpu: -5.75278800663036E23	gpu: -5.7527880066303604E23
cpu: -2.786658992657616E33	gpu: -2.7866589926576164E33
cpu: -4.857510460413498E34	gpu: -4.8575104604134985E34

There some cases like -1.0E16 vs -1.0000000000000002 and 9.999999999999999E29 vs 1.0E30, not sure if they can be fix easily.

New performance:

Type	CPU	24.08	#10909	This PR old	This PR new
Double	146,524	468.33	660.33	370.66	439
Float	82,691	412.33	480.67	275.33	378.33

thirtiseven · 2024-05-29T03:51:56Z

And again some thoughts about the float => string => decimal solution. In Spark/Java double is converted to string with as many, but only as many, more digits as are needed to uniquely distinguish the argument value from adjacent values of type double. doc. But the algorithm is complex and sometimes does not produce results only as many at low version jdk. So technically the best we can match now might be the only as many digits, which is implemented in jni ftos_converter.cuh. So we don't need to really convert it to string, and can get the digits in the string in int64. But it seems not to be easy to use in your approach.

ttnghia · 2024-05-29T22:06:43Z

My solution in NVIDIA/spark-rapids-jni#2078 is adopted from the ongoing work of @pmattione-nvidia. Paul is working on even a better solution, so hopefully it will be able to fix the failures above. Let's wait for it.

ttnghia added 3 commits May 27, 2024 08:37

Debugging

6076eb1

Signed-off-by: Nghia Truong <nghiat@nvidia.com>

Add test

d406ca2

Signed-off-by: Nghia Truong <nghiat@nvidia.com>

Adopt changes from JNI

d8f7eb7

Signed-off-by: Nghia Truong <nghiat@nvidia.com>

ttnghia added bug Something isn't working SQL part of the SQL/Dataframe plugin task Work required that improves the product but is not user facing labels May 28, 2024

ttnghia self-assigned this May 28, 2024

ttnghia mentioned this pull request May 28, 2024

[BUG] cast(9.95 as decimal(3,1)), actual: 9.9, expected: 10.0 #10809

Open

thirtiseven mentioned this pull request May 28, 2024

[WIP] Almost match Cast Floats to Decimal #10909

Closed

ttnghia changed the base branch from branch-24.06 to branch-24.08 May 29, 2024 22:07

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Adopt changes from JNI for casting from float to decimal #10917

Adopt changes from JNI for casting from float to decimal #10917

ttnghia commented May 28, 2024

thirtiseven commented May 28, 2024 •

edited

Loading

ttnghia commented May 28, 2024 •

edited

Loading

ttnghia commented May 29, 2024

thirtiseven commented May 29, 2024 •

edited

Loading

thirtiseven commented May 29, 2024

ttnghia commented May 29, 2024

Adopt changes from JNI for casting from float to decimal #10917

Are you sure you want to change the base?

Adopt changes from JNI for casting from float to decimal #10917

Conversation

ttnghia commented May 28, 2024

thirtiseven commented May 28, 2024 • edited Loading

ttnghia commented May 28, 2024 • edited Loading

ttnghia commented May 29, 2024

thirtiseven commented May 29, 2024 • edited Loading

thirtiseven commented May 29, 2024

ttnghia commented May 29, 2024

thirtiseven commented May 28, 2024 •

edited

Loading

ttnghia commented May 28, 2024 •

edited

Loading

thirtiseven commented May 29, 2024 •

edited

Loading