ARROW-14518: [Ruby] Add support for Arrow::Array.new([BigDecimal]) #13377

kou · 2022-06-14T07:57:20Z

This requires bigdecimal 3.1.0 or later for BigDecimal#scale.

Arrow::Array.new([BigDecimal]) detects the max precision and scale
from BigDecimals and creates suitable Arrow::Decimal{128,256}DataType
automatically.
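The detection step can be sketched in plain Ruby, without red-arrow, so that it runs on its own. This is only an illustration: the helper name `detect_decimal_type_info` and the hash it returns are hypothetical, and only the two `max` expressions mirror the lines in the diff. It calls BigDecimal#precision and BigDecimal#scale, so it assumes bigdecimal 3.1.0 or later.

```ruby
require "bigdecimal"

# Hypothetical helper, not part of red-arrow: walks the values and keeps
# the largest precision and the largest scale seen so far.
def detect_decimal_type_info(values)
  builder_info = {}
  values.each do |value|
    precision = [builder_info[:precision] || 0, value.precision].max
    scale = [builder_info[:scale] || 0, value.scale].max
    builder_info = {precision: precision, scale: scale}
  end
  builder_info
end

info = detect_decimal_type_info([BigDecimal("1.1"), BigDecimal("11.11")])
p info
```

For these two values the sketch ends up with precision 4 and scale 2, which are the numbers a decimal data type would need to hold both of them.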

This also truncates a given BigDecimal when the specified
Arrow::Decimal{128,256}DataType doesn't have enough scale. This
still doesn't check precision. If a user specifies data that have too
much precision, the data are used as-is.
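A rough sketch of the truncation behavior described above, again in plain Ruby. The helper `fit_to_scale` is hypothetical and uses BigDecimal#truncate as a stand-in; the PR text does not say which call the builder actually uses. Note that only the scale is adjusted. The number of integer digits is left alone, which matches the statement that precision is not checked.

```ruby
require "bigdecimal"

# Hypothetical helper: drops fractional digits beyond the target scale.
# Values that already fit are returned unchanged in value.
def fit_to_scale(value, scale)
  value.truncate(scale)
end

# Fractional digits beyond scale 2 are cut off (toward zero, no rounding).
truncated = fit_to_scale(BigDecimal("3.14159"), 2)

# Integer digits are not touched, even if the target type's precision
# would be too small for them.
too_precise = fit_to_scale(BigDecimal("12345.678"), 2)

puts truncated.to_s("F")
puts too_precise.to_s("F")
```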


github-actions · 2022-06-14T08:07:36Z

https://issues.apache.org/jira/browse/ARROW-14518

github-actions · 2022-06-14T08:07:38Z

⚠️ Ticket has not been started in JIRA, please click 'Start Progress'.

mrkn

@kou I left some comments.

mrkn · 2022-06-15T15:43:44Z

ruby/red-arrow/lib/arrow/array-builder.rb

+            precision = [builder_info[:precision] || 0, value.precision].max
+            scale = [builder_info[:scale] || 0, value.scale].max


These 2 lines don't permit overwriting the value's precision and the scale with smaller values by specifying the precision and the scale in builder_info. Is it intentional?

Could you provide an example case?
Arrow::Array.new([BigDecimal("1.1"), BigDecimal("11.11")])?

I misread the code. It's OK now. But I guess we may need to permit passing precision and scale to Arrow::Array.new as optional arguments, like Arrow::Array.new([BigDecimal("1.1"), BigDecimal("11.11")], scale: 1).

If users know the expected data type, they should use Arrow::XXXArray.new instead of Arrow::Array.new, such as Arrow::Decimal128Array.new({scale: ..., precision: ...}, [BigDecimal(...), ...]).

Arrow::Array.new may not build an Arrow::Decimal128Array. In that case, the given optional arguments are just ignored, which may confuse users.

mrkn · 2022-06-15T15:48:53Z

ruby/red-arrow/test/test-array-builder.rb

+                       values: array.to_a,
+                     })
+      end
+


Is it unnecessary to test with BigDecimal::NAN and BigDecimal::INFINITY?

Oh, I didn't notice them.
What is the expected behavior with them? It seems that decimal classes in C++ can't represent NAN and INFINITY.

Throwing FloatDomainError?

We should build an Apache Arrow array with Arrow::ArrayBuilder.build instead of raising an exception. I'll use Arrow::StringArray for the case.

OK, it makes sense.

I've added a validation for BigDecimal::NAN and BigDecimal::INFINITY to Arrow::Decimal{128,256}ArrayBuilder. If users append BigDecimal::NAN or BigDecimal::INFINITY, FloatDomainError is raised.
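The validation described above could look roughly like the following. This is a standalone sketch rather than the code added to Arrow::Decimal{128,256}ArrayBuilder: the method name `validate_decimal_value` and the error message are hypothetical. It relies only on BigDecimal#finite? and the core FloatDomainError class.

```ruby
require "bigdecimal"

# Hypothetical stand-in for the builder-side check: decimal arrays cannot
# represent NaN or infinity, so such values are rejected up front.
def validate_decimal_value(value)
  unless value.finite?
    raise FloatDomainError, "can't store #{value} in a decimal array"
  end
  value
end

validate_decimal_value(BigDecimal("1.5")) # passes through unchanged

begin
  validate_decimal_value(BigDecimal::NAN)
rescue FloatDomainError => error
  puts "rejected: #{error.message}"
end
```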

kou · 2022-06-22T23:55:05Z

+1

…pache#13377)

This requires bigdecimal 3.1.0 or later for BigDecimal#scale.

Arrow::Array.new([BigDecimal]) detects the max precision and scale from BigDecimals and creates suitable Arrow::Decimal{128,256}DataType automatically.

This also truncates a given BigDecimal when the specified Arrow::Decimal{128,256}DataType doesn't have enough scale. This still doesn't check precision. If a user specifies data that have too much precision, the data are used as-is.

Authored-by: Sutou Kouhei <kou@clear-code.com>
Signed-off-by: Sutou Kouhei <kou@clear-code.com>

kou requested a review from mrkn June 14, 2022 07:57

github-actions bot added the Component: Ruby label Jun 14, 2022

mrkn reviewed Jun 15, 2022


Check BigDecimal::NAN and BigDecimal::INFINITY

2489615

kou merged commit 6fd4d34 into apache:master Jun 22, 2022

kou deleted the ruby-array-builder-decimal branch June 22, 2022 23:58


