ARROW-4206: [Gandiva] support decimal divide and mod #3813

pravindra · 2019-03-05T12:09:51Z

No description provided.

fsaintjacques · 2019-03-05T14:48:56Z

@pravindra, add me as a reviewer, @pitrou is not available.

wesm · 2019-03-05T15:49:47Z

the "review requested" feature is only informational, I think @pravindra is one of the only contributors who uses it regularly =)

pravindra · 2019-03-06T05:52:28Z

@pravindra, add me as a reviewer, @pitrou is not available.

sorry I'm not able to add you as a reviewer. but, please do review.

praveenbingo

+1 LGTM.

fsaintjacques · 2019-03-07T19:03:26Z

cpp/src/gandiva/decimal_xlarge.cc

+  DCHECK_LE(scale, kMaxScaleMultiplier);
+
+  // Compute the scale multipliers once.
+  static std::array<int256_t, kMaxScaleMultiplier + 1> multipliers =


This is not thread safe.

The spec says it's thread-safe https://en.cppreference.com/w/cpp/language/storage_duration#Static_local_variables

Is there an exception for lambda ?

I've made this a global variable now.

There doesn't seem to be exception for lamda, I didn't know that local static storage was thread safe in C++11, wasn't in C11 and previous C++ versions. I'd say you can keep the previous version if you prefer.

fsaintjacques · 2019-03-07T19:04:31Z

cpp/src/gandiva/decimal_xlarge.cc

+      ([]() -> std::array<int256_t, kMaxScaleMultiplier + 1> {
+        std::array<int256_t, kMaxScaleMultiplier + 1> values;
+        values[0] = 1;
+        for (auto idx = 1; idx <= kMaxScaleMultiplier; idx++) {


Don't use auto on primitive types.

curious, why the restriction ? is this from the coding standard or for efficiency ?

Mostly for readability (and future refactoring), and making sure there's no signedness/width issues, though the compiler would help with the warnings.

fsaintjacques · 2019-03-07T19:05:40Z

cpp/src/arrow/util/basic_decimal.h

@@ -138,6 +138,9 @@ class ARROW_EXPORT BasicDecimal128 {
  /// - If 'round' is false, the right-most digits are simply dropped.
  BasicDecimal128 ReduceScaleBy(int32_t reduce_by, bool round = true) const;

+  // returns 1 for positive and zero decimal values, -1 for negative decimal values.
+  int64_t Sign() const;


small enough to make it inline.

fsaintjacques · 2019-03-07T19:17:16Z

cpp/src/arrow/util/decimal-test.cc

@@ -482,6 +482,49 @@ TEST(Decimal128Test, Multiply) {
  ASSERT_EQ(result.ToIntegerString(), "60501");
 }

+TEST(Decimal128Test, Divide) {
+  Decimal128 result;


This is valid for the next 3 tests:

Why the usage of a temporary variables, e.g. ASSERT_EQ(Decimal128("20100") / Decimal128("301"), ...)?

Why the usage of ToIntegerString, just compare Decimal("66")?

Would it be better to explicit some int64/int128 constructors such that you can at least call with Decimal(66)?

The end would look like

ASSERT_EQ(Decimal(20100) / Decimal128(301), Decimal128(66));

Please do some minimal property testing against known types, e.g.

for (x: rand_int32_range()) for (y: rand_int32_range()) ASSERT_EQ(Decimal(x) / Decimal128(y), Decimal128(x/y));

I usually try to test all obvious edge cases and some close bounds, e.g.
{INT_MIN, 0, INT_MAX} x {-2,-1,0,1,2}

The int64/int32 constructors already exist. I've added tests and fixed as suggested.

fsaintjacques · 2019-03-07T19:29:39Z

cpp/src/gandiva/decimal_xlarge.cc

+  int256_t result_large = x_large_scaled_up / y_large;
+  int256_t remainder_large = x_large_scaled_up % y_large;
+
+  if (abs(2 * remainder_large) >= abs(y_large)) {


I don't follow, please comment.

added comment

fsaintjacques · 2019-03-07T19:30:14Z

cpp/src/gandiva/precompiled/decimal_ops.cc

+  }
+
+  // scale upto the output scale, and do an integer division.
+  auto delta_scale = out_scale + y.scale() - x.scale();


Don't use auto on primitive types.

fsaintjacques · 2019-03-07T19:34:58Z

cpp/src/gandiva/precompiled/decimal_ops_test.cc

+  decimalops::Divide(reinterpret_cast<int64>(&context), DecimalScalar128{"201", 20, 3},
+                     DecimalScalar128{"1", 20, 2}, result_precision, result_scale,
+                     &overflow);
+  EXPECT_EQ(context.has_error(), false);


EXPECT_TRUE and EXPECT_FALSE exists for this purpose.

fsaintjacques · 2019-03-11T14:32:58Z

One last issue with the newer changes, why the usage of str() if it's the lack of int128_t constructor, this is a good time to add one, e.g.

Decimal128 result = Decimal128(x.str()) * Decimal128(y.str());

pravindra · 2019-03-12T07:41:11Z

One last issue with the newer changes, why the usage of str() if it's the lack of int128_t constructor, this is a good time to add one, e.g.
Decimal128 result = Decimal128(x.str()) * Decimal128(y.str());

Yeah - it's due to lack of int128_t constructor. I don't want to add boost dependency in the decimal header files since it'll spill over to the IR code, and glib. Since this is required only for testing, I'd like to leave this as now. Can revisit later if we have other uses of this.

I believe the gcc's int128_t doesn't work with windows.

ARROW-4206: [Gandiva] support decimal divide and mod

267f117

pravindra requested a review from pitrou March 5, 2019 12:09

ARROW-4206: [Gandiva] Fix build errors

697c234

praveenbingo approved these changes Mar 6, 2019

View reviewed changes

fsaintjacques reviewed Mar 7, 2019

View reviewed changes

ARROW-4206: [Gandiva] Add more tests/comments

a9ad13f

ARROW-4206: [Gandiva] add global symbol for new fns

96ef405

pravindra closed this in 31aa19d Mar 14, 2019

asfimport mentioned this pull request Mar 14, 2019

[Gandiva] Implement decimal divide #20786

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

ARROW-4206: [Gandiva] support decimal divide and mod #3813

ARROW-4206: [Gandiva] support decimal divide and mod #3813

pravindra commented Mar 5, 2019

fsaintjacques commented Mar 5, 2019

wesm commented Mar 5, 2019

pravindra commented Mar 6, 2019

praveenbingo left a comment

fsaintjacques Mar 7, 2019

pravindra Mar 11, 2019

pravindra Mar 11, 2019

fsaintjacques Mar 11, 2019 •

edited

fsaintjacques Mar 7, 2019

pravindra Mar 11, 2019

pravindra Mar 11, 2019

fsaintjacques Mar 11, 2019 •

edited

fsaintjacques Mar 7, 2019

pravindra Mar 11, 2019

fsaintjacques Mar 7, 2019

pravindra Mar 11, 2019

fsaintjacques Mar 7, 2019

pravindra Mar 11, 2019

fsaintjacques Mar 7, 2019

pravindra Mar 11, 2019

fsaintjacques Mar 7, 2019

pravindra Mar 11, 2019

fsaintjacques commented Mar 11, 2019

pravindra commented Mar 12, 2019

ARROW-4206: [Gandiva] support decimal divide and mod #3813

ARROW-4206: [Gandiva] support decimal divide and mod #3813

Conversation

pravindra commented Mar 5, 2019

fsaintjacques commented Mar 5, 2019

wesm commented Mar 5, 2019

pravindra commented Mar 6, 2019

praveenbingo left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

fsaintjacques Mar 11, 2019 • edited

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

fsaintjacques Mar 11, 2019 • edited

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

fsaintjacques commented Mar 11, 2019

pravindra commented Mar 12, 2019

fsaintjacques Mar 11, 2019 •

edited

fsaintjacques Mar 11, 2019 •

edited