Inconsistent modulo #559

oprypin · 2015-04-20T10:20:40Z

>> (-7) % 5
# Crystal: -2
# Ruby: 3
>> (-7.0) % 5.0
# Crystal: undefined method '%' for Float64
# Ruby: 3.0
>> (-7).modulo(5)
# Crystal: -2
# Ruby: 3
>> (-7).remainder(5)
# Crystal: undefined method 'remainder' for Int32
# Ruby: -2
>> require "big_int"; (-7.to_big_i) % (5.to_big_i)
# Crystal: 3

I suggest leaving % as is for performance, but making modulo wrap around like Ruby (possible implementation: (a % b + b) % b).

Wouldn't be bad to add remainder which is the same as current %.

The text was updated successfully, but these errors were encountered:

jhass · 2015-04-20T10:22:59Z

We can have remainder, but % should be defined as modulo. That's what it is in other languages and having it behave differently is confusing.

See also http://llvm.org/docs/LangRef.html#srem-instruction

oprypin · 2015-04-20T10:24:43Z

I'm not sure what you mean.

C, Java, LLVM and many others give the same results as Crystal currently for %.
Ruby, Python do wraparound which is actually called "modulo"

jhass · 2015-04-20T10:26:33Z

I guess I still categorize Crystal more into the higher level languages that wrap around ;)

asterite · 2015-04-22T22:38:40Z

What should we do with this? I also found out that Ruby/Python behave different than C/D/Java/Crystal, I just don't know what's the correct thing to do. Maybe it's deciding what we want '%' to mean, either 'remainder' or 'modulo'?

jhass · 2015-04-22T22:40:59Z

As said I see Crystal closer to Ruby and Python, so having % do the modulo behavior would be IMO less confusing.

oprypin · 2015-04-22T22:46:46Z

The best meaning for % is with wraparound, but only if performance isn't taken into account. I'll stick with my previous suggestion, unless modulo with wraparound is less than 1.5 times slower than the current behavior.

asterite · 2015-04-22T22:51:00Z

You can always benchmark and see :-)

require "benchmark"

def modulo(a, b)
  (a % b + b) % b
end

FROM = -10_000
TO = 10_000

a1 = 0
a2 = 0

Benchmark.bm do |x|
  x.report("%") do
    a = 0
    (FROM..TO).each do |i|
      (FROM..TO).each do |j|
        next if j == 0
        a += i % j
      end
    end
    a1 = a
  end
  x.report("modulo") do
    a = 0
    (FROM..TO).each do |i|
      (FROM..TO).each do |j|
        next if j == 0
        a += modulo(i, j)
      end
    end
    a2 = a
  end
end

pp a1
pp a2

Gives:

             user     system      total        real
%        1.000000   0.000000   1.000000 (  1.005000)
modulo   1.800000   0.000000   1.800000 (  1.799956)

For what use one uses modulo with negative numbers?

jhass · 2015-04-22T23:00:03Z

Proving my bad math skills here, but did I get this right that modulo can be optimized to the remainder if the argument is positive? If so that seems to be worth the comparison here.

oprypin · 2015-04-22T23:00:35Z

I don't think this is the best possible implementation.

Modulo is used for cycling/wraparound like in https://github.com/BlaXpirit/crsfml/blob/master/examples/snakes.cr

Another example is keeping angles in 0...360 range. direction = (direction - turn_angle) % 360.0 and no nasty conditionals/loops.

jhass · 2015-04-22T23:06:58Z

Here's Ruby's http://rxr.whitequark.org/mri/source/numeric.c#2754

asterite · 2015-04-22T23:34:25Z

Just tried with this:

def modulo(a, b)
  if b == 0
    raise DivisionByZero.new
  elsif a > 0
    a.unsafe_mod(b)
  else
    (a.unsafe_mod(b) + b).unsafe_mod(b)
  end
end

and it's 1s vs 1.44s now. I have to try that Ruby implementation. But I think it would be OK to leave % as modulo and have a separate remainder method (like in Ruby).

asterite · 2015-04-23T13:12:27Z

Tried with this (seen here):

def modulo(a, b)
  if b == 0
    raise DivisionByZero.new
  elsif a > 0
    a.unsafe_mod(b)
  else
    a = a.unsafe_mod(b)
    a < 0 ? a + b : a
  end
end

and it's 1.008s for % vs 1.06s for modulo. I think we can use the "better" semantic at no cost :-)

ysbaddaden · 2015-04-23T13:40:02Z

👍

asterite · 2015-04-23T14:09:09Z

However:

# Ruby
13 % -4 #=> -3

# Crystal
modulo(13, -4) #=> 1

What's the reason for that?

This is starting to feel like a philosophical discussion :-)

ssvb · 2015-04-24T00:12:00Z

@asterite Maybe this variant then?

# 'a' and 'b' need to be two's complement signed integers of the same size
def modulo(a, b)
  if b == 0
    raise DivisionByZero.new
  elsif (a ^ b) >= 0
    a.unsafe_mod(b)
  else
    a = a.unsafe_mod(b)
    a == 0 ? a : a + b
  end
end

jhass · 2015-04-24T08:18:12Z

Here's Python's: https://github.com/python/cpython/blob/master/Objects/longobject.c#L2444

oprypin · 2015-04-24T08:48:20Z

@jhass That's basically BigInt. Nothing to see there 😐

Maybe this would be more relevant: https://github.com/python/cpython/blob/2.7/Objects/intobject.c#L583

ssvb · 2015-04-24T09:07:27Z

@BlaXpirit Thanks, this is an interesting link. They are also using xor to check if the operands have different sign, like in my modification of the @asterite's variant (which was just adapted to match Ruby's behaviour). Also cpython relies on the C language as a backend for implementing this operation and they have hints that there might be some portability issues between different platforms :-) But I don't think that this is something that we should worry about (as long as there is a good coverage for the modulo operation in the Crystal test suite). But I can run a quick test to check if this implementation works correctly on ARM hardware too.

ssvb · 2015-04-25T05:09:21Z

Here is the @asterite's benchmark program with a correctness test added:

require "benchmark"

# A slow reference implementation
def modulo_ref(a, b)
  a = a.to_i64
  b = b.to_i64
  (a % b + b) % b
end

# 'a' and 'b' need to be two's complement signed integers of the same size
def modulo(a, b)
  if b == 0
    raise DivisionByZero.new
  elsif (a ^ b) >= 0
    a.unsafe_mod(b)
  else
    a = a.unsafe_mod(b)
    a == 0 ? a : a + b
  end
end

# Correctness test

macro modulo_correctness_test_for_type(typename)
  ({{typename}}::MIN .. {{typename}}::MAX).each do |i|
    ({{typename}}::MIN .. {{typename}}::MAX).each do |j|
      next if j == 0
      next if i == {{typename}}::MIN && j == -1
      abort "problems with #{i} % #{j}" if modulo_ref(i, j) != modulo(i, j)
    end
  end
end

modulo_correctness_test_for_type(Int8)

# Benchmark

FROM = -10_000
TO = 10_000

a1 = 0
a2 = 0

Benchmark.bm do |x|
  x.report("%") do
    a = 0
    (FROM..TO).each do |i|
      (FROM..TO).each do |j|
        next if j == 0
        a += i % j
      end
    end
    a1 = a
  end
  x.report("modulo") do
    a = 0
    (FROM..TO).each do |i|
      (FROM..TO).each do |j|
        next if j == 0
        a += modulo(i, j)
      end
    end
    a2 = a
  end
end

pp a1
pp a2

The output on my system:

             user     system      total        real
%        0.010834   0.000000   0.010834 (  7.103003)
modulo   0.011841   0.000000   0.011841 (  7.768067)

A few notes:

The (a % b + b) % b is not really correct unless 'a' and 'b' are casted to a larger type. For example, a = -127_i8 ; b = -128_i8 ; p (a % b + b) % b prints 1 instead of -127
There is a signed integer division overflow problem to watch out, see Signed integer division overflow #575

ssvb · 2015-04-25T05:11:16Z

And most importantly, the casted to a larger type (a % b + b) % b really works in Crystal in the same way as a % b in Ruby, as can be tested with the help of the following two programs:

results_gen.rb

FROM = -1000
TO   = 1000

(FROM..TO).each do |i|
  (FROM..TO).each do |j|
    next if j == 0
    puts "#{i} #{j} #{i % j}"
  end
end

results_check.cr

STDIN.each_line do |line|
  a, b, ref = line.split.map { |x| x.to_i }
  abort "We have a problem with #{a} % #{b}" if (a % b + b) % b != ref
end

And running them as

$ ruby results_gen.rb | crystal results_check.cr

asterite · 2015-04-30T00:00:21Z

If you find anything wrong in the implementation or something to improve, please continue discussing here or send a pull request :-)

ssvb mentioned this issue Apr 24, 2015

The result of comparing signed and unsigned variables of the same size is inconsistent #571

Closed

asterite closed this as completed in 86a7fae Apr 29, 2015

jhass mentioned this issue Jun 7, 2015

Adds modulo support to Float #749

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Inconsistent modulo #559

Inconsistent modulo #559

oprypin commented Apr 20, 2015

jhass commented Apr 20, 2015

oprypin commented Apr 20, 2015

jhass commented Apr 20, 2015

asterite commented Apr 22, 2015

jhass commented Apr 22, 2015

oprypin commented Apr 22, 2015

asterite commented Apr 22, 2015

jhass commented Apr 22, 2015

oprypin commented Apr 22, 2015

jhass commented Apr 22, 2015

asterite commented Apr 22, 2015

asterite commented Apr 23, 2015

ysbaddaden commented Apr 23, 2015

asterite commented Apr 23, 2015

ssvb commented Apr 24, 2015

jhass commented Apr 24, 2015

oprypin commented Apr 24, 2015

ssvb commented Apr 24, 2015

ssvb commented Apr 25, 2015

ssvb commented Apr 25, 2015

asterite commented Apr 30, 2015

Inconsistent modulo #559

Inconsistent modulo #559

Comments

oprypin commented Apr 20, 2015

jhass commented Apr 20, 2015

oprypin commented Apr 20, 2015

jhass commented Apr 20, 2015

asterite commented Apr 22, 2015

jhass commented Apr 22, 2015

oprypin commented Apr 22, 2015

asterite commented Apr 22, 2015

jhass commented Apr 22, 2015

oprypin commented Apr 22, 2015

jhass commented Apr 22, 2015

asterite commented Apr 22, 2015

asterite commented Apr 23, 2015

ysbaddaden commented Apr 23, 2015

asterite commented Apr 23, 2015

ssvb commented Apr 24, 2015

jhass commented Apr 24, 2015

oprypin commented Apr 24, 2015

ssvb commented Apr 24, 2015

ssvb commented Apr 25, 2015

ssvb commented Apr 25, 2015

asterite commented Apr 30, 2015