MAINT: avoid memcpy when i == j #8921

rainwoodman · 2017-04-10T17:42:51Z

Valgrind complains about memcpy with overlapping address in mtrand.c
It happens when i == j in this loop.

Closer inspection the i == j iteration is not needed (it is a swap).
So, skip it and avoid depending on undefined behavior of memcpy.

related read:

https://sourceware.org/bugzilla/show_bug.cgi?id=12518

Valgrind complains about memcpy with overlapping address in mtrand.c It happens when i == j in this loop. Closer inspection the i == j iteration is not needed (it is a swap). So, skip it and avoid depending on undefined behavior of memcpy. related read: https://sourceware.org/bugzilla/show_bug.cgi?id=12518

juliantaylor · 2017-04-10T18:12:35Z

numpy/random/mtrand/mtrand.pyx

@@ -4828,6 +4828,7 @@ cdef class RandomState:
        cdef npy_intp i, j
        for i in reversed(range(1, n)):
            j = rk_interval(i, self.internal_state)
+            if i == j : continue # i == j is not needed and memcpy is undefined.


just changing the rk_interval i to i -1 should be enough

no that is wrong, disregard

That would work, but would change the behaviour of a given random seed, right?

worse it would mean a datapoint cannot stay in the same position after a shuffle which biases the result

juliantaylor · 2017-04-10T18:17:02Z

Full overlap memcpy is actually fine.
I am a bit concerned if this impacts performance negatively.
If it does one could put a compile time constant size check before the condition.

rainwoodman · 2017-04-10T18:44:34Z

small memcpy is also 'actually' fine (see the bz discussion), but since the spec says overlapped areas is undefined I think avoiding overlapping memcpy is the safest/portable approach; also rk_internal and the loop looked sufficiently complicated that adding an if shouldn't hurt the performance in any meaningful way?

juliantaylor · 2017-04-10T19:00:33Z

I don't really like changing the generated code to silence a non-issue, too bad we can't use memmove as older gccs won't inline that.
But then it will probably cost more time going over this the next time than the combined cost of the few extra cycles.

juliantaylor · 2017-04-10T19:07:09Z

uhm dumb question to ask after pressing the button, but where exactly is the overlap in this code? memcpy is done into a temporary buffer here.
what code causes the valgrind error?

juliantaylor · 2017-04-10T19:09:31Z

nevermind I see it ... my valgrind version just does not care about full overlap anymore.
I should just go to bed ._.

juliantaylor reviewed Apr 10, 2017

View reviewed changes

juliantaylor merged commit c0ab544 into numpy:master Apr 10, 2017

rainwoodman deleted the memcpy branch April 11, 2017 00:39

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

MAINT: avoid memcpy when i == j #8921

MAINT: avoid memcpy when i == j #8921

rainwoodman commented Apr 10, 2017

juliantaylor Apr 10, 2017

juliantaylor Apr 10, 2017

eric-wieser Apr 10, 2017

juliantaylor Apr 10, 2017

juliantaylor commented Apr 10, 2017

rainwoodman commented Apr 10, 2017

juliantaylor commented Apr 10, 2017

juliantaylor commented Apr 10, 2017

juliantaylor commented Apr 10, 2017

MAINT: avoid memcpy when i == j #8921

MAINT: avoid memcpy when i == j #8921

Conversation

rainwoodman commented Apr 10, 2017

juliantaylor Apr 10, 2017

Choose a reason for hiding this comment

juliantaylor Apr 10, 2017

Choose a reason for hiding this comment

eric-wieser Apr 10, 2017

Choose a reason for hiding this comment

juliantaylor Apr 10, 2017

Choose a reason for hiding this comment

juliantaylor commented Apr 10, 2017

rainwoodman commented Apr 10, 2017

juliantaylor commented Apr 10, 2017

juliantaylor commented Apr 10, 2017

juliantaylor commented Apr 10, 2017