Pythran version of scipy.optimize._group_columns #13336

serge-sans-paille · 2021-01-03T18:46:25Z

No description provided.

serge-sans-paille · 2021-01-06T07:10:16Z

@rgommers this one looks good. The CI issue seems unrelated. My local benchmarks give interesting speedup

$ python -m timeit -s 'from numpy import array; n = 200; m = n - 12; x = array(range(n)); y = array(range(12, 12 +n)); xy = array(range(n*n)).reshape((n,n)); from _group_columns import group_sparse as gs, group_dense as gd;' 'gd(n, m, xy)'

pythran: 10 loops, best of 3: 115 usec per loop
cython: 10 loops, best of 3: 79.2 msec per loop

serge-sans-paille · 2021-01-09T07:10:46Z

@rgommers gentle ping ;-)

rgommers · 2021-01-13T20:36:53Z

Thanks for the ping, and sorry for the delay @serge-sans-paille. I'm kind of distracted by a proposal deadline until the 19th.

That's a massive speedup, guess there's a serious problem in the Cython code somehow. The code looks like a correct line-by-line translation, it's not clear to me why there should be a ~700x performance difference.

rgommers · 2021-01-13T20:48:54Z

I can't reproduce that. I get the same result for Pythran:

>>> n = 200
>>> m = n - 12
>>> xy = np.arange(n**2).reshape((n, n))
>>> n = 200
>>> m = n - 12
>>> x = np.arange(200)
>>> y = np.arange(12, 12 + n)
>>> xy = np.arange(n**2).reshape((n, n))
>>> from scipy.optimize._group_columns import group_dense
>>> %timeit group_dense(n, m, xy)
115 µs ± 444 ns per loop (mean ± std. dev. of 7 runs, 10000 loops each)

Cython code gives an exception for that, it only works with int32 xy:

>>> n = 200
>>> m = n - 12
>>> x = np.arange(200)
>>> y = np.arange(12, 12 + n)
>>> xy = np.arange(n**2).reshape((n, n)).astype(np.int32)
>>> from scipy.optimize._group_columns import group_dense
>>> %timeit group_dense(n, m, xy)
135 µs ± 1.43 µs per loop (mean ± std. dev. of 7 runs, 10000 loops each)

Then if I change Pythran to int32:

>>> xy = np.arange(n**2).reshape((n, n)).astype(np.int32)
>>> %timeit group_dense(n, m, xy)
75.6 µs ± 429 ns per loop (mean ± std. dev. of 7 runs, 10000 loops each)

So the performance gain is close to 2x. Looks like you swapped the numbers and wrote msec instead of usec?

serge-sans-paille · 2021-01-15T12:49:35Z

Oh, got it, I was using Python Int in my setup, which probably caused my Cython measurement to be meaningless. I've updated the Pythran export line to only accept C int, but it fails on windows I need to have both intc and int. I guess a speedup of x2 is already good ;-)

serge-sans-paille · 2021-01-22T06:59:15Z

@rgommers is there anything I should do for that one?

rgommers

This LGTM. This function is used only in least_squares it looks like. I did some quick timings on the first example in its docstring (Rosenbrock function with trf method), and the speedup is ~8%.

In it goes, thanks @serge-sans-paille

serge-sans-paille force-pushed the feature/pythran-group branch 9 times, most recently from 2762c33 to afd55c0 Compare January 6, 2021 05:50

Pythran version of scipy.optimize._group_columns

96a8ded

serge-sans-paille force-pushed the feature/pythran-group branch from afd55c0 to 96a8ded Compare January 6, 2021 15:30

rgommers added enhancement A new feature or improvement scipy.optimize labels Jan 13, 2021

serge-sans-paille force-pushed the feature/pythran-group branch from 96a8ded to cedfeb5 Compare January 15, 2021 10:33

serge-sans-paille force-pushed the feature/pythran-group branch from cedfeb5 to 96a8ded Compare January 15, 2021 12:50

rgommers approved these changes Jan 23, 2021

View reviewed changes

rgommers merged commit 764f104 into scipy:master Jan 23, 2021

rgommers added this to the 1.7.0 milestone Jan 23, 2021

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Pythran version of scipy.optimize._group_columns #13336

Pythran version of scipy.optimize._group_columns #13336

serge-sans-paille commented Jan 3, 2021

serge-sans-paille commented Jan 6, 2021 •

edited

Loading

serge-sans-paille commented Jan 9, 2021

rgommers commented Jan 13, 2021

rgommers commented Jan 13, 2021

serge-sans-paille commented Jan 15, 2021

serge-sans-paille commented Jan 22, 2021

rgommers left a comment

Pythran version of scipy.optimize._group_columns #13336

Pythran version of scipy.optimize._group_columns #13336

Conversation

serge-sans-paille commented Jan 3, 2021

serge-sans-paille commented Jan 6, 2021 • edited Loading

serge-sans-paille commented Jan 9, 2021

rgommers commented Jan 13, 2021

rgommers commented Jan 13, 2021

serge-sans-paille commented Jan 15, 2021

serge-sans-paille commented Jan 22, 2021

rgommers left a comment

Choose a reason for hiding this comment

serge-sans-paille commented Jan 6, 2021 •

edited

Loading