Make list_cy more idiomatic #3

boothby · 2021-04-19T17:25:39Z

This is a more idiomatic implementation of list_cy that avoids several performance pitfalls. On my machine, it is about 2x faster. This PR is incompatible with #2; maybe they should be separate tests.

00sapo · 2021-04-19T17:50:46Z

I had already tried using cdef list internal_list and list in the signature of make_list, but it slowed down the code for me... That's the point: Cython is completely unpredictable.

bconstanzo · 2021-04-19T18:15:19Z

Chiming in here: I tried basically the same optimization/cleanup that @boothby did, and on my machine I got almost an order of magnitude speedup (roughly 7x):

In [9]: %timeit list_cy.iterate_list(a_list)
1000000.0007792843
1000000.0007792843
1000000.0007792843
1000000.0007792843
1000000.0007792843
1000000.0007792843
1000000.0007792843
1000000.0007792843
385 ms ± 6.15 ms per loop (mean ± std. dev. of 7 runs, 1 loop each)

In [10]: %timeit list_cyo.iterate_list(a_list)
1000000.0007792843
1000000.0007792843
1000000.0007792843
1000000.0007792843
1000000.0007792843
1000000.0007792843
1000000.0007792843
1000000.0007792843
2.71 s ± 182 ms per loop (mean ± std. dev. of 7 runs, 1 loop each)

Here list_cy is the optimized module and list_cyo is the code currently in the repo. Please double-check this; it certainly improves the Cython timings.
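For readers who want the ratio spelled out, here is a minimal sketch of the arithmetic behind the two mean timings quoted in this comment. The variable names are illustrative only; the numbers are copied from the %timeit output.

```python
# Mean timings copied from the %timeit output in this comment, in seconds.
optimized_s = 0.385  # list_cy.iterate_list (optimized module)
original_s = 2.71    # list_cyo.iterate_list (code in the repo at the time)

# Ratio of the means; this ignores the reported standard deviations.
speedup = original_s / optimized_s
print(f"speedup: {speedup:.1f}x")
```

This compares means only, so on another machine the ratio may well differ, as the rest of the thread suggests.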

mcabbott · 2021-04-19T18:22:11Z

FWIW, the equivalent change to the Julia version would also be more idiomatic, and much quicker:

make_list() = [fill(0.1, 10^4) for _ in 1:10^4];

Similar changes could also be made to iterate_list, but almost all the time is spent in make_list.

boothby · 2021-04-19T18:26:04Z

I noticed that the majority of the time was spent in make_list when I did the implementation for #2, too. @00sapo it might be interesting to time the make_list and iterate_list calls separately.
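One way to time the two calls separately is sketched below in plain Python with the standard timeit module. It is not the benchmark from the repo: the function bodies are pure-Python stand-ins inferred from the diff in this PR, and the size n = 300 is an assumption chosen so the example runs quickly (the thread uses 10**4).

```python
import timeit


def make_list(n):
    """Build an n x n nested list of 0.01 (stand-in for the PR's make_list)."""
    return [[0.01] * n for _ in range(n)]


def iterate_list(a_list):
    """Sum every element of the nested list (stand-in for iterate_list)."""
    count = 0.0
    for internal_list in a_list:
        for d in internal_list:
            count += d
    return count


n = 300  # assumption: far smaller than the 10**4 used in the thread
repeats = 5

# Time construction on its own.
t_make = timeit.timeit(lambda: make_list(n), number=repeats)

# Build the list once outside the timed region, then time iteration alone.
a_list = make_list(n)
t_iter = timeit.timeit(lambda: iterate_list(a_list), number=repeats)

print(f"make_list:    {t_make:.4f} s for {repeats} calls")
print(f"iterate_list: {t_iter:.4f} s for {repeats} calls")
```

Building a_list outside the timed lambda keeps construction cost out of the iteration measurement, which is the separation suggested above.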

tuffnatty · 2021-04-19T18:28:06Z

list_cy.pyx

-            count += internal_list[j]
+cpdef float iterate_list(list a_list):
+    cdef double count = 0, d
+    cdef int i


i is unused here?

tuffnatty · 2021-04-19T18:29:02Z

list_cy.pyx

-            new_list.append(0.01)
-        a_list.append(new_list)
-    return a_list
+    return [[0.01]*(10**4) for _ in range(10**4)]


i, j are also unused here
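As a sanity check on the change in this hunk, the sketch below compares a nested-append version with the comprehension in plain Python. The loop structure of make_list_loop is an assumption reconstructed from the three removed lines shown in the diff; the small n is only for illustration.

```python
def make_list_loop(n):
    """Nested-append version; loop structure assumed from the removed diff lines."""
    a_list = []
    for _row in range(n):
        new_list = []
        for _col in range(n):
            new_list.append(0.01)
        a_list.append(new_list)
    return a_list


def make_list_comprehension(n):
    """Comprehension version, as in the added diff line."""
    return [[0.01] * n for _ in range(n)]


n = 4  # illustrative size; the PR uses 10**4
looped = make_list_loop(n)
built = make_list_comprehension(n)

# Both versions produce equal nested lists.
same_contents = looped == built

# The comprehension evaluates [0.01] * n once per row, so rows are distinct objects.
rows_are_distinct = built[0] is not built[1]

# By contrast, multiplying the outer list repeats a reference to one inner list.
aliased = [[0.01] * n] * n
rows_are_aliased = aliased[0] is aliased[1]

print(same_contents, rows_are_distinct, rows_are_aliased)
```

The comprehension keeps one inner list per row, so mutating built[0] leaves the other rows untouched; the shorter [[0.01] * n] * n form would not preserve that.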

made list_cy more idiomatic

aef824a

tuffnatty reviewed Apr 19, 2021

00sapo merged commit 0e413e4 into 00sapo:main May 22, 2021
