Release the GIL whenever possible #1512

JDWarner · 2015-05-14T22:47:01Z

From discussion in #1493, @mrocklin pointed out the many benefits of releasing the GIL for parallelization/serialization.

For many of our Cython files and functions this would be as simple as wrapping contents in a with nogil: block

with nogil:
    # function contents here

Opening as a separate issue to facilitate discussion.

The text was updated successfully, but these errors were encountered:

mrocklin · 2015-05-15T01:03:35Z

In original experiments by @cowlicks we were only able to get a 30% increase when using canny edge detection in multiple threads. I/O wasn't an issue so we assumed that the GIL was to blame.

A case study of removing the GIL and using apply_chunks from #1493 on a single algorithm might teach us something. We're happy to support from the dask side.

stefanv · 2015-05-15T05:12:27Z

Yes, please--any energy going to this is good, because we have been very bad on this front.

tacaswell · 2015-05-15T22:19:32Z

Commenting so I get emails from this thread, sorry for the noise.

stefanv · 2015-05-20T00:04:00Z

Is there anyone who has Airspeed Velocity set up? It would be quite useful at this point.

https://github.com/spacetelescope/asv/

@yarikoptic, any advice?

jni · 2015-05-20T00:39:02Z

@stefanv using asv reliably needs a dedicated server, in my understanding, so that the benchmarks are comparable across timepoints. (I have not yet tried this myself.)

I met someone at MPUG (Melbourne Python Users Group) who has been playing around with asv. I've pinged him over email because I don't know his Github handle. He might be able to contribute here!

stefanv · 2015-05-20T01:21:47Z

Perhaps the astropy team would be willing to help, @cdeil?

koenvb · 2015-05-20T02:36:17Z

Happy to contribute here. I played around a bit with asv but as noted in the talk from scipy 2014 the benchmarks are tied to the particular machine and software. Next is the benchmarks you want to run. This is the repo with benchmarks from astropy using asv. The setup is straightforward and it just comes down to defining which benchmarks and which test matrix you want.

Seems scikit-bio wants to start using asv as well.
Maybe @anderspitman can shed some light on this.

jni · 2015-05-20T05:45:29Z

@koenvb how about you fork scikit-image and define a couple of benchmarks? The gallery is a good place to start.

anderspitman · 2015-05-20T06:21:39Z

@koenvb happy to share our experience setting up asv if you have any questions. It's a great project and pretty straight forward to set up, but there are definitely some subtleties, and the operation can be a bit opaque and difficult to debug when something goes wrong.

cdeil · 2015-05-20T07:20:58Z

I've only played around with asv. For Astropy it's @mdboom and @astrofrog that have the most experience with asv and I think are running cron jobs to continuously run asv.

Yes, you need to find a dedicated machine to run the benchmarks. But that can come later, getting familiar with asv and implementing a useful set of benchmarks can come first.

You also need to discuss a bit about what kind of benchmarks you want and how they can be helpful to you without taking too much time to implement and run and then review and discuss the results. The main aspect of asv is that it runs the same benchmark for many commits in your repo, so you'll mainly learn something useful if there's a change in performance, i.e. a regression or improvement.

sciunto · 2015-05-20T11:59:44Z

Probably most of you are familiar with the GIL, but I felt the need to learn more about it. I found this article well done, with an enlightening video. http://lbolla.info/blog/2013/12/23/python-threads-cython-gil

JDWarner · 2015-05-21T20:51:53Z

Closed by #1519

mrocklin · 2015-05-21T20:54:29Z

Well that was fast!

JDWarner closed this as completed May 21, 2015

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Release the GIL whenever possible #1512

Release the GIL whenever possible #1512

JDWarner commented May 14, 2015

mrocklin commented May 15, 2015

stefanv commented May 15, 2015

tacaswell commented May 15, 2015

stefanv commented May 20, 2015

jni commented May 20, 2015

stefanv commented May 20, 2015

koenvb commented May 20, 2015

jni commented May 20, 2015

anderspitman commented May 20, 2015

cdeil commented May 20, 2015

sciunto commented May 20, 2015

JDWarner commented May 21, 2015

mrocklin commented May 21, 2015

Release the GIL whenever possible #1512

Release the GIL whenever possible #1512

Comments

JDWarner commented May 14, 2015

mrocklin commented May 15, 2015

stefanv commented May 15, 2015

tacaswell commented May 15, 2015

stefanv commented May 20, 2015

jni commented May 20, 2015

stefanv commented May 20, 2015

koenvb commented May 20, 2015

jni commented May 20, 2015

anderspitman commented May 20, 2015

cdeil commented May 20, 2015

sciunto commented May 20, 2015

JDWarner commented May 21, 2015

mrocklin commented May 21, 2015