zero-copy support in cffi backend #1365

mattip · 2020-02-25T14:10:07Z

When trying out PyPy2 with spyder via the spyder-kernels package (which is extends the ipython protocol?), I discovered that CFFI-based Frame was not working correctly: it did not support the buffer protocol. PyPy extends the ability of pure python classes to support the buffer protocol. If you inherit from the __pypy__.bufferable.bufferable class and then instaniate an instance of your class b, callng memoryview(b) will call self.__buffer__(flags) and give you a chance to return a buffer.

I implemented this, but it seems there is some code in the cython backend to release the buffer via a gc callback? In any case, I unskipped some of the tests.

Does this look OK? The only tests that are failing are the ones that check tracker.done, which makes me think maybe this is leaking memory.

minrk · 2020-02-26T12:42:36Z

This is great! Supporting the buffer interface isn't enough for the cffi backend to support zero-copy sends, though, which is what the done tests cover. This is still an improvement, so if you put back the skips on the two tracker tests, I think this is great.

In the pypy2 tests, it looks like numpy.frombuffer doesn't work with buffer-interface-providers:

B = numpy.frombuffer(msg, A.dtype).reshape(A.shape)
TypeError: expected string or Unicode object, Frame found

maybe need a version check or direct check for the presence of this bufferable type?

mattip · 2020-02-27T16:47:59Z

In the pypy2 tests, it looks like numpy.frombuffer doesn't work with buffer-interface-providers

There was a bug in PyPy, the fix should appear in the pypy2 nightly build tomorrow.

mattip · 2020-02-27T16:48:30Z

Supporting the buffer interface isn't enough for the cffi backend to support zero-copy sends

Is there something I can do to get this going?

minrk · 2020-03-03T14:01:05Z

Is there something I can do to get this going?

You could give it a go. To implement zero-copy, Frame needs to match the Cython version more, which you can use as reference.

Mainly:

send/Frame on a buffer with copy=False should not copy memory with zmq_msg_init_size, instead it should create a zmq_msg_t with zmq_msg_init_data referencing that memory and an associated GIL-less zmq_free_fn.
The zmq_free_fn sends a message to the garbage collector (which is in Python and thus shared between backends).

mattip · 2020-03-30T11:52:07Z

in a322aa0 I

created a separate build step for the cffi c-extension module
changed the cffi interfaces to use that module
copied the copy=False behaviour from message.pyx to message.py

Now test_multi_tracker in test_message.py hangs. How do I instrument the callbacks so I can see what is not being called? I don't see the fprintf output from the call to free_python_msg

mattip · 2020-03-30T12:30:33Z

setup.py


 # whether any kind of bdist is happening
 # do this before importing anything from distutils
 doing_bdist = any(arg.startswith('bdist') for arg in sys.argv[1:])

-if any(bdist in sys.argv for bdist in ['sdist', 'bdist_wheel', 'bdist_egg']):
-    import setuptools
+from setuptools import setup


any supported python should be able to use setuptools, and I think is to be preferred over distutils where possible

If we're switching to requiring setuptools, let's make sure to avoid the main reason we haven't required setuptools: that it breaks python setup.py install by implicitly doing egg installs and corrupting sys.path:

from setuptools.command.bdist_egg import bdist_egg class bdist_egg_disabled(bdist_egg): """Disabled version of bdist_egg Prevents setup.py install from performing setuptools' default easy_install, which it should never ever do. """ def run(self): sys.exit( "Aborting implicit building of eggs. Use `pip install .` to install from source." ) ... setup_args['cmdclass'] = { ... 'bdist_egg': bdist_egg if 'bdist_egg' in sys.argv else bdist_egg_disabled, }

setup.py

minrk · 2020-05-07T10:55:52Z

buildutils/build_cffi.py

+            library_dirs=cfg['library_dirs'],
+            runtime_library_dirs=cfg['runtime_library_dirs'],
+               source="""
+            #include <stdio.h>


Can this be in a .c file instead of inlined?

minrk · 2020-05-07T11:01:05Z

setup.py


 # whether any kind of bdist is happening
 # do this before importing anything from distutils
 doing_bdist = any(arg.startswith('bdist') for arg in sys.argv[1:])

-if any(bdist in sys.argv for bdist in ['sdist', 'bdist_wheel', 'bdist_egg']):
-    import setuptools
+from setuptools import setup


If we're switching to requiring setuptools, let's make sure to avoid the main reason we haven't required setuptools: that it breaks python setup.py install by implicitly doing egg installs and corrupting sys.path:

from setuptools.command.bdist_egg import bdist_egg class bdist_egg_disabled(bdist_egg): """Disabled version of bdist_egg Prevents setup.py install from performing setuptools' default easy_install, which it should never ever do. """ def run(self): sys.exit( "Aborting implicit building of eggs. Use `pip install .` to install from source." ) ... setup_args['cmdclass'] = { ... 'bdist_egg': bdist_egg if 'bdist_egg' in sys.argv else bdist_egg_disabled, }

buildutils/build_cffi.py

- it's long enough that it's easier to be a standalone file - load constant_names without zmq being importable - address formatting, dropped py2compat

it's called lib now

- use manual malloc/free as ffi.new has the wrong lifecycle - implement zero-copy recv - get buffer from zmq_msg

minrk · 2021-01-13T15:08:08Z

zero-copy now works in PyPy. Thanks @mattip!

mattip · 2021-01-13T15:13:14Z

Thanks for finishing this up. Are there any benchmarks for pyzmq?

mattip · 2021-01-13T15:20:39Z

I found https://github.com/achimnol/asyncio-zmq-benchmark (that @minrk contributed to). Is that still relevant?

mattip commented Mar 30, 2020

View reviewed changes

setup.py Show resolved Hide resolved

minrk reviewed May 7, 2020

View reviewed changes

mattip force-pushed the pypy-cffi branch from a322aa0 to bde44bd Compare November 2, 2020 18:25

minrk force-pushed the pypy-cffi branch 2 times, most recently from 9b5cb5e to a758f8a Compare November 27, 2020 13:40

mattip and others added 8 commits January 12, 2021 13:43

PyPy can make a pure python class support the buffer protocol

fe10b73

use a proper cffi extension module, register for gc

f0104f9

split _cffi.c into its own file

f138ed2

- it's long enough that it's easier to be a standalone file - load constant_names without zmq being importable - address formatting, dropped py2compat

allow opting in to cffi backend on cpython

007cd88

omit cython extensions when installing with pypy

2dd18d3

fix _cffi.lib import

1b25204

it's called lib now

cffi: add some missing message-tracker attributes

fd69d88

test pypy on linux

ee1a701

minrk force-pushed the pypy-cffi branch 2 times, most recently from 22fbd09 to 1f821b3 Compare January 13, 2021 12:58

get zero-copy working on cffi

bc45c80

- use manual malloc/free as ffi.new has the wrong lifecycle - implement zero-copy recv - get buffer from zmq_msg

minrk force-pushed the pypy-cffi branch from 1f821b3 to bc45c80 Compare January 13, 2021 13:59

minrk changed the title ~~WIP: PyPy can make a pure python class support the buffer protocol~~ zero-copy support in cffi backend Jan 13, 2021

minrk merged commit 11ae0de into zeromq:master Jan 13, 2021

mattip mentioned this pull request Jan 21, 2021

Rebuild for pypy37 conda-forge/ipykernel-feedstock#72

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

zero-copy support in cffi backend #1365

zero-copy support in cffi backend #1365

mattip commented Feb 25, 2020

minrk commented Feb 26, 2020

mattip commented Feb 27, 2020

mattip commented Feb 27, 2020

minrk commented Mar 3, 2020 •

edited

mattip commented Mar 30, 2020

mattip Mar 30, 2020

minrk May 7, 2020

minrk May 7, 2020

minrk May 7, 2020

minrk commented Jan 13, 2021

mattip commented Jan 13, 2021

mattip commented Jan 13, 2021

zero-copy support in cffi backend #1365

zero-copy support in cffi backend #1365

Conversation

mattip commented Feb 25, 2020

minrk commented Feb 26, 2020

mattip commented Feb 27, 2020

mattip commented Feb 27, 2020

minrk commented Mar 3, 2020 • edited

mattip commented Mar 30, 2020

mattip Mar 30, 2020

Choose a reason for hiding this comment

minrk May 7, 2020

Choose a reason for hiding this comment

minrk May 7, 2020

Choose a reason for hiding this comment

minrk May 7, 2020

Choose a reason for hiding this comment

minrk commented Jan 13, 2021

mattip commented Jan 13, 2021

mattip commented Jan 13, 2021

minrk commented Mar 3, 2020 •

edited