Rewrite OpenEXR python bindings using pybind11 and numpy #1756

cary-ilm · 2024-05-16T04:28:26Z

This introduces an entirely new python API for reading and writing OpenEXR files that supports all file features (or will soon): scanline, tiled, deep, multi-part, etc. It uses numpy arrays for pixel data, and it supports both separate arrays for each channel and interleaving of RGB data into a single composite channel. It leaves the existing binding API in place for backwards-compatibility; there's no overlap between the two APIs.

See src/wrappers/python/README.md for examples of the new API.

The API is simple: the File object holds a py::list of Part objects, each of which has a py::dict for the header and a py::dict for the channels, and each channel holds a numpy array for the pixel data. There's intentionally no support for selective scanline/time reading; to keep the API simple, reading a file reads the entire channel data for all parts. There is, however, an option to read just the headers and skip the pixel data entirely.

A few things don't work yet but should be fixed before release:

Reading and writing of deep data samples isn't finished.
ID manifest attributes aren't supported yet.
For mip-mapped images, it current only reads the top level.
Channel subsampling is not working.
I'm not 100% sure the memory ownership is handled properly.

This also does not (yet) properly integrate the real Imath types from the proper Imath bindings. It leaves in place for now the internal "Imath.py" module, but defines its own internal set of set of Imath classes. This needs to be resolved once the Imath bindings are distributed via pypi.org.

The test suite downloads images from openexr-images and runs them through a battery of reading/writing tests. Currently, the download is enabled via the OPENEXR_TEST_IMAGE_REPO environment variable that is off by default but is on in the python wheel CI workflow.

This also adds operator== methods to KeyCode and PreviewImage, required in order to implement operator== for the Part and File objects.

meshula · 2024-05-22T16:19:47Z

For the purposes of code review, could you elaborate a little on this ~

It leaves the existing binding API in place for backwards-compatibility; there's no overlap between the two APIs.

Specifically I'm wondering if the existing API implementation is untouched, or if the existing API is re-implemented using pybind11. There's a lot here to read and before starting I'm wondering if also we need to be verifying that existing API is fulfilled in the new implementation.

cary-ilm · 2024-05-22T16:51:07Z

Thanks for looking! The existing C-python implementation remains untouched. I renamed the file to PyOpenEXR_old.cpp didn't change anything other than to import the symbols into the new pybind11-defined module. The old API implements classes InputFile and OutputFile, which continue to operate as before, while the new implementation has a single new class File, which does both reading and writing.

The test for the old API, which illustrates the usage, is in src/wrappers/python/tests/test_old.py

This seemed a reasonable approach since I wanted to change so much about the API, primarily the switch to numpy for pixel data, but not entirely break everything.

The other .py files in src/wrappers/python/tests use the new API. Definitely start there, since getting your feedback on the usage is probably more important than on the actual pybind11 implementation.

The API is simple enough that the src/wrappers/python/README.md is a pretty good summary. It doesn't illustrate everything but it covers the basics.

lji-ilm · 2024-05-24T23:14:51Z

src/wrappers/python/README.md

+        f.write("readme_modified.exr")
+
+        with OpenEXR.File("readme_modified.exr") as o:
+            assert o.header()["displayWindow"] == OpenEXR.Box2i((3,4),(5,6))


If we want the outward facing API to be really pythonic, probably we don't need the IMath types?

A more pythonic way would simply be o.header()["displayWindow"] == ((3 ,4), (5, 6))

I looked up src\wrappers\python\IMath.py, it doesn't seem there is a substantial difference from python tuples, except for the print (repr) method. IMath.Box did not really type it all the way down, with its min and max looks like python tuples (although the author's intention is to use IMath.point).

If we really want to enforce the dimensionality of the parameter but remain pythonic or num-pythonic, maybe set this to an ndarray and assert its shape is (2, 2). I tend to think a few classes (esp. point and box) in Imath.py is reinventing the wheel in a C-style and doesn't really read like modern numpy/scipy code. These data could all just be ndarrays, that's actually less confusion than letting the user figure out whether box.min should be plain tuple or Imath.point. The important things are this parameter's type and shape, which are all enforced in ndarry for free (up on assign, arithmetic and comparison).

what do people think?

I'm also inclined to believe that maybe we should accept and promote the built-in Python types (tuples, arrays, dictionaries) for all the usual "short" data types for which they seem a decent fit, and use Numpy arrays for blocks of image pixel data, and call it a day. The Imath types are not Pythonic, but rather look like somebody simply wrapped a C++ library. They're not friendly to people who must pass data back and forth between OpenEXR and almost any other Python library (Alembic is the only Python interface I know of that expects people to use classes from Imath's Python bindings).

cary-ilm · 2024-06-02T04:24:51Z

I removed all the Imath classes and replaced them with numpy arrays and tuples. Vec 2/3 Int/Float/Double attributes appear in the header dict as 2- or 3-element numpy arrays of type int32, float32, or float64, and M33/44 Float/Double as 3x3 or 4x4 numpy arrays. Box2s are tuples of 2-element arrays.

RationalAttributes now appear as Fraction objects from the fractions module. DoubleAttributes appear as single-element numpy arrays of type float64, so they get preserved when written. Chromaticities are 8-element tuples.

I left the TimeCode and KeyCode classes in place, since they have convenient member fields that would be unintuitive in a tuple.

The File object also now implements __enter__/__exit__ so that with File(...) as f works.

The README.md examples have been updated accordingly, so they're more pythonic now.

src/wrappers/python/PyOpenEXR.cpp

lji-ilm · 2024-06-18T19:26:47Z

I am following up with our discussion about how to best represent deep data in numpy, following up our last meeting.

I also want to pay an tribute to A-buffer here since I think EXR deep is probably just thinking about the same thing :) : https://dl.acm.org/doi/pdf/10.1145/965139.807360

Now onto the real topic. array:object[array:dtype:float] seem to be the way to go. By keeping both levels to be numpy data structures, it best supports numpy's slicing and other array based operations. The following code is an excerpt from HDF5's official h5py library's vlen data type implementation (https://docs.h5py.org/en/latest/special.html#arbitrary-vlen-data)

array([array([1, 2, 3], dtype=int32), array([1, 2, 3, 4, 5], dtype=int32)], dtype=object)

A few stackoverflow posts discussed this question, and the above answer is generally prefered due to the fact that it doesn't involve native python lists, which will not support numpy operations. The following is only one example:

https://stackoverflow.com/questions/3386259/how-to-make-a-multidimension-numpy-array-with-a-varying-row-size

A special mention to the "Awkward Arrays" library, which specialize in dealing with this class of "varying data per sample" problems: https://awkward-array.org/doc/main/getting-started/index.html . I don't think we need to add a dependency, however the first two questions in FaQ did a good job in terms of a state-of-the-art review of the implementations.

I reviewed a few scientific imaging formats, including FITS (https://fits.gsfc.nasa.gov/fits_primer.html), but their data container per pixel (or per sample) are homogeneous. HDF5's vlen is the only one I spotted in a major format that deals with deep-like data formats. We might be quite on the frontier here.

cary-ilm · 2024-06-18T23:13:58Z

Thanks for that consideration, @lji-ilm. I'm satisfied with the dtype=object approach, the implementation is straightforward and the structure is I think what the user would naturally expect.

And for what it's worth, ChatGPT is what originally offered this suggestion, after nobody answered my query on the pybind11 discussion page.

lji-ilm · 2024-07-04T23:50:40Z

I think the current build install with OPENEXR_BUILD_PYTHON=ON will copy the built artefact to /usr/local/python/OpenEXR, which might be a distro-specific install path.

My wsl ubuntu 22.04 uses a few paths under /usr/local/lib, such as /usr/local/lib/python3.10 etc in the vanilla python apt-get installation for global python paths. I don't think it's consistent across the various distros. It might be worthwhile to have a note about the artefact's cmake install path v.s. vanilla python path to avoid surprises, such as the import test fail.

Maybe we should have a step in CMAKE to ensure python paths... However if at the end of the day we expect people to pip this, maybe it's not a concern.

lji-ilm · 2024-07-05T00:42:36Z

src/wrappers/python/PyOpenEXR.cpp

+// If 'rgba' is true, gather 'R', 'G', 'B', and 'A' channels and interleave
+// them into  a 3- or 4- (if 'A' is present) element numpy array. In the case
+// of raw 'R', 'G', 'B', and 'A' channels, the corresponding key in the
+// channels dict is "RGB" or "RGBA".  For channels with a prefix,
+// e.g. "left.R", "left.G", etc, the channel key is the prefix.


I would like to have another thought about this API.

What happens with the rgba parameter is essentially this:

There is an extra flip in the core Python API C implementation.

If you do NOT turn it on, what you get is more low level (more conforming to the file's storage format), but more difficult to use in subsequent python/numpy operations.

If you DO turn it on, what you get is higher level, but easier to use in subsequent numpy operations, especially for beginners.

This data transformation can be done in python with numpy, without bothering with C++ code.

It caught my attention because:

The default is hard to use and more low level, but is general purpose. You have to flip a switch to get something that's easier to use for the noob use cases, but would not work out in advanced cases. This feels backwards.

This can be done with numpy calls after the binding layer, instead of doing it in C++ before the binding layer

I suggest we change it to one of the following:

Remove this parameter and the channel gathering in C++ code. Provide another OpenEXR.utils module that is entirely written in python that does tasks similar to this. (that can be either inside or outside of the official repo -- we can debate whether that would reinvent wheels of openimageIO)

Flip the semantics this parameter, the C++ code will always attempt to gather channel into a nice rgba layout if it could do so, but you can put in an extra parameter to tell it to NOT do this.

I rethought this after a week.

I think the most striaght forward way might be simply

don't have this parameter, simplify the API interface

always prep a channel "RGB" and "RGBA" numpy array if possible according to the file structure. If not possible, then don't prep it.

There are some python techniques that can do a lazy bind (e.g. only allocate memory on R, G, B initially and only allocate "RGBA" when this key is queried on the channel dict), but i wonder if that worth it.

Overall i think my thought is about that extra parameter. either remove it or move it onto the channel object. It probably should not stay on the File object's initializer

I tend to agree. I've left it in place but inverted the sense of the parameter and renamed it separate_channels, off by default. So the default behavior is the most convenient, even if it differs from the file structure.

I'm still not completely happy with this.

pyproject.toml

This introduces an entirely new python API for reading and writing OpenEXR files that supports all file features (or will soon): scanline, tiled, deep, multi-part, etc. It uses numpy arrays for pixel data, and it supports both separate arrays for each channel and interleaving of RGB data into a single composite channel. It leaves the existing binding API in place for backwards-compatibility; there's no overlap between the two APIs. See src/wrappers/python/README.md for examples of the new API. The API is simple: the ``File`` object holds a py::list of ``Part`` objects, each of which has a py::dict for the header and a py::dict for the channels, and each channels hold a numpy array for the pixel data. There's intentionally no support for selective scanline/time reading; reading a file reads the entire channel data for all parts. There *is*, however, an option to read just the headers and skip the pixel data entirely. A few things don't work yet: - Reading and writing of deep data isn't finished. - ID manfest attributes aren't supported yet. - For mipmaped images, it current only reads the top level. This also does not (yet) properly integrate the real Imath types. It leaves in place for now the internal "Imath.py" module, but defines its own internal set of set of Imath classes. This needs to be resolve once the Imath bindings are distributed via pypi.org The test suite downloads images from openexr-images and runs them through a battery of reading/writing tests. Currently, the download is enabled via the ``OPENEXR_TEST_IMAGE_REPO`` environment variable that is off by default but is on in the python wheel CI workflow. This also adds operator== methods to ``KeyCode`` and ``PreviewImage``, required in order to implement operator== for the ``Part`` and ``File`` objects. Signed-off-by: Cary Phillips <cary@ilm.com>

Signed-off-by: Cary Phillips <cary@ilm.com>

- Add __enter__/__exit__ so "with File(name) as f" works - Allow channel dict to reference pixel array directly - Fix subsampling Signed-off-by: Cary Phillips <cary@ilm.com>

Also: - Use single-element array for DoubleAttribute - Use fractions.Fraction for Rational - Use 8-element tuple for chromaticities - Add __enter__/__exit__ so "with" works with File object - remove operator== entirely, it's way too hard to implement something that's actually useful for testing purposes. The tests do their own validations. Signed-off-by: Cary Phillips <cary@ilm.com> pixel comparison Signed-off-by: Cary Phillips <cary@ilm.com>

Also works properly for separate channels and when coalescing into RGB/RGBA arrays. Signed-off-by: Cary Phillips <cary@ilm.com>

Signed-off-by: Cary Phillips <cary@ilm.com>

lji-ilm · 2024-08-08T23:11:03Z

src/wrappers/python/PyOpenEXR.cpp

+        channel_name == 'B' ||
+        channel_name == 'A')
+    {
+        // has the right final character. The preceding character is either a


Truncated comment.

Also for this entire method -- is it appropriate to write it on the python side instead of the C side? It seems just some string manipulations and would be expressed more naturally in python, without much performance penalty of moving it off C code.

The function is used throughout this file, moving it would mean you'd have to move most of the implementation to Python....

Good point... Let's leave it then (but finish the comment :)

lji-ilm · 2024-08-08T23:13:14Z

src/wrappers/python/PyOpenEXR.cpp

+}
+
+//
+// Return whether "name" corresponds to one of the 'R', 'G', 'B', or 'A'


This sounds like it return a book or 0/1, but in fact it is a mapping between the channel name and array indices, i think.
It might be better to rename it to something along the line of "map to channel indices" or "get channel index" etc...?

I renamed it channelNameToRGBA(). Still confusing, but hopefully a little less so.

lji-ilm · 2024-08-08T23:18:39Z

src/wrappers/python/PyOpenEXR.cpp

+
+template <class P, class T>
+bool
+is_v2(const py::object& object, Vec2<T>& v)


I can't find a copy of IMath's python bindings at hand, but should these marshallings better be in IMath rather than OpenEXR...?

Also i understand this is internal, but this is actually a conversion rather than a read-only test, so maybe rename them to "convert_to_v2" or simply "to_v2"

I renamed is_v2() to objectToV2() and the others similarly.

meshula · 2024-08-09T02:12:29Z

I'm happy if we land this; I haven't got more feedback to give, seems like playing with it for a while is a good next step.

lji-ilm · 2024-08-09T20:02:54Z

Agreed. Actually I have been constantly using this binding in my experiments of benchmarking/compression in the last few weeks, so not only it works but it also works under a bit of stress.

Hopefully we can see this in pip install OpenEXR soon!

Signed-off-by: Cary Phillips <cary@ilm.com>

cary-ilm mentioned this pull request May 21, 2024

OpenEXR.OutputFile Fails to Write to BytesIO Buffer #1758

Open

lji-ilm reviewed May 24, 2024

View reviewed changes

meshula reviewed Jun 2, 2024

View reviewed changes

src/wrappers/python/PyOpenEXR.cpp Outdated Show resolved Hide resolved

meshula reviewed Jun 2, 2024

View reviewed changes

src/wrappers/python/PyOpenEXR.cpp Outdated Show resolved Hide resolved

meshula reviewed Jun 2, 2024

View reviewed changes

src/wrappers/python/PyOpenEXR.cpp Outdated Show resolved Hide resolved

lji-ilm reviewed Jul 5, 2024

View reviewed changes

JeanChristopheMorinPerso reviewed Jul 16, 2024

View reviewed changes

pyproject.toml Outdated Show resolved Hide resolved

cary-ilm added 8 commits July 27, 2024 12:07

Fix subsampling

65bf5fe

Signed-off-by: Cary Phillips <cary@ilm.com>

- Make README.md examples more pythonic

02ff105

- Add __enter__/__exit__ so "with File(name) as f" works - Allow channel dict to reference pixel array directly - Fix subsampling Signed-off-by: Cary Phillips <cary@ilm.com>

Deep reading/writing works for both scanline and tiles

1d4b1ef

Also works properly for separate channels and when coalescing into RGB/RGBA arrays. Signed-off-by: Cary Phillips <cary@ilm.com>

doc strings

a8e9230

Signed-off-by: Cary Phillips <cary@ilm.com>

Rename rgba param to separate_channels, off by default.

fd8df75

Signed-off-by: Cary Phillips <cary@ilm.com>

remove numpy as build requirement

6ee1a88

Signed-off-by: Cary Phillips <cary@ilm.com>

cary-ilm force-pushed the pybind11.2 branch from 7f746d8 to 6ee1a88 Compare July 28, 2024 00:54

lji-ilm reviewed Aug 8, 2024

View reviewed changes

cary-ilm added 3 commits August 12, 2024 12:10

Rename rgbaChannel() to channelNameToRGBA; rename is_* to objectTo*

2eb3730

Signed-off-by: Cary Phillips <cary@ilm.com>

Add "deprecated" to InputFile/OutputFile/Imath doc strings

ed2b063

Signed-off-by: Cary Phillips <cary@ilm.com>

Explicitly set python versions for wheel workflows, and exclude 3.13

1e3993c

Signed-off-by: Cary Phillips <cary@ilm.com>

cary-ilm force-pushed the pybind11.2 branch from 790ba19 to 1e3993c Compare August 12, 2024 23:39

resolve conflicts

7addf5e

Signed-off-by: Cary Phillips <cary@ilm.com>

resolve conflict

f303d8e

Signed-off-by: Cary Phillips <cary@ilm.com>

cary-ilm merged commit 84d7d52 into AcademySoftwareFoundation:main Aug 13, 2024
35 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Rewrite OpenEXR python bindings using pybind11 and numpy #1756

Rewrite OpenEXR python bindings using pybind11 and numpy #1756

cary-ilm commented May 16, 2024

meshula commented May 22, 2024

cary-ilm commented May 22, 2024

lji-ilm May 24, 2024

lgritz May 25, 2024

cary-ilm commented Jun 2, 2024

lji-ilm commented Jun 18, 2024

cary-ilm commented Jun 18, 2024

lji-ilm commented Jul 4, 2024 •

edited

Loading

lji-ilm Jul 5, 2024 •

edited

Loading

lji-ilm Jul 19, 2024

cary-ilm Jul 24, 2024

lji-ilm Aug 8, 2024

meshula Aug 9, 2024

lji-ilm Aug 9, 2024

lji-ilm Aug 8, 2024

cary-ilm Aug 13, 2024

lji-ilm Aug 8, 2024

lji-ilm Aug 8, 2024

cary-ilm Aug 13, 2024

meshula commented Aug 9, 2024

lji-ilm commented Aug 9, 2024

Rewrite OpenEXR python bindings using pybind11 and numpy #1756

Rewrite OpenEXR python bindings using pybind11 and numpy #1756

Conversation

cary-ilm commented May 16, 2024

meshula commented May 22, 2024

cary-ilm commented May 22, 2024

Choose a reason for hiding this comment

Choose a reason for hiding this comment

cary-ilm commented Jun 2, 2024

lji-ilm commented Jun 18, 2024

cary-ilm commented Jun 18, 2024

lji-ilm commented Jul 4, 2024 • edited Loading

lji-ilm Jul 5, 2024 • edited Loading

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

meshula commented Aug 9, 2024

lji-ilm commented Aug 9, 2024

lji-ilm commented Jul 4, 2024 •

edited

Loading

lji-ilm Jul 5, 2024 •

edited

Loading