DM-22073: Add support for writing matplotlib figures #205

leeskelvin · 2019-11-14T16:59:32Z

No description provided.

timj

This looks good although I do have some comments for improvements to the tests.

It is a bit annoying that butler.get can't work but it is important that butler.getUri is tested.

timj · 2019-11-14T17:03:52Z

tests/test_matplotlibFormatter.py

+import os
+import shutil
+
+import matplotlib


Please protect the matplotlib import and skip the tests if matplotlib is not available. daf_butler should not require matplotlib.

Is this resolved? I agree that it's important.

(It looks resolved, given the new try block for the import of matplotlib, but I haven't been able to test it in practice.)

It's hard for any of us to test it, I think, because we all use conda enviroments that always need to have matplotlib for other reasons. But I'm pretty confident what was merged will work.

timj · 2019-11-14T17:05:16Z

python/lsst/daf/butler/formatters/matplotlibFormatter.py

+    """Interface for writing matplotlib figures.
+    """
+
+    extension = '.png'


Please use double quotes in daf_butler to be consistent with the rest of the code base.

timj · 2019-11-14T19:31:13Z

tests/test_matplotlibFormatter.py

+                                  universe=butler.registry.dimensions)
+        butler.registry.registerDatasetType(datasetType)
+        pyplot.imshow(np.random.randn(3, 4))
+        butler.put(pyplot.gcf(), datasetType)


@TallJimbo does this work without a DataId because the dataset type has no dimensions?

Yes, exactly.

I think that since this code could be seen as providing example usage that we should not be relying on the super special case here of datasetType and datasetRef being interchangeable. Please receive the ref from the butler.put here and then use it later in the getUri and later butler methods below. This all works because there are no dimensions but we don't want to give the impression that it's going to work more generally.

ref = butler.put(...) parsed = ...(butler.getUri(ref)) ... butler.datasetExists(ref)

etc.

timj · 2019-11-15T17:00:46Z

tests/test_matplotlibFormatter.py

+            # predicting the filename path based on test run
+            self.assertTrue(
+                filecmp.cmp(
+                    os.path.join(self.root, 'testrun', datasetType.name,


I think I would prefer it if this path was constructed from calling butler.getUri since then you aren't relying on knowing how the file name was constructed within the datastore.

That's what we tried first, but annoyingly filecmp apparently can't handle file:// prefixes, and it occurred to me that actual usage would look more like this, but perhaps with users explicitly defining the template so they'd know where to look in general, rather than asking the butler where to look each time.

Do you have an incantation handy for how to get that file:// prefix off gracefully?

You can either use ButlerURI or you can go for the standard option of:

parsed = urllib.parse.urlparse(uri)

and then use parsed.path.

timj · 2019-11-15T17:02:32Z

tests/test_matplotlibFormatter.py

+                    file.name,
+                    shallow=True
+                )
+            )


Please add calls to butler.datasetExists and butler.remove so that we can check that these PNG images are being treated like a normal dataset. Also try to do a butler.get and check that it fails with with self.assertRaises.

TallJimbo · 2019-11-15T17:31:03Z

@leeskelvin, I'm going to let you take the lead on addressing these comments, but please find me for more pair-programming if you want help with anything.

timj approved these changes Nov 15, 2019

View reviewed changes

leeskelvin force-pushed the tickets/DM-22073 branch 7 times, most recently from 7c004c5 to 06755d1 Compare November 18, 2019 20:57

Add support for writing matplotlib figures

8cb7c8f

leeskelvin force-pushed the tickets/DM-22073 branch from 06755d1 to 8cb7c8f Compare November 18, 2019 21:31

leeskelvin merged commit 8cb7c8f into master Nov 19, 2019

leeskelvin deleted the tickets/DM-22073 branch November 19, 2019 00:29

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

DM-22073: Add support for writing matplotlib figures #205

DM-22073: Add support for writing matplotlib figures #205

leeskelvin commented Nov 14, 2019

timj left a comment

timj Nov 14, 2019

gpdf Nov 20, 2019

gpdf Nov 20, 2019

TallJimbo Nov 20, 2019

timj Nov 14, 2019

timj Nov 14, 2019

TallJimbo Nov 15, 2019

timj Nov 18, 2019

timj Nov 15, 2019

TallJimbo Nov 15, 2019

timj Nov 15, 2019

timj Nov 15, 2019

TallJimbo commented Nov 15, 2019 •

edited

DM-22073: Add support for writing matplotlib figures #205

DM-22073: Add support for writing matplotlib figures #205

Conversation

leeskelvin commented Nov 14, 2019

timj left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

TallJimbo commented Nov 15, 2019 • edited

TallJimbo commented Nov 15, 2019 •

edited