html repr of xarray object (for the notebook) #1627

benbovy · 2017-10-11T21:49:20Z

Edit: preview for Dataset and DataArray (pure html/css)

Dataset: https://jsfiddle.net/tay08cn9/4/
DataArray: https://jsfiddle.net/43z4v2wt/9/

I started to think a bit more deeply about how could look like a more rich, html-based representation of xarray objects that we would see, e.g., in jupyter notebooks.

Here are some ideas for Dataset: https://jsfiddle.net/9ab4c3tr/35/

Some notes:

The html repr looks pretty similar than the plain-text repr. I think it's better if they don't differ too much from each other.
For the sake of consistency, I've stolen some style from pandas.Dataframe repr as it is shown in jupyterlab.
I tried to emphasize the most important parts of the repr, i.e., the lists of dimensions, coordinates and variables.
I think it's best if we keep a very lightweight implementation, i.e., pure HTML/CSS (no Javascript). It already allows some interaction like hover effects and collapsible sections. However, I doubt that more fancy stuff (like, e.g., highlighting on hover a specific dimension simultaneously at several places of the repr) would be possible here without Javascript. I have limited skills in this area, though.

It is still, of course, some preliminary thoughts. Any feedback/suggestion is welcome, even opinions about whether an html repr is really needed or not!

The text was updated successfully, but these errors were encountered:

rabernat · 2017-10-11T21:53:42Z

OMG this is so cool!

shoyer · 2017-10-11T22:48:33Z

OMG indeed this looks fantastic! We should test this on more examples, but I already love it. The underline is a great way to illustrate which variables are dimensions.

I think it's best if we keep a very lightweight implementation, i.e., pure HTML/CSS (no Javascript)

Agreed. Notebook viewers tend to sanitize out JavaScript, especially if originating from an unknown source.

highlighting on hover a specific dimension simultaneously at several places of the repr

This would also be awesome, if possible with pure CSS. I don't know, but I'm going to bug my colleague who does JavaScript visualization.

shoyer · 2017-10-11T23:41:49Z

OK, so bad news is that it not possible to select elements other than siblings or descendant tags with CSS. So selecting "cousin" tags like a dimension name at multiple locations in the repr is out.

But we could potentially add JavaScript for fancy hover effects. Even if it gets stripped out in many cases (for untrusted notebooks), it should degrade gracefully to the HTML only repr. My main concern would annoying prompts in the notebook interface that ask a user if they want to trust outputs or not.

darothen · 2017-10-12T02:26:05Z

Wow, great job @benbovy!

With the upcoming move towards Jupyter Lab and a better infrastructure for custom plugins, could this serve as the basis for a "NetCDF Extension" for Jupyter Lab? It would be great if double clicking on a NetCDF file in the JLab file explorer could open up this sort of information, or even a quick and dirty ncview-like plotter.

benbovy · 2017-10-12T13:00:59Z

Thanks for the feedback!

Here are a few more ideas: https://jsfiddle.net/9ab4c3tr/48/

main section titles are colored so that these are more detached from the content (not sure I really like it, though),
subtle shade variations, notably for displaying the first values for each variable,
drop-downs for displaying attributes per variable if any (collapsed by default).
hover-effect for dimensions: highlight all variables having the hovered dimension (uses Javascript)

We might use drop-downs to display other useful information as well, such as the type of array (e.g., dask-array, in-memory numpy.array, etc...)

Fancy hover effects would be awesome indeed, although my concern is that too much hover effect would be a source of distraction.

A jupyterlab NetCDF viewer extension would be awesome too! It might also leverage phosphor's datagrid (phosphorjs/phosphor#285) to explore the raw data values.

rabernat · 2017-10-12T14:02:06Z

Although this could clearly be made more fancy and complicated, I think what you have here would already be a great addition. I love the drop-down attributes (those are missing from the standard repr). Don't hesitate to start a PR! More fanciness could always be added in the future based on user feedback.

…

On Thu, Oct 12, 2017 at 9:01 AM, Benoit Bovy ***@***.***> wrote: Thanks for the feedback! Here is a few more ideas: https://jsfiddle.net/9ab4c3tr/48/ - main sections titles are colored so that these are more detached from the content (not sure I really like it, though), - subtle shade variations, notably for displaying the first values for each variable, - drop-downs for displaying attributes per variable if any (collapsed by default). - hover-effect for dimensions: highlight all variables having the hovered dimension (uses Javascript) We might use drop-downs to display other useful information as well, such as the type of array (e.g., dask-array, in-memory numpy.array, etc...) Fancy hover effects would be awesome indeed, although my concern is that too much hover effect would be a source of distraction. A jupyterlab NetCDF viewer extension would be awesome too! It might also leverage phosphor's datagrid (phosphorjs/phosphor#285 <phosphorjs/phosphor#285>) to explore the raw data values. — You are receiving this because you commented. Reply to this email directly, view it on GitHub <#1627 (comment)>, or mute the thread <https://github.com/notifications/unsubscribe-auth/ABJFJrhyjDbEVEz5AqUN6l2ppExwE3xiks5srg2MgaJpZM4P2LOr> .

shoyer · 2017-10-12T15:15:59Z

As long as we're nit-picking style....

main section titles are colored so that these are more detached from the content (not sure I really like it, though),

I slightly prefer the grayscale you had before -- the section titles are already well detached.

subtle shade variations, notably for displaying the first values for each variable,

Love it!

drop-downs for displaying attributes per variable if any (collapsed by default).

It is wonderful to expose this information! I have a slight concern that the + icons could be confusing with the * we use for index variables in the normal repr. I don't have any good ideas for fixing this yet.

hover-effect for dimensions: highlight all variables having the hovered dimension (uses Javascript)

For highlighting, maybe something a little more subtle would work well, either:

only highlighting the dimension names below, not the full variables, or
only highlight full variables with a matching name, as well as all matching dimensions.

benbovy · 2017-10-12T16:02:59Z

Before starting a PR on this, I'd like to get the design right (at least the static part) and also to clarify the repr of other xarray objects such as DataArray, Variable, Dataset.coords and Dataset.data_vars.

For Dataset.coords and Dataset.data_vars it would be pretty straightforward (we may simply display the corresponding sections in Dataset html repr).

For DataArray and Variable I have still no clear idea on what would be a good representation for the data values:

Option A: embed the plain-text representation of the wrapped numpy.array (or dask.array) in a HTML container (https://jsfiddle.net/43z4v2wt/1/). The result is not that bad, although monospace and sans-serif fonts do not mix very well IMO.
Option B: a html-formatted table for data values as well. That would be tricky for >2d arrays, it has already been discussed in other issues without real consensus.

benbovy · 2017-10-12T16:15:29Z

have a slight concern that the + icons could be confusing with the * we use for index variables in the normal repr.

Maybe putting the + icon more to the left would do the trick (https://jsfiddle.net/9ab4c3tr/49/).

For highlighting, maybe something a little more subtle would work well

Changing font-weight instead of background-color works better IMO (same link above).

shoyer · 2017-10-12T16:17:09Z

Option A: embed the plain-text representation of the wrapped numpy.array (or dask.array) in a HTML container (https://jsfiddle.net/43z4v2wt/1/). The result is not that bad, although monospace and sans-serif fonts do not mix very well IMO.

I think this is probably our best option. My main suggestion is it should be possible to click somewhere (maybe on top xarray.DataArray header) to hide/show the data section.

shoyer · 2017-10-12T16:31:25Z

Maybe putting the + icon more to the left would do the trick (https://jsfiddle.net/9ab4c3tr/49/).

Now it just looks a little out of place :). I liked it better closer to the variable name.

Changing font-weight instead of background-color works better IMO (same link above).

Hmm, I'm not sure. I find text that moves very distracting -- I like changing background colors.

benbovy · 2017-10-12T17:06:26Z

My main suggestion is it should be possible to click somewhere (maybe on top xarray.DataArray header) to hide/show the data section.

See https://jsfiddle.net/43z4v2wt/3/

I find text that moves very distracting

Indeed, I agree now, it's annoying.

shoyer · 2017-10-12T17:57:45Z

One more trick we could add for the Dataset repr:

When hovering over data, show repr for the full array values (as a numpy array) in a box (similar to "title" text, but ideally fixed-width format).

benbovy · 2017-10-12T21:10:16Z

When hovering over data, show repr for the full array values (as a numpy array) in a box (similar to "title" text, but ideally fixed-width format).

That would be nice, although I guess that it would require javascript.

Alternatively, we can imagine two drop-downs per variable, one for the attributes and one for the full array values (numpy or dask repr). Each would be shown/hidden by two distinct symbols or links, perhaps both located on the right after the preview-values so that we avoid confusion with symbol * in the normal repr.

In the DataArray repr, instead of completely collapsing the data section we may reduce it into a one-line preview of the first values. We can even imagine showing the reduced version by default in cases where the full data section would take too much vertical space.

benbovy · 2017-10-12T21:23:34Z

A comment regarding the use of javascript: after reading this discussion (bokeh/bokeh#6700) I'm not sure that maintaining a javascript-based repr that works with both jupyterlab and the classic notebook is worth the few fancy features it would provide.

I'd rather stick with a pure html/css solution here. It might still co-exist with a full-featured jupyterlab extension for viewing NetCDF files.

shoyer · 2017-10-12T21:35:39Z

Alternatively, we can imagine two drop-downs per variable, one for the attributes and one for the full array values (numpy or dask repr). Each would be shown/hidden by two distinct symbols or links, perhaps both located on the right after the preview-values so that we avoid confusion with symbol * in the normal repr.

Yes, I like that. Possibly clicking on the array values would show the larger preview.

I did play around a little with a hover drop down, but the positioning is a little hacky:
https://jsfiddle.net/gux879hn/2/

fmaussion · 2017-10-13T09:11:07Z

Note that the info() method was added a while ago: #1176

At least in the notebook, this change would make it obsolete.

benbovy · 2017-10-13T12:16:01Z

New version for Dataset: https://jsfiddle.net/9ab4c3tr/50/

clicking on the name of a variable with A symbol shows its attributes.
clicking on the first-values of a variable shows the .data repr

benbovy · 2017-10-13T13:30:44Z

And for DataArray: https://jsfiddle.net/43z4v2wt/5/ (possibility to reduce the full data repr to a one-line preview)

fujiisoup · 2017-10-13T14:38:12Z

This is wonderful!

Is it reasonable to add a drop down in the attribute section if there are too many items there?

My data sometimes have a long list of attributes and important information about coordinate and data_vars are not seen without scrolling back the page.
(This may be a rare case though...)

benbovy · 2017-10-13T14:57:06Z

Is it reasonable to add a drop down in the attribute section if there are too many items there?

In the examples above, you can click on the Attributes section title to collapse the whole section (same with coordinates and data variables sections).

Given your case, maybe it would be nice to add a rule to show the attribute section collapsed when the number of attributes is too long. If we allow that, then maybe it would be nice to also show the number of items in the section titles, e.g., ► Attributes (10):

shoyer · 2017-10-13T15:23:05Z

I have a slight proposed tweak on the Dataset repr (https://jsfiddle.net/jrot9pex/1/). It makes two changes:

Changes the letter "A" to "a" for attributes. Maybe this is just me, but the capital "A" feels very loud, and reminds me of the scarlet letter!
Moves the marker "a" closer to the variable name, which makes it slightly clearer that it's associated.

Note that the info() method was added a while ago: #1176

At least in the notebook, this change would make it obsolete.

Yes, but let's keep it -- it have a nice pure text format (from ncdump) which works especially well with copy & paste.

Given your case, maybe it would be nice to add a rule to show the attribute section collapsed when the number of attributes is too long. If we allow that, then maybe it would be nice to also show the number of items in the section titles, e.g., ► Attributes (10):

Yes, I like this general idea -- though we might only show the number when it is collapsed, e.g., ► Attributes: (10)

fujiisoup · 2017-10-13T15:36:13Z

If we allow that, then maybe it would be nice to also show the number of items in the section titles, e.g., ► Attributes (10):

I like this idea. Looking forward to having it.

benbovy · 2017-10-13T17:04:07Z

though we might only show the number when it is collapsed

Good idea!

Changes the letter "A" to "a" for attributes

Agreed.

Moves the marker "a" closer to the variable name

The good (or bad?) thing with the marker "a" to the very left is that it is aligned with the drop-down symbols of the main sections, but to me either way is fine.

Let's summarize all suggestions so far:

Dataset: https://jsfiddle.net/tay08cn9/2/
DataArray: https://jsfiddle.net/43z4v2wt/7/

I think that we're getting close to something good!

I'll wait a bit before starting a PR (maybe sometime next week), in case other feedbacks or suggestions come up.

spencerahill · 2017-10-13T17:15:33Z

OMG this is so cool!

ditto wow I can't wait for this to be in!

shoyer · 2017-10-29T04:54:32Z

@benbovy it seems like that discussion has stalled out... are you ready to put together that PR? :)

Personally, I'm very happy with your current version. You might even convince me to hold off on the v0.10 release to include it!

benbovy · 2017-11-01T14:07:24Z

Sorry for my late reply @shoyer !

Yes I'm happy with this version too, I'll open asap a PR! Unfortunately I hadn't much time to work on this these last two weeks, but I want to put this in my priorities next week.

I guess that it is a bit late for v0.10 release which is already on track? I still don't exactly know how much time I'll need to implement this. I haven't thought yet about all implementation details (e.g., how to calculate the width of the variable name column? potential issues with jupyter notebook / jupyterlab...). It might also require some refactoring and/or new public API (e.g., to_html like in pandas).

shoyer · 2017-11-01T15:05:09Z

I guess that it is a bit late for v0.10 release which is already on track?

Yes, probably at this point. But hopefully we can do a shorter release cycle for v0.11. Also, though this is a big visual change, I'm not sure it's actually a breaking change, per se. Only notebook output will change, not programmatic use of repr().

fmaussion · 2017-11-01T15:22:55Z

Also, though this is a big visual change, I'm not sure it's actually a breaking change, per se.

Yes, but once it's there I'm sure that we'll get bug reports right away if it doesn't work well, because this is a very visual change. Therefore I am almost in favor of pushing this forward as fast as possible and see how it goes, maybe followed by a quick 0.10.1

But since I'm not the one doing the job I'm not allowed to say anything more here ;)

benbovy · 2017-11-07T11:20:41Z

I started working on a new PR (not yet submitted), but before continuing the work I really need to know how best we can include the CSS code in rich outputs for the notebook (e.g., inline CSS or using the <style> tag) without requiring any extension for the various front-end applications.

It is not really clear to me if it is even supported so I opened an issue on the jupyterlab side (see reference above).

benbovy · 2017-11-13T18:59:28Z

So we have different options regarding the tools to use for implementing these rich representations:

Using vdom. This option is very pythonic and is suggested by jupyterlab and nteract developers. I have used vdom to implement the rich repr of Dataset (not fully working yet + you need last jupyterlab or nteract to see it): https://gist.github.com/benbovy/a30f286f7fdf9528c4d0c7980be9b6a7. vdom is still in development, though. It is not yet supported by all front-ends. Currently it is supported in jupyterlab and nteract and support will be added soon for the classic notebook (and nbviewer?)
Using a template system like jinja2. It doesn't require any specific support on the various notebook front-ends, but it still adds a dependency.
Using Python strings formatting. No dependencies, but more tough to maintain.

I'd like to know what are your thoughts. Should we go with vdom and wait a bit until the tool is more mature / more front-ends are supported (hopefully soon, which let us some time if we can wait for release v0.11)? Or do you want it right now and so it's perhaps better to use templates / formatting?

shoyer · 2017-11-13T19:47:10Z

I am OK adding new (optional) Python dependencies like vdom or Jinja2 if that makes implementing and maintaining this easier.

If we can solve the problem of generating HTML from vdom as a fallback (nteract/vdom#43) and the vdom developers are supportive for various issues that come up, that could easily be the best option.

I would be reluctant to only support vdom output, because there are a lot of legacy notebook viewing interfaces (including various IDEs, cloud hosted notebook environments and rendering on GitHub) that could take a while (years?) to support it.

shoyer · 2017-11-13T20:42:43Z

The other thing to watch out for is if the vdom Python API is still immature and likely to lead to additional works when it changes in the future. This is somewhat of a judgment call. My sense is that they are taking a careful design to the project but it is still in the early days so it's hard to say for sure.

benbovy · 2017-11-13T21:22:28Z

Yes good points.

One part of me (the one that loves every cool, new package) says "let's use vdom for this right now", and the other (more wise) part says "let's use Python string formatting -- not even sure we need something like jinja2 --, we'll switch to vdom once it is more mature and it will be quite easy to do so."

jhamman · 2018-07-13T20:30:05Z

@benbovy - this came up in conversation today with @shoyer and a number of Jupyter devs. What is the current interest on the subject?

benbovy · 2018-07-14T10:25:24Z

Yeah I really need to continue the work in #1820, this PR has stalled for too long!

Last time I worked on this I was struggling a bit on good column auto-sizing and alignment with a pure CSS implementation (i.e., using CSS grid, display: content), but that's not really a blocker I think.

Another option would be to write a jupyterlab mime render extension. This narrows the supported front-ends, but I guess the issue of front-end theme integration will be easier to solve. Also, it will be possible to add more fancy features later.

Did you were at SciPy? I hope you had a great time! I wanted to attend the conference this year but finally I couldn't make it, unfortunately.

jhamman · 2018-07-16T03:59:32Z

Last time I worked on this I was struggling a bit on good column auto-sizing and alignment with a pure CSS implementation...but that's not really a blocker I think.

Agreed. I think that is something we can work on over time. A jupyterlab extension would be cool too but, as you say, it would have a smaller footprint in the shor-term.

Also, yeah, Scipy was great. I think I'll go back.

mrocklin · 2019-06-30T15:49:08Z

Thought I'd bump this (hopefully no one minds). I think that this is great!

rabernat · 2019-10-24T16:56:15Z

🎉 🍰 🍾 🏆 🏅

shoyer added the design question label Oct 12, 2017

benbovy mentioned this issue Nov 7, 2017

Best way to add html/css rich output for the notebook (no javascript, no extension)? jupyterlab/jupyterlab#3200

Open

rgbkrk mentioned this issue Nov 7, 2017

include button, label, and style components nteract/vdom#42

Merged

fujiisoup mentioned this issue Nov 17, 2017

Fix unexpected loading after print #1724

Merged

4 tasks

benbovy mentioned this issue Jan 11, 2018

WIP: html repr #1820

Closed

8 tasks

jsignell mentioned this issue Oct 21, 2019

Html repr #3425

Merged

7 tasks

dcherian closed this as completed in #3425 Oct 24, 2019

gtca mentioned this issue Feb 24, 2021

HTML representation for Jupyter notebooks scverse/muon#22

Closed

ddkohler mentioned this issue Jun 8, 2022

Integrate WrightTools objects with iPython wright-group/WrightTools#705

Open

tasansal mentioned this issue Sep 1, 2022

Add printable representation for MDIOReader and MDIOWriter TGSAI/mdio-python#33

Open

martinvonk mentioned this issue Feb 2, 2023

[ENHANCEMENT] Create html repr for Model class (for Notebooks) pastas/pastas#519

Closed

anitagraser mentioned this issue Mar 1, 2023

Nice html repr for trajectories movingpandas/movingpandas#293

Open

html repr of xarray object (for the notebook) #1627

html repr of xarray object (for the notebook) #1627

Comments

benbovy commented Oct 11, 2017 • edited

rabernat commented Oct 11, 2017

shoyer commented Oct 11, 2017

shoyer commented Oct 11, 2017

darothen commented Oct 12, 2017

benbovy commented Oct 12, 2017 • edited

rabernat commented Oct 12, 2017 via email

shoyer commented Oct 12, 2017

benbovy commented Oct 12, 2017

benbovy commented Oct 12, 2017

shoyer commented Oct 12, 2017

shoyer commented Oct 12, 2017

benbovy commented Oct 12, 2017

shoyer commented Oct 12, 2017

benbovy commented Oct 12, 2017

benbovy commented Oct 12, 2017

shoyer commented Oct 12, 2017

fmaussion commented Oct 13, 2017

benbovy commented Oct 13, 2017

benbovy commented Oct 13, 2017

fujiisoup commented Oct 13, 2017

benbovy commented Oct 13, 2017 • edited

shoyer commented Oct 13, 2017

fujiisoup commented Oct 13, 2017

benbovy commented Oct 13, 2017

spencerahill commented Oct 13, 2017

shoyer commented Oct 29, 2017

benbovy commented Nov 1, 2017

shoyer commented Nov 1, 2017

fmaussion commented Nov 1, 2017

benbovy commented Nov 7, 2017

benbovy commented Nov 13, 2017

shoyer commented Nov 13, 2017 • edited

shoyer commented Nov 13, 2017

benbovy commented Nov 13, 2017

jhamman commented Jul 13, 2018

benbovy commented Jul 14, 2018

jhamman commented Jul 16, 2018

mrocklin commented Jun 30, 2019

rabernat commented Oct 24, 2019

benbovy commented Oct 11, 2017 •

edited

benbovy commented Oct 12, 2017 •

edited

benbovy commented Oct 13, 2017 •

edited

shoyer commented Nov 13, 2017 •

edited