Add example app: pivot chart maker #5894

mmowers · 2017-02-20T01:49:01Z

This bokeh app creates pivot charts from data, similar to Excel's pivot chart functionality, but with the additional ability to explode into multiple pivot charts.

fixes New App Example: Exploding Pivot Charts #5640
no tests added

bryevdv · 2017-02-20T21:50:07Z

Hi @mmowers thanks for the PR! Regarding the CI test failures there are just a few edits needed to satisfy the linter:

>       assert len(errors) == 0, "Code quality issues:\n%s" % "\n".join(errors)
E       AssertionError: Code quality issues:
E         File contains trailing whitespace: examples/app/pivot/README.md, line 41.
E         File contains trailing whitespace: examples/app/pivot/README.md, line 55.
E         File does not end with a newline: examples/app/pivot/README.md, line 82
E         File does not end with a newline: examples/app/pivot/downloads/.gitignore, line 4
E         File does not end with a newline: examples/app/pivot/main.py, line 377
E         File does not end with a newline: examples/app/pivot/templates/index.html, line 19
E         File does not end with a newline: examples/app/pivot/templates/scripts.js, line 62
E         File does not end with a newline: examples/app/pivot/templates/styles.css, line 79
E         File starts with more than 1 empty line: examples/app/pivot/theme.yaml, line 1
E       assert 9 == 0
E        +  where 9 = len(['File contains trailing whitespace: examples/app/pivot/README.md, line 41.', 'File contains trailing whitespace: exam...pp/pivot/main.py, line 377', 'File does not end with a newline: examples/app/pivot/templates/index.html, line 19', ...])

bryevdv · 2017-02-20T21:51:10Z

examples/app/pivot/theme.yaml

@@ -0,0 +1 @@
+


If there aren't any customizations made in this file, it can just be omitted entirely.

Thanks, done.

bryevdv · 2017-02-20T21:57:03Z

examples/app/pivot/downloads/.gitignore

+# Ignore everything in this directory
+*
+# Except this file
+!.gitignore


This seems unnecessary

I have this gitignore file because users are able to download csv files to this folder.

bryevdv · 2017-02-20T21:58:19Z

examples/app/pivot/main.py

+    wdg['y_major_label_size'].on_change('value', update_sel)
+    wdg['circle_size'].on_change('value', update_sel)
+    wdg['bar_width'].on_change('value', update_sel)
+    wdg['line_width'].on_change('value', update_sel)


since these are all getting the same callback, it might be a little more economical in terms of code to use a loop, somethign like:

for name in widget_names: wdg[name].on_change('value', update_sel)

Thanks, done.

bryevdv · 2017-02-20T22:00:51Z

examples/app/pivot/csv/electricity generation.csv

+Modeled,ID,gas,2024,2753007.082
+Modeled,ID,gas,2026,2782479.991
+Modeled,ID,gas,2028,2917963.431
+Modeled,ID,gas,2030,


This may be a bit large to check into the repo directly. We should consider putting it in bokeh.sampledata and using it from there (and using the mtcars data from there as well)

I've reduced the size of this csv to 6 KB, and removed cars.csv. Let me know if that is sufficient.

bryevdv · 2017-02-20T22:01:28Z

examples/app/pivot/main.py

+PLOT_HEIGHT = 300
+PLOT_FONT_SIZE = 10
+PLOT_AXIS_LABEL_SIZE = 8
+PLOT_LABEL_ORIENTATION = 45


These could go in theme.yaml I think

These variables are just used as defaults for the widgets. But the styling itself is coming from the widget values, not directly from these variables. Let me know if you have another idea of how to do this. In the meantime, I've removed theme.yaml.

bryevdv · 2017-02-20T22:03:49Z

I've left some initial review comments. I think there are some other things we can do like add a screenshot and link to the top level README

mmowers · 2017-02-23T07:11:58Z

Thanks @bryevdv! I've added a commit to address your comments.

bryevdv · 2017-02-23T15:13:42Z

@mmowers getting closer! First, there was an upstream dependency change that caused our unit tests to all start failing. You will need merge/rebase on master to get those green again.

After your lastest commit, there are still some (new) linter issues to fix:

__________________________________ test_files __________________________________
    @pytest.mark.quality
    def test_files():
        errors = collect_errors()
>       assert len(errors) == 0, "Code quality issues:\n%s" % "\n".join(errors)
E       AssertionError: Code quality issues:
E         File starts with more than 1 empty line: examples/app/pivot/theme.yaml, line 1
E       assert 1 == 0
E        +  where 1 = len(['File starts with more than 1 empty line: examples/app/pivot/theme.yaml, line 1'])
tests/test_code_quality.py:90: AssertionError
_________________________________ test_flake8 __________________________________
    @pytest.mark.quality
    def test_flake8():
        chdir(TOP_PATH)
    
        proc = Popen(["flake8"], stdout=PIPE, stderr=PIPE)
        out, err = proc.communicate()
    
>       assert proc.returncode == 0, "Flake8 issues:\n%s" % out.decode("utf-8")
E       AssertionError: Flake8 issues:
E         ./examples/app/pivot/main.py:57:166: E501 line too long (166 > 165 characters)
E         ./examples/app/pivot/main.py:60:166: E501 line too long (178 > 165 characters)
E         ./examples/app/pivot/main.py:80:166: E501 line too long (178 > 165 characters)
E         ./examples/app/pivot/main.py:321:166: E501 line too long (168 > 165 characters)
E         
E       assert 1 == 0
E        +  where 1 = <subprocess.Popen object at 0x7f95b4b9a9b0>.returncode
tests/test_flake8.py:16: AssertionError

I do think we need to figure out what to do about the large CSV. One option would be to have a download script or function that will fetch the CSV files the first time the app is run (or make a warning that instructs how to download them)

Also I think my comment about the README was too vague, apologies. I think the app dir should still have a standalone readme (i.e. not linking to an external one). I was suggesting that your new app be added and linked from the README one level up in examples/app: https://github.com/bokeh/bokeh/tree/master/examples/app

mmowers · 2017-02-24T04:16:59Z

Thanks @bryevdv! I'll try to get these changes done tomorrow. One clarification before I do: I updated the csv file and now it is 6 KB. I'm not sure you saw that because the comment is collapsed as it is associated with an outdated file. I'm more than willing to add a function to automatically fetch the csv when the app is initially loaded, but wanted to make sure you were aware of its updated size before I make the change. Thanks again!

mmowers · 2017-02-26T00:32:20Z

Hi @bryevdv , I've pushed some updates. The csv file is now down to 2K, so hopefully that's small enough (let me know if it isn't). For the examples/app readme, the png needs to be uploaded to http://bokeh.pydata.org/static/. I have the PNG here: https://github.com/mmowers/superpivot/blob/master/pivot.PNG. Thanks again!

…dd helpful docstrings.

bryevdv · 2017-02-27T20:59:02Z

@mmowers It think 2k is probably an OK size. I will upload the image to the pydata site this week. It looks like the test failure was spurious, so I've restarted it. Apart from that I just want to have a change to check out the branch and run the example directly, which I should be able to get to in the next few days.

mmowers · 2017-02-28T00:37:23Z

@bryevdv ,
Thanks, great to hear! Let me know if you have any questions.

bryevdv · 2017-03-02T16:46:02Z

Hi @mmowers I am overwhelmed with some other tasks right now. If you can resize the image to be the same width as the existing thumbnails in the apps README that would be a huge help actually.

mmowers · 2017-03-03T00:06:40Z

Hi @bryevdv ,
I've updated the width to 300px.
Let me know anything else I can do to help.
Thanks!

bryevdv · 2017-03-03T15:36:23Z

Hi @mmowers I have uploaded the thumbnail to this location:

http://bokeh.pydata.org/static/pivot_t.png

I added the _t to the filename to be consistent with the other thumbnails that are already there.

I also checked out the branch and ran it locally. It's very cool! In general my main observation is that it does not actively prevent unusable combinations of parameters, and when a user makes a selection that is not reasonable, then the only indication is an exception printed in the console.

Here are some observations/suggestions, in no particular order:

Be opinionated at the start. Pick the first and second columns to be x and y axes and start by showing that chart. I think seeing a chart right away will help people understand what's in front of them more quickly.
Since x-axis and y-axis are required, don't make them collapsible. Seeing some controls available right away will also help orient new users.
Label or somehow indicated whether the columns in the dropdown are categorical or numeric
A header with the app name and a brief description and brief instructions about what to do / what can be done.
Actively manage the dropdown list options. It's possible to update the available options so that "unreasonable" options don't show up. E.g. if a column is selected in for the x-axis, it should be removed from the dropdown for y-axis. Check out the one-line nix fuction from this example:

https://github.com/bokeh/bokeh/blob/master/examples/app/stocks/main.py

Even just doing this for x- and y- to start would go a very long way, but I can imagine also updating things so that e.g. if a column is selected to be exploded, it's not also in the stacked dropdown.
Maybe a default data set with more columns? I know I was harping about data file size. I think we could go up to 10-15k if necessary. But that space is better spent in additional dimensions rather than additional data points. Another reason it seemed quick to run in to "unreasonable" situations was that there were not many columns to choose from.
The splitting colors seem too close. It was hard for me to tell there were different shades of blue when I grouped or separated by series. I'd suggest defaulting to a palette that has lots of different and distinguishable hues (maybe Spectral?) Ultimately it might be nice to make the palette configurable.
Linked panning/selection brushing across exploded plots
Improving plot type heuristics. This might be harder. There are some combinations that don't currently work well. E.g. Area plot type with grouped axis. These could be disallowed, or handled more careful. Later Bokeh core work on nested coordinate systems (for groupings) will probably help this.

I'm definitely not saying all these things need to happen before this example is merged, but I wanted to at least get a record of them. However, we are shooting for the 20th for a release, so if you have any more time to work on this in the next ~2 weeks, I think getting a few of these changes in would add alot of polish. Understand completely if you do not have bandwidth over that time frame. Let me know what you think!

out of date

mmowers · 2017-03-05T05:39:42Z

Thanks much @bryevdv for the suggestions! I think I can knock a few of these out before the 20th.

I'm starting with active management of the dropdown list options. In addition to preventing the same column selections across x, x_group, y, series, explode, and explode_group, I'll also remove the series stacking widget and assume stacking by default for area and bar charts, but not for dot and line charts. I think will remove a fair amount of confusion for the user. Sound good?
I'll also add a header and look into changing the color palette. It would be great if I could make palette dynamic to the number of series via some equation, but still not be ugly... Any recommendations on that?
I'd like to allow x-axis and y-axis to be collapsible to save space, but I can initially open them up. How does that sound?
I'm hesitant to pick axes for the user if they're loading their own data, but for the default data I can load an initial widget configuration. So when someone initially fires up the app, charts would be shown. How does that sound?
On the data set and having more columns, I'm definitely open to using a different dataset. Do you have one that you think would work well? I really wanted to find exit polling data for the 2016 presidential election, similar to https://www.nytimes.com/interactive/2016/11/08/us/politics/election-exit-polls.html, where columns would be any number of demographic factors (age range, race, gender, income range, education, state, etc.), and two columns for the result (candidate supported, number of votes). Something like that would be really awesome to visualize, I think. Problem is, you have to pay for this data I think (from Edison Research) and we probably couldn't publish it.

Also wondering, do you or any other Bokeh maintainers have any performance improvement suggestions? I'm wondering if you can see offhand anywhere that the code is significantly under-performing its potential.

Thanks again!

bryevdv · 2017-03-05T19:21:42Z

That plan sounds great I think that will go a long way!
Bokeh ships with a number of "categorical" palettes, up to 20 colors. I'd say it would be fine to just pick one on the assumption that there won't be more than the palette size number of stackings, etc.
Sounds good as well
I think picking initial axes only for the default data set is also fine.
I don't know a great dataset offhand. There's "autompg" / "mtcars" which is probably familiar but maybe not so interesting. Another idea might some of the Gapminder data sets (or some subset). If there's an especially interesting data set that costs, it might be possible to procure funds to purchase it (up to a point of course) but it would need to be licensed to distribute for that to make sense (so, maybe not likely)

mmowers · 2017-03-11T03:37:56Z

Hi @bryevdv , I've made the changes we agreed on. Regarding colors, I'm actually using colors from a Spectral palette in bokeh. To make them more clear, I increased opacity from 0.6 to 0.8. Are distinctions clearer now?

Let me know what you think!

bryevdv · 2017-03-11T03:49:03Z

@mmowers I fetched the branch and took a quick look and the changes are pretty fantastic at a glance. I will take a closer look over the weekend. I don't expect to have many if any comments, thanks for the great example!

bryevdv · 2017-03-11T03:53:02Z

@mmowers on master I get errors like this when I try to change to a new csv:

2017-03-10 21:52:23,313 error handling message Message 'PATCH-DOC' (revision 1): RuntimeError('Cannot apply patch to 8a55e40a-99b2-45ca-a723-5ac408cc3a3e which is not in the document',)

is this working for you?

Edit: seem to get them with 0.12.4 too.

Edit 2: Oh, it's because there are no default axes selected.

bryevdv · 2017-03-11T03:59:53Z

Not for now, but a future improvement would be to handle datetime dimensions better, e.g. not turn every date into a categorical coordinate:

mmowers · 2017-03-11T16:58:28Z

@bryevdv , thanks for the datetime note. I haven't yet viewed a csv with dates.

On the error, I noticed it before too, but it didn't seem to prevent anything from working. However, with a large enough data set with many plots that I'm switching out, I'll experience some pretty significant slowdowns during which those errors will continually spawn for a few seconds.

I've added to the instructions at the top of the page, including a note that users need to select x-axis and y-axis after they switch a data source.

I also cleaned up a flake issue and it looks like all checks are good now.

Thanks again!

bryevdv · 2017-03-11T18:47:07Z

@mmowers OK looks good, I only have one more small ask. The para at the top is pretty monolithic. Do you mind deleting the "Instructions:" at the beginning (it seems evident to me at least that they are instructions without that reminder), and splitting the rest into 2 or 3 paragraphs?

mmowers · 2017-03-11T19:12:07Z

@bryevdv,
Done, thanks!

bryevdv · 2017-03-14T15:27:10Z

Going ahead and merging now. FYI this will fail when it is merged due to an unrelated problem (scipy.org is down causing our docs build to fail, this will resolve itself when the scipy.org site is restored.)

Thanks @mmowers !

Add example app pivot chart maker

dd53204

mmowers mentioned this pull request Feb 20, 2017

New App Example: Exploding Pivot Charts #5640

Closed

bryevdv previously requested changes Feb 20, 2017

View reviewed changes

Address linter issues and @bryevdv review comments

57c7207

Matt Mowers added 6 commits February 25, 2017 18:52

Remove theme.yaml

ad96be7

Adjust formatting to pass flake8

18b0add

Add back in full README

5f2ac84

Reduce example csv to 2K

9fa730e

Update README for pivot app, and add to general app README

ddc262e

Merge remote-tracking branch 'upstream/master' into pivot

758b7c9

Matt Mowers added 3 commits February 26, 2017 13:54

Fix Travis CI flake_docs formatting issues.

ee2127c

Refactor functions for better modularity and understandability, and a…

7284cc0

…dd helpful docstrings.

Merge remote-tracking branch 'upstream/master' into pivot

1f1f676

bryevdv added tag: component: examples type: task labels Mar 8, 2017

bryevdv added this to the 0.12.5 milestone Mar 8, 2017

Matt Mowers added 2 commits March 10, 2017 22:28

Merge remote-tracking branch 'upstream/master' into pivot

c8d5d66

Make changes agreed with @bryevdv on pull request

0935a58

Fix flake issues and add to instructions at top of page.

4dbd7af

Make intro paragraph less monolithic

06bcb8a

bryevdv merged commit f8e259e into bokeh:master Mar 14, 2017

bryevdv added the reso: completed label Mar 14, 2017

bryevdv added status: accepted and removed reso: completed tag: component: examples type: task labels Mar 22, 2017

bryevdv removed this from the 0.12.5 milestone Mar 22, 2017

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add example app: pivot chart maker #5894

Add example app: pivot chart maker #5894

mmowers commented Feb 20, 2017

bryevdv commented Feb 20, 2017 •

edited

Loading

bryevdv Feb 20, 2017

mmowers Feb 23, 2017

bryevdv Feb 20, 2017

mmowers Feb 23, 2017

bryevdv Feb 20, 2017

mmowers Feb 23, 2017

bryevdv Feb 20, 2017

mmowers Feb 23, 2017

bryevdv Feb 20, 2017

mmowers Feb 23, 2017

bryevdv commented Feb 20, 2017

mmowers commented Feb 23, 2017

bryevdv commented Feb 23, 2017

mmowers commented Feb 24, 2017

mmowers commented Feb 26, 2017

bryevdv commented Feb 27, 2017

mmowers commented Feb 28, 2017

bryevdv commented Mar 2, 2017 •

edited

Loading

mmowers commented Mar 3, 2017

bryevdv commented Mar 3, 2017 •

edited

Loading

mmowers commented Mar 5, 2017

bryevdv commented Mar 5, 2017

mmowers commented Mar 11, 2017

bryevdv commented Mar 11, 2017

bryevdv commented Mar 11, 2017 •

edited

Loading

bryevdv commented Mar 11, 2017 •

edited

Loading

mmowers commented Mar 11, 2017

bryevdv commented Mar 11, 2017 •

edited

Loading

mmowers commented Mar 11, 2017

bryevdv commented Mar 14, 2017

Add example app: pivot chart maker #5894

Add example app: pivot chart maker #5894

Conversation

mmowers commented Feb 20, 2017

bryevdv commented Feb 20, 2017 • edited Loading

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

bryevdv commented Feb 20, 2017

mmowers commented Feb 23, 2017

bryevdv commented Feb 23, 2017

mmowers commented Feb 24, 2017

mmowers commented Feb 26, 2017

bryevdv commented Feb 27, 2017

mmowers commented Feb 28, 2017

bryevdv commented Mar 2, 2017 • edited Loading

mmowers commented Mar 3, 2017

bryevdv commented Mar 3, 2017 • edited Loading

mmowers commented Mar 5, 2017

bryevdv commented Mar 5, 2017

mmowers commented Mar 11, 2017

bryevdv commented Mar 11, 2017

bryevdv commented Mar 11, 2017 • edited Loading

bryevdv commented Mar 11, 2017 • edited Loading

mmowers commented Mar 11, 2017

bryevdv commented Mar 11, 2017 • edited Loading

mmowers commented Mar 11, 2017

bryevdv commented Mar 14, 2017

bryevdv commented Feb 20, 2017 •

edited

Loading

bryevdv commented Mar 2, 2017 •

edited

Loading

bryevdv commented Mar 3, 2017 •

edited

Loading

bryevdv commented Mar 11, 2017 •

edited

Loading

bryevdv commented Mar 11, 2017 •

edited

Loading

bryevdv commented Mar 11, 2017 •

edited

Loading