Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

"All-column" vis when only few columns in dataframe #199 #336

Merged
merged 45 commits into from
Apr 18, 2021

Conversation

caitlynachen
Copy link
Contributor

Overview

Current-vis: "All-column" vis when only few columns in dataframe #199

Changes

If dataframe has only 2 or 3 columns, use current_vis to show an all-column visualization

Example Output

Screen Shot 2021-03-31 at 11 57 51 AM
Screen Shot 2021-03-31 at 11 57 35 AM

@caitlynachen caitlynachen changed the title Allcol "All-column" vis when only few columns in dataframe #199 Mar 31, 2021
@codecov
Copy link

codecov bot commented Mar 31, 2021

Codecov Report

Merging #336 (1b7cf75) into master (084cf77) will increase coverage by 0.04%.
The diff coverage is 100.00%.

Impacted file tree graph

@@            Coverage Diff             @@
##           master     #336      +/-   ##
==========================================
+ Coverage   84.39%   84.44%   +0.04%     
==========================================
  Files          51       51              
  Lines        3891     3902      +11     
==========================================
+ Hits         3284     3295      +11     
  Misses        607      607              
Impacted Files Coverage Δ
lux/action/enhance.py 100.00% <ø> (ø)
lux/action/filter.py 91.66% <100.00%> (ø)
lux/core/frame.py 81.50% <100.00%> (+0.47%) ⬆️
lux/vis/Vis.py 75.59% <100.00%> (+0.14%) ⬆️
lux/vislib/matplotlib/ScatterChart.py 76.47% <100.00%> (ø)

Continue to review full report at Codecov.

Legend - Click here to learn more
Δ = absolute <relative> (impact), ø = not affected, ? = missing data
Powered by Codecov. Last update 084cf77...1b7cf75. Read the comment docs.

@dorisjlee
Copy link
Member

dorisjlee commented Apr 3, 2021

Hi @caitlynachen, Thanks for the PR! The all-column visualization looks much better than the recommendations that we had before! This is especially true when we filter to only 2 or 3 columns in a dataframe. Here are some examples of this:

image
image

I was playing around with this feature and had a couple of suggestions on what we can do to improve this:

  • We might want to modify the text under the Current Visualization and title Current Visualization to say that the visualization is not based on the current intent, but based on all the columns in the current dataframe (Dataframe Visualization:based on all columns in the dataframe, with the text on "all columns" highlightable showing the columns involved.)

  • What happens when there is an intent set and there is an actual Current Visualization, in this case, do we hide the all-column vis? or show it as a separate tab?
    image

  • You might want to look into some of the failing tests, which is related to calling the print displays

Copy link
Member

@jinimukh jinimukh left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Great idea! Maybe we can extend this feature to the entire dataframe and set a global config like small_dataframe_ok or show_all_data_columns depending on the use-case you were thinking of!

lux/core/frame.py Show resolved Hide resolved
lux/core/frame.py Outdated Show resolved Hide resolved
lux/core/frame.py Outdated Show resolved Hide resolved
@dorisjlee dorisjlee merged commit d6cca26 into lux-org:master Apr 18, 2021
dorisjlee added a commit that referenced this pull request Apr 30, 2021
…ales (#262)

* Add support to improve temporal action to display different timescales

* Resolve PR comments

* Add support to improve temporal action to display different timescales

* Resolve PR comments

* Reformat files using black

* "All-column" vis when only few columns in dataframe #199 (#336)

Co-authored-by: Caitlyn Chen <caitlynachen@berkeley.edu>
Co-authored-by: Doris Lee <dorisjunglinlee@gmail.com>

* documentation and cleaning
* added notebook gallery
* update README
* removed scatterplot message in SQLExecutor
* fixed typo in SQL documentation

* update README and bump version

* bump version

* clear propagated vis data intent after PandasExecutor completes execute (#297)

* fix black to stable version

* Scalability: incorporate early pruning optimizations (#368)

* changes from perf branch to config
* added flag for turning on/off lazy maintain optimization

* merged in approx early pruning code

* increase overall sampling start and cap

* Adjust width and length criteria for early pruning vislist based on experiment results; Add warning message and test for early pruning

* black version update

* version lock on black

* * fixed sql tests (added approx to execute constructor)
* fixed sampling config test
* improved Executor documentation

* timescale feature
* adding weekday
* adding docs
* bugfix for y axis line chart export
* fixing temporal axis by adding timescale variable in Clause

Co-authored-by: Doris Lee <dorisjunglinlee@gmail.com>
Co-authored-by: Caitlyn Chen <caitlynachen@gmail.com>
Co-authored-by: Caitlyn Chen <caitlynachen@berkeley.edu>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

None yet

3 participants