
Update contour-utils.js normalizeSeries to do filtering #202

Closed
wants to merge 2 commits

Conversation

wasbridge
Contributor

Filters data down to a desired number of points (configurable with a new data object: {filter: true, filterNumPts: 1000}).

Always keeps the first and last points; for everything in between, it grabs the high/low per interval.
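
For illustration, a minimal sketch of that kind of min/max-per-interval downsampler (assuming {x, y} data points; the names and interval math here are illustrative, not the exact code in this PR):

// Sketch: keep first/last, bucket the middle points into intervals and keep
// each interval's lowest and highest y value in their original order.
function minMaxDownsample(data, numPts) {
    if (!data || data.length <= numPts) return data;

    var first = data[0];
    var last = data[data.length - 1];
    var middle = data.slice(1, data.length - 1);
    // each interval contributes up to 2 points (its min and its max)
    var numIntervals = Math.max(1, Math.floor((numPts - 2) / 2));
    var intervalSize = Math.ceil(middle.length / numIntervals);
    var filtered = [first];

    for (var i = 0; i < middle.length; i += intervalSize) {
        var bucket = middle.slice(i, i + intervalSize);
        var lo = bucket[0];
        var hi = bucket[0];
        bucket.forEach(function (d) {
            if (d.y < lo.y) lo = d;
            if (d.y > hi.y) hi = d;
        });
        if (lo === hi) {
            filtered.push(lo);
        } else if (bucket.indexOf(lo) < bucket.indexOf(hi)) {
            filtered.push(lo, hi);
        } else {
            filtered.push(hi, lo);
        }
    }

    filtered.push(last);
    return filtered;
}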

Billy Schoenberg added 2 commits February 6, 2015 11:31
…ased on desired number of pts

Conflicts:
	dist/contour.js
	dist/contour.min.js
	dist/contour.min.js.map
@wasbridge
Contributor Author

Anything you want me to do to make it easier to merge this? I can revert the /dist folder if you need.

@jaimedp
Contributor

jaimedp commented Feb 10, 2015

Hi Billy, I'm thinking about what would be the correct way to incorporate the point simplification into the data workflow. I'm not sure about 'losing' the data points on normalization, because the next visualization may actually use the full data set to display other characteristics of the data. That's why I liked the idea of filtering the data that you will be rendering, not the original data set.

Also, I think we should provide a way to specify or pass in different filtering/data simplification functions so the user decides what's the best way to simplify the data.

What do you think?

PS. Yes, we also need to remove the /dist from the PR :)

@wasbridge
Contributor Author

I definitely agree with the ability to pass a custom filtering function.

As for where to 'lose' the data, it depends on whether we want all visualizations to have the same data or not. Think of a visualization that displays the mean, median, mode, min, max, etc. Do we want those stats to be based on the full data set or the filtered set? If you answer the full set, then we need to filter per visualization; if you answer the filtered set, we should do it in normalization.

My answer after thinking about it is per visualization. Do you agree?


@jaimedp
Contributor

jaimedp commented Feb 11, 2015

Yes, I think the simplification should be per visualization without modifying the original data set. How about something like this:

  • We add a preprocess function to each visualization's config object (by default it's a noop, returning the same data set)
  • We provide a set of simplification functions (we can start with only one) on the _.nw namespace.

Then you would use it something like this:

new Contour(..)
  .cartesian()
  .line(myBidDataSet, { preprocess: _.nw.simplifications.minMaxFilter(1000) })
  .statsVisualization(myBidDataSet) // <- this gets original data set
  .render()

The minMaxFilter(1000) call would return a simplification function configured for 1000 points.

Then, inside the line visualization, we would have:

var data = options.preprocess(raw);

What do you think? (I'm not crazy about simplifications as the name for the namespace.)
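
For illustration, a minimal sketch of that shape, using the names proposed in this comment (the namespace and function names are still just proposals, and minMaxDownsample refers to the sketch earlier in this thread):

// Sketch: minMaxFilter(n) is a factory that returns a preprocess function
// already configured for n points, so the visualization only ever calls
// options.preprocess(raw) without knowing the filter settings.
_.nw = _.nw || {};
_.nw.simplifications = {
    minMaxFilter: function (numPts) {
        return function (data) {
            return minMaxDownsample(data, numPts);
        };
    }
};

// default preprocess is a noop that hands back the data set unchanged
var defaults = {
    preprocess: function (data) { return data; }
};

// inside the visualization:
// var data = options.preprocess(raw);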

@wasbridge
Contributor Author

I like the idea of a preprocess function per viz

How about we call it a dataFilter?

Then the filter I coded could be bound to _.nw.dataFilters.minMaxFilter
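
For illustration, under that renamed namespace the earlier example would read something like this (same assumed API as above, only the namespace changes):

new Contour(..)
  .cartesian()
  .line(myBidDataSet, { preprocess: _.nw.dataFilters.minMaxFilter(1000) })
  .statsVisualization(myBidDataSet) // <- still gets the original data set
  .render()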


@jaimedp
Contributor

jaimedp commented Feb 17, 2015

Sorry for the delay,

dataFilters sounds good

@wasbridge
Contributor Author

Closing in favor of the optimization2 pull request.

@wasbridge closed this Mar 5, 2015