Fixed range #37

theogf · 2021-12-07T16:17:01Z

It is made clear from the docs that once a range has been fixed, there is no possibility to change it later.
Is it because of the design of the algorithm or because the functionality is simply lacking?
Could an approximation be made to go from one range to the next (given some assumptions) ?

joshday · 2021-12-07T16:39:23Z

Is it because of the design of the algorithm or because the functionality is simply lacking?

A little of both. If you change the range (which defines bin locations), the counts associated with the bins no longer make sense (unless you take special care in aligning the new edges with the old ones). This is what I've done with OnlineStats.ExpandingHist. I've also moved some of the ash functionality to OnlineStats so that you can do:

o = Ash(ExpandingHist(400))

Here's the docs that explains the details:

  ExpandingHist(nbins)

  An adaptive histogram where the bin edges keep doubling in size in order to
  contain every observation. nbins must be an even number. Bins are
  left-closed and the rightmost bin is closed, e.g.

    •  [a, b), [b, c), [c, d]

  Example
  ≡≡≡≡≡≡≡≡≡

  o = fit!(ExpandingHist(200), randn(10^6))

  using Plots
  plot(o)

  Details
  ≡≡≡≡≡≡≡≡≡

  How ExpandingHist works is best understood through example. Suppose we start
  with a histogram of edges/counts as follows:

  |1|2|5|3|2|

    •  Now we observe a data point that is not contained in the bin
       edges:

  |1|2|5|3|2|       *

    •  In order to contain the point, the range of the edges doubles in
       the direction of the new data point and adjacent bins merge their
       counts:

  |1|2|5|3|2|       *
   \ / \ / \ /      ↓
    ↓   ↓   ↓       ↓
  | 3 | 8 | 2 | 0 | 1 |

    •  Note that multiple iterations of bin-doubling may occur until the
       new point is contained by the bin edges.

theogf · 2021-12-07T16:48:47Z

Thanks, I did not follow these updates! That looks great!

joshday closed this as completed Dec 7, 2021

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fixed range #37

Fixed range #37

theogf commented Dec 7, 2021

joshday commented Dec 7, 2021

theogf commented Dec 7, 2021

Fixed range #37

Fixed range #37

Comments

theogf commented Dec 7, 2021

joshday commented Dec 7, 2021

theogf commented Dec 7, 2021