Feature/changefinder1d #333

myui · 2016-09-03T08:31:41Z

Revising the PR of #305 by @L3Sota

…etected. Removed -dim option accordingly. Also changed some initial parameter values and minutiae in ChangeFinderUDFTest, and fixed a typo in MathUtils.

Implemented multiple passes for multidimensional covariances to ensure they are full-rank. Changed internals of solving the system of equations from Cholesky Decomposition to LU Decomposition (However, after the multiple changes introduced in this commit, Cholesky Decomposition may be viable again.)

- Pretty comments describing the computation process in more detail. - Using the initial values of the data to break in data structures instead of rand. (Random generation tends to unpredictably reduce detection accuracy in the beginning.) - Added a moving average for change-point scores to improve detection accuracy. - Refactored tests

The Java 7 Double class doesn't have a isFinite(double) method. Using isNaN(double) instead.

…ivemall into L3Sota-feature/cf_sdar_focused

feature/changefinder1d

* Implementation refers to MATLAB's `arburg()` function * Test cases come from outputs of Octave's `arburg()` function

* Replace `-A` with `A` in SDAR1D * Update corresponding test cases

Implement `arburg()` and some minor fixes on MatrixUtils

Support Hellinger-distance-based scoring both on SDAR 1D and 2D

myui · 2016-09-03T08:38:22Z

core/src/main/java/hivemall/anomaly/SDAR2D.java

+         */
+        RealMatrix[][] rhs = MatrixUtils.toeplitz(C, k);
+        RealMatrix[] lhs = Arrays.copyOfRange(C, 1, k + 1);
+        RealMatrix R = MatrixUtils.flatten(rhs);


@L3Sota Are you expecting each C_i as a submatrix and flatting a matrix of matrix to a matrix?

R will not become a Toeplitz Matrix then and it's hard to solve A by LUDecomposition.

@myui Yes, that's expected. I didn't know non-Toeplitz makes LU decomp so difficult at the time of writing, though :(

Small points: the comments from L116-129 seem to be a little off regarding style, not too important:

In the Yule-Walker equations, i and j are 1..k, but in the visualisation of the equation the C and A vector-of-matrices' run from 1..n

The A vector-of-matrices should be horizontal

Both C'_i and C_i' are used

myui · 2016-09-03T09:31:21Z

Is this expected behavior for combineMatrix() (i.e., flatten() )?

    @Test
    public void testFlatten2d() {
        RealMatrix[] m1 = new RealMatrix[] { new Array2DRowRealMatrix(new double[] {1, 1.1}), new Array2DRowRealMatrix(new double[] {2, 2.2}), new Array2DRowRealMatrix(new double[] {3, 3.3})}; 
        // {1.0,1.1}
        // {2.0,2.2}
        // {3.0,3.3}
        RealMatrix[][] toeplitz1 = MatrixUtils.toeplitz(m1, 3);
        // {1.0,1.1}  {2.0,2.2}' {3.0,3.3}'
        // {2.0,2.2}  {1.0,1.1}  {2.0,2.2}'
        // {3.0,3.3}  {2.0,2.2}  {1.0,1.1}
        RealMatrix flatten1 = MatrixUtils.flatten(toeplitz1, 2);
        // 1.0 0.0 2.0 2.2 3.0 3.3
        // 1.1 0.0 0.0 0.0 0.0 0.0
        // 2.0 0.0 1.0 0.0 2.0 2.2
        // 2.2 0.0 1.1 0.0 0.0 0.0
        // 3.0 0.0 2.0 0.0 1.0 0.0
        // 3.3 0.0 2.2 0.0 1.1 0.0
    }

LUDecomposition assume a squared matrix.
http://commons.apache.org/proper/commons-math/userguide/linear.html

myui · 2016-09-03T15:13:15Z

>>> from scipy import linalg
>>> import numpy as np
>>> x = np.array([[1.0,1.1],[2.0,2.2],[3.0,3.3]])
>>> x
array([[ 1. ,  1.1],
       [ 2. ,  2.2],
       [ 3. ,  3.3]])
>>> linalg.toeplitz(x)
array([[ 1. ,  1.1,  2. ,  2.2,  3. ,  3.3],
       [ 1.1,  1. ,  1.1,  2. ,  2.2,  3. ],
       [ 2. ,  1.1,  1. ,  1.1,  2. ,  2.2],
       [ 2.2,  2. ,  1.1,  1. ,  1.1,  2. ],
       [ 3. ,  2.2,  2. ,  1.1,  1. ,  1.1],
       [ 3.3,  3. ,  2.2,  2. ,  1.1,  1. ]])
>>> x = np.matrix([[1.0,1.1],[2.0,2.2],[3.0,3.3]])
>>> x
matrix([[ 1. ,  1.1],
        [ 2. ,  2.2],
        [ 3. ,  3.3]])
>>> linalg.toeplitz(x)
array([[ 1. ,  1.1,  2. ,  2.2,  3. ,  3.3],
       [ 1.1,  1. ,  1.1,  2. ,  2.2,  3. ],
       [ 2. ,  1.1,  1. ,  1.1,  2. ,  2.2],
       [ 2.2,  2. ,  1.1,  1. ,  1.1,  2. ],
       [ 3. ,  2.2,  2. ,  1.1,  1. ,  1.1],
       [ 3.3,  3. ,  2.2,  2. ,  1.1,  1. ]])
>>> x = np.array([np.array([[1.0,1.1],[2.0,2.2]]),np.array([[3.0,3.3],[4.0,4.4]])])
>>> x
array([[[ 1. ,  1.1],
        [ 2. ,  2.2]],

       [[ 3. ,  3.3],
        [ 4. ,  4.4]]])
>>> linalg.toeplitz(x)
array([[ 1. ,  1.1,  2. ,  2.2,  3. ,  3.3,  4. ,  4.4],
       [ 1.1,  1. ,  1.1,  2. ,  2.2,  3. ,  3.3,  4. ],
       [ 2. ,  1.1,  1. ,  1.1,  2. ,  2.2,  3. ,  3.3],
       [ 2.2,  2. ,  1.1,  1. ,  1.1,  2. ,  2.2,  3. ],
       [ 3. ,  2.2,  2. ,  1.1,  1. ,  1.1,  2. ,  2.2],
       [ 3.3,  3. ,  2.2,  2. ,  1.1,  1. ,  1.1,  2. ],
       [ 4. ,  3.3,  3. ,  2.2,  2. ,  1.1,  1. ,  1.1],
       [ 4.4,  4. ,  3.3,  3. ,  2.2,  2. ,  1.1,  1. ]])
>>> y = x.flatten()
>>> y
array([ 1. ,  1.1,  2. ,  2.2,  3. ,  3.3,  4. ,  4.4])
>>> y.shape
(8,)

Fix transpose scheme of toeplitz().
https://github.com/scipy/scipy/blob/v0.14.0/scipy/linalg/special_matrices.py#L186
http://mathworld.wolfram.com/ToeplitzMatrix.html

Before:
                if (row < col) {
                    toeplitz[row][col] = c[col - row];
                } else if (row > col) {
                    toeplitz[row][col] = c[row - col].transpose();
                }
After:
                if (row < col) {
                    toeplitz[row][col] = c[col - row].transpose();
                } else if (row > col) {
                    toeplitz[row][col] = c[row - col];
                }

L3Sota · 2016-09-04T02:46:39Z

core/src/main/java/hivemall/utils/math/MatrixUtils.java

+     */
+    @Nonnull
+    public static RealMatrix[][] toeplitz(@Nonnull final RealMatrix[] c, final int dim) {
+        Preconditions.checkArgument(dim >= 1, "Invliad dimension: " + dim);


Invliad -> Invalid

myui · 2016-09-04T05:16:02Z

@L3Sota Thanks. Fixed typos.

myui · 2016-09-04T06:52:05Z

@L3Sota

I think it should be flattened as follows to be solved by LU decomposition:

>>> x = np.array([[1.0,1.1],[2.0,2.2],[3.0,3.3]])
>>> x
array([[ 1. ,  1.1],
       [ 2. ,  2.2],
       [ 3. ,  3.3]])
>>> linalg.toeplitz(x)
array([[ 1. ,  1.1,  2. ,  2.2,  3. ,  3.3],
       [ 1.1,  1. ,  1.1,  2. ,  2.2,  3. ],
       [ 2. ,  1.1,  1. ,  1.1,  2. ,  2.2],
       [ 2.2,  2. ,  1.1,  1. ,  1.1,  2. ],
       [ 3. ,  2.2,  2. ,  1.1,  1. ,  1.1],
       [ 3.3,  3. ,  2.2,  2. ,  1.1,  1. ]])

myui · 2016-09-06T13:11:40Z

5 dimenasional input of @L3Sota 's test

Hyperparameters used: K=10, T1=10, T2=10, r1=0.01, r2=0.01, lossFunc1=logloss, lossFunc2=lossloss

set ytics nomirror
set y2tics
set y2r[-5:60]
set style fill transparent solid 0.3 noborder

plot \
  for [i=7:11:1] \
    "sota5d.dat" using 1:(sum [col=i:11] column(col)) \
    title columnheader(i) with filledcurves y1=0 axes x1y1, \
  "sota5d.dat" using 1:12 with lines title "outlier" axes x1y2, \
  "sota5d.dat" using 1:13 with lines title "change" axes x1y2

myui · 2016-09-06T13:12:57Z

@L3Sota @takuti merged into master. Thank you for the contributions!

coveralls · 2016-09-06T13:39:13Z

Changes Unknown when pulling 622a793 on feature/changefinder1d into * on master*.

myui and others added 27 commits June 17, 2016 07:59

Changed the behavior of binarize_label

a782c8e

Added ChangeFinder with tests

f2fe4a5

Fixed detection/training execution order; input dimensions now auto-d…

2c0dff1

…etected. Removed -dim option accordingly. Also changed some initial parameter values and minutiae in ChangeFinderUDFTest, and fixed a typo in MathUtils.

Fixed computation logic, added new datafile

ba88534

Miscellaneous code updates

4fbb321

WIP: Removed regular AR elements, switched to Cholesky Decomposition

a27126b

Changed NaN recovery (Java 7 support)

36990d1

The Java 7 Double class doesn't have a isFinite(double) method. Using isNaN(double) instead.

Added DoubleRingBuffer class

6a06025

Added missing license header

88a87ba

Initial commit to support ChangeFinder

ec073f0

Merge branch 'feature/cf_sdar_focused' of https://github.com/L3Sota/h…

edf1cdf

…ivemall into L3Sota-feature/cf_sdar_focused

Merge branch 'L3Sota-feature/cf_sdar_focused' into

546e86e

feature/changefinder1d

Fixed bugs in A initialization and else

6dcd4d7

Fixed bugs e.g., on Toeplitz matrix initialization

6ab23ce

Updated unit tests for ChangeFinder

b761d0b

Avoided creating a local variable

e9208ed

Updated reference in Javadoc

45ab82c

Implement arburg() method for 1D data points

418e2d5

* Implementation refers to MATLAB's `arburg()` function * Test cases come from outputs of Octave's `arburg()` function

Return -A as a result of AR model parameter estimation

a1f6d63

* Replace `-A` with `A` in SDAR1D * Update corresponding test cases

Fix an incorrect test case for toeplitz()

55e7b81

Merge pull request #329 from takuti/arburg

6ec9ac1

Implement `arburg()` and some minor fixes on MatrixUtils

Support Hellinger-distance-based scoring on SDAR1D

c77f7a6

Support Hellinger-distance-based scoring on SDAR2D

7a94c70

Merge pull request #330 from takuti/support-hellinger-distance

9db88f1

Support Hellinger-distance-based scoring both on SDAR 1D and 2D

Added a 3 dimensional x test

7914aea

myui added pullrequest WIP labels Sep 3, 2016

myui added this to the v0.4 milestone Sep 3, 2016

myui self-assigned this Sep 3, 2016

myui mentioned this pull request Sep 3, 2016

ChangeFinder Anomaly and Change-Point Detector #305

Closed

myui reviewed Sep 3, 2016
View reviewed changes

L3Sota reviewed Sep 4, 2016
View reviewed changes

myui added 3 commits September 4, 2016 13:32

Fixed typos

e0fc341

Fixed flatten() scheme

f3518a6

Add tests

eba812c

Fixed comments

8c095d3

myui added 8 commits September 5, 2016 23:44

Updated to fallback to SVD

1a71611

Fixed flatten scheme for computing A

e10bf5f

Applied refactoring and add unit tests

b1cf7fa

Fixed null handling

5c504e8

Added a test case testSota5D()

cf18f62

Updated a test case testSota5D

b35ffc4

Fixed yule-walker solving scheme

0bbe32f

Supported loss1 and loss2 options

622a793

myui merged commit 375913e into master Sep 6, 2016

myui deleted the feature/changefinder1d branch September 6, 2016 13:12

takuti mentioned this pull request Sep 12, 2016

Implement another change-point detector requiring simpler hyperparameters #342

Closed

myui mentioned this pull request Dec 14, 2016

[WIP] Implement SST-based change-point detector apache/incubator-hivemall#11

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Feature/changefinder1d #333

Feature/changefinder1d #333

myui commented Sep 3, 2016

myui Sep 3, 2016

L3Sota Sep 4, 2016

myui commented Sep 3, 2016 •

edited

Loading

myui commented Sep 3, 2016 •

edited

Loading

L3Sota Sep 4, 2016

myui commented Sep 4, 2016

myui commented Sep 4, 2016

myui commented Sep 6, 2016

myui commented Sep 6, 2016

coveralls commented Sep 6, 2016

Feature/changefinder1d #333

Feature/changefinder1d #333

Conversation

myui commented Sep 3, 2016

myui Sep 3, 2016

Choose a reason for hiding this comment

L3Sota Sep 4, 2016

Choose a reason for hiding this comment

myui commented Sep 3, 2016 • edited Loading

myui commented Sep 3, 2016 • edited Loading

L3Sota Sep 4, 2016

Choose a reason for hiding this comment

myui commented Sep 4, 2016

myui commented Sep 4, 2016

myui commented Sep 6, 2016

myui commented Sep 6, 2016

coveralls commented Sep 6, 2016

myui commented Sep 3, 2016 •

edited

Loading

myui commented Sep 3, 2016 •

edited

Loading