NDArray support #3

mikera · 2013-01-09T03:23:27Z

Add support for general purpose N-dimensional arrays (like NumPy ndarray)

Features:

Allow arbitrary objects (not just numbers)
Allow in-place modifications
Allow "views" - i.e. slices / subsets of other arrays that can be modified via the view
Can be used as 1D / 2D / 3D vectors / matrices / tensors if filled with java.lang.Numbers

Initial implementation stubs:

https://github.com/mikera/matrix-api/blob/master/src/main/clojure/core/matrix/impl/ndarray.clj

mjwillson · 2013-01-09T22:52:17Z

This looks like a good start, nice one. Something based on Java arrays seems like a good dependency-free baseline implementation. (Fair bit faster than basing it on clojure vectors I'd imagine!)

By the way: does the ^longs type annotation work for higher-dimensional arrays? I was under the impression it only covered a plain 1D array, and that you might need to annotate with something a bit more clunky like (Class/forName "[[[D") or similar for higher-dimensional array types. Couldn't spot much documentation on it.

mikera · 2013-01-09T23:56:38Z

The Java array implementation is necessary for fast mutability more than anything else, so we can provide a decent default implementation of mutable array operations.

^longs doesn't work for higher dimensional arrays. But for our NDArray we don't care about multi-dimensional Java arrays since we are going to flatten the array into a long 1D array (with variable strides, like NumPy).
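To make the flattened layout concrete, here is a minimal sketch of strided indexing over a single 1D Java array. The function names (`row-major-strides`, `flat-index`, `nd-get`) are hypothetical and chosen for illustration; they are not taken from the core.matrix source, and the real NDArray may organise this differently.

```clojure
;; Illustrative sketch only: these helper names are assumptions,
;; not part of the core.matrix API.

(defn row-major-strides
  "Strides for a densely packed row-major array of the given shape,
  e.g. [2 3 4] -> [12 4 1]."
  [shape]
  (vec (reverse (reductions * 1 (reverse (rest shape))))))

(defn flat-index
  "Position in the flat backing array of the element at `indices`,
  given a starting `offset` and one stride per dimension."
  [offset strides indices]
  (reduce + offset (map * strides indices)))

(defn nd-get
  "Reads one element from a flat double array. The ^doubles hint is
  enough here because the backing array is always one-dimensional."
  [^doubles data offset strides indices]
  (aget data (int (flat-index offset strides indices))))

;; A 2x3x4 array stored as 24 consecutive doubles.
(def data (double-array (range 24)))
(def strides (row-major-strides [2 3 4]))

(nd-get data 0 strides [1 2 3]) ; element at flat position 12 + 8 + 3 = 23
```

Because only the 1D array is ever type-hinted, the multi-dimensional hint question does not arise in this layout.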

heffalump · 2013-01-10T04:20:56Z

Should this just replace the PersistentVector implementation? Otherwise we may well end up doubling up on a whole load of matrix algorithms internally, to little benefit. The matrix constructor should take nested persistent vectors for ease of use, of course.

mikera · 2013-01-10T07:27:21Z

@heffalump: I'm not sure.

Clearly the NDArray has the potential to be a more "serious" implementation. It gives us a good base implementation that approximates NumPy-style functionality. And it lets us test a "mutable" array implementation.

At the same time the persistent vector implementation is very useful for quick tests / interop with idiomatic Clojure where it is very easy to construct and use vectors. It's also good for testing as an "immutable" array implementation.

I'm hoping that many of the generic implementations of matrix functions can be written in a way that works on both. Consider my primitive trace implementation for example:

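    ;; Relies only on the generic matrix access protocols
    (trace [m]
      (when-not (square? m) (error "Can't compute trace of non-square matrix"))
      (let [dims (long (row-count m))]
        ;; Sum the diagonal with a primitive double accumulator
        (loop [i 0 res 0.0]
          (if (>= i dims)
            res
            (recur (inc i) (+ res (double (mp/get-2d m i i))))))))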

That should work fine on both NDArrays and persistent vectors (and anything else that implements the standard matrix access protocols)
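As a self-contained illustration of that idea, the sketch below defines a small stand-in protocol and runs one trace function over both an immutable nested vector and a mutable flat Java array. Everything here (`PAccess2D`, `row-count*`, `col-count*`, `get-2d*`, `FlatMatrix`, `trace*`) is a hypothetical name invented for the example and should not be read as the actual core.matrix protocols.

```clojure
;; Hypothetical stand-in for the matrix access protocols; these names
;; are illustrative and are not the core.matrix API.
(defprotocol PAccess2D
  (row-count* [m])
  (col-count* [m])
  (get-2d* [m i j]))

;; Nested persistent vectors: the immutable implementation.
(extend-protocol PAccess2D
  clojure.lang.IPersistentVector
  (row-count* [m] (count m))
  (col-count* [m] (count (first m)))
  (get-2d* [m i j] (nth (nth m i) j)))

;; A flat double array plus dimensions: the mutable implementation.
(defrecord FlatMatrix [^doubles data rows cols]
  PAccess2D
  (row-count* [_] rows)
  (col-count* [_] cols)
  (get-2d* [_ i j] (aget data (int (+ (* i cols) j)))))

;; One generic trace, written only against the protocol.
(defn trace* [m]
  (when-not (= (row-count* m) (col-count* m))
    (throw (IllegalArgumentException.
            "Can't compute trace of non-square matrix")))
  (let [n (long (row-count* m))]
    (loop [i 0 res 0.0]
      (if (>= i n)
        res
        (recur (inc i) (+ res (double (get-2d* m i i))))))))

(trace* [[1 2] [3 4]])                             ; 1 + 4
(trace* (->FlatMatrix (double-array [1 2 3 4]) 2 2)) ; same diagonal
```

Both calls go through the same loop; only the protocol implementation differs.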

So for the moment I suggest keeping both in... they both have their uses, they are relatively standalone and it doesn't seem likely to cost us too much in terms of extra code.

mikera · 2013-01-12T11:08:01Z

Worth looking at in this context:

https://code.google.com/p/clj-multiarray/

mjwillson · 2013-01-12T15:28:43Z

Re flattened array -- ah of course, should've spotted that :) and strided access makes views possible, nice.

mikera · 2013-01-13T01:37:58Z

Strided arrays are an awesome trick: probably NumPy's secret weapon in fact.

Some of the things you can do with them:

Copy-free array broadcasting (set strides to zero)
Copy-free transposes (permute dimensions / strides)
Copy-free submatrices (combine original strides with an offset and reduced dimensions)
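The three tricks above can be sketched with a view represented as a plain map holding `:data`, `:offset`, `:shape` and `:strides`. This representation and the function names are assumptions made for the example, not the layout used by any particular library; note that `update` requires Clojure 1.7 or later.

```clojure
;; Illustrative sketch: a "view" is a map over a shared flat double array.
;; The map keys and function names are assumptions for this example.

(defn view-get
  "Reads the element at `indices` through the view's offset and strides."
  [{:keys [data offset strides]} indices]
  (aget ^doubles data (int (reduce + offset (map * strides indices)))))

(defn transpose-view
  "Copy-free transpose: reverse the dimensions and the strides."
  [v]
  (-> v
      (update :shape (comp vec reverse))
      (update :strides (comp vec reverse))))

(defn submatrix-view
  "Copy-free submatrix: keep the strides, move the offset to `start`,
  and shrink the shape to `shape`."
  [v start shape]
  (-> v
      (update :offset + (reduce + (map * (:strides v) start)))
      (assoc :shape (vec shape))))

(defn broadcast-rows
  "Copy-free broadcast: present a 1D view as `n` identical rows by
  adding a leading dimension whose stride is zero."
  [v n]
  (-> v
      (update :shape #(vec (cons n %)))
      (update :strides #(vec (cons 0 %)))))

;; A 3x4 matrix stored row-major in 12 doubles.
(def m {:data (double-array (range 12)) :offset 0 :shape [3 4] :strides [4 1]})

(view-get m [1 2])                               ; flat position 6
(view-get (transpose-view m) [2 1])              ; same element, indices swapped
(view-get (submatrix-view m [1 1] [2 2]) [0 0])  ; flat position 5
```

All of these views share the same backing array, so a write through one of them would be visible through the others.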

The technique originates in computer graphics I think: I certainly remember coding strided image access in assembler during the 80s/90s.... funny to see the same technique being used now for matrix computations!

mjwillson · 2013-01-23T23:11:13Z

Another library I spotted for NDArray stuff on the JVM: http://code.google.com/p/array4j/

"a vector, matrix and N-dimensional array library for Java that combines ideas from JAMA, Matrix Toolkits for Java, JScience and NumPy. ... uses JNA to interface with vendor BLAS implementations".

Doesn't seem to've been much activity in the last 5 years or so mind.

mikera · 2013-01-24T00:22:55Z

egads..... more of them coming out of the woodwork.

Unless I'm missing something though, this one looks only barely started, e.g. the double array implementation has hardly any code in it:

http://code.google.com/p/array4j/source/browse/trunk/src/main/java/net/lunglet/array4j/array/DoubleArrayImpl.java

Still, looks like it would fit the core.matrix API if it ever got finished :-)

mjwillson · 2013-01-24T15:42:22Z

Just starting to look at NDArray support, and I think some terminology tweaks might be a good start: #18

mikera · 2013-01-25T03:05:15Z

Good idea on the terminology tweaks. I'll keep this issue open to represent the need for a proper NDArray implementation inside core.matrix itself.

mikera · 2013-02-18T06:30:10Z

Have got a basic implementation working in core/matrix/impl/ndarray.clj as of release 0.2.0

mikera · 2013-02-18T06:31:37Z

Closing this issue as it doesn't seem actionable - discussion on further enhancements should move to the google group.

https://groups.google.com/forum/?fromgroups#!forum/numerical-clojure

mikera closed this as completed Feb 18, 2013
