Multidimensional array similar to NumPy and NArray
Ruby Java
Fetching latest commit…
Cannot retrieve the latest commit at this time.
Permalink
Failed to load latest commit information.
lib
src
target Adding helper Dec 23, 2014
test Merge branch 'master' of https://github.com/rbotafogo/mdarray Jan 5, 2016
vendor
.gitignore Add support for JRuby 9.0.4.0 and remove support for Lazy MDArray Dec 30, 2015
Announcement.docx
Announcement_0_5_3.docx adding support for linear algebra Nov 10, 2013
Announcement_0_5_4.docx
Announcement_0_5_5.docx Preparing version 0.5.5 Nov 14, 2013
Announcement_0_5_5_1.docx
Announcement_0_5_5_2.docx
Announcement_0_5_6.docx
ChangeLog
Contribuiting.md
Gemfile
LICENSE.txt
README.md
Rakefile
config.rb
mdarray.gemspec
version.rb Add support for JRuby 9.0.4.0 and remove support for Lazy MDArray Dec 30, 2015

README.md

Announcement

MDArray version 0.5.5.2 has been released. MDArray is a multi dimensional array implemented for JRuby inspired by NumPy (www.numpy.org) and Masahiro Tanaka´s Narray (narray.rubyforge.org).
MDArray stands on the shoulders of Java-NetCDF and Parallel Colt. At this point MDArray has libraries for linear algebra, mathematical, trigonometric and descriptive statistics methods.

NetCDF-Java Library is a Java interface to NetCDF files, as well as to many other types of scientific data formats. It is developed and distributed by Unidata (http://www.unidata.ucar.edu).

Parallel Colt (https://sites.google.com/site/piotrwendykier/software/parallelcolt is a multithreaded version of Colt (http://acs.lbl.gov/software/colt/). Colt provides a set of Open Source Libraries for High Performance Scientific and Technical Computing in Java. Scientific and technical computing is characterized by demanding problem sizes and a need for high performance at reasonably small memory footprint.

What´s new:

Version 0.5.5.2 is a bug fix for a class StringArray. In Java-NetCDF when passing "string" as type an ObjectArray is created and not a StringArray. This version fix this issue and gets a StringArray when the "string" type is selected.

MDArray and SciRuby:

MDArray subscribes fully to the SciRuby Manifesto (http://sciruby.com/).

“Ruby has for some time had no equivalent to the beautifully constructed NumPy, SciPy, and matplotlib libraries for Python.

We believe that the time for a Ruby science and visualization package has come. Sometimes when a solution of sugar and water becomes super-saturated, from it precipitates a pure, delicious, and diabetes-inducing crystal of sweetness, induced by no more than the tap of a finger. So is occurring now, we believe, with numeric and visualization libraries for Ruby.”

MDArray main properties are:

  • Homogeneous multidimensional array, a table of elements (usually numbers), all of the same type, indexed by a tuple of positive integers;
  • Support for many linear algebra methods (see bellow);
  • Easy calculation for large numerical multi dimensional arrays;
  • Basic types are: boolean, byte, short, int, long, float, double, string, structure;
  • Based on JRuby, which allows importing Java libraries;
  • Operator: +,-,*,/,%,**, >, >=, etc.;
  • Functions: abs, ceil, floor, truncate, is_zero, square, cube, fourth;
  • Binary Operators: &, |, ^, ~ (binary_ones_complement), <<, >>;
  • Ruby Math functions: acos, acosh, asin, asinh, atan, atan2, atanh, cbrt, cos, erf, exp, gamma, hypot, ldexp, log, log10, log2, sin, sinh, sqrt, tan, tanh, neg;
  • Boolean operations on boolean arrays: and, or, not;
  • Fast descriptive statistics from Parallel Colt (complete list found bellow);
  • Easy manipulation of arrays: reshape, reduce dimension, permute, section, slice, etc.;
  • Support for reading and writing NetCDF-3 files;
  • Reading of two dimensional arrays from CSV files (mainly for debugging and simple testing purposes);
  • StatList: a list that can grow/shrink and that can compute Parallel Colt descriptive statistics;
  • Experimental lazy evaluation (still slower than eager evaluation).

Supported linear algebra methods:

  • backwardSolve: Solves the upper triangular system U*x=b;
  • chol: Constructs and returns the cholesky-decomposition of the given matrix.
  • cond: Returns the condition of matrix A, which is the ratio of largest to smallest singular value.
  • det: Returns the determinant of matrix A.
  • eig: Constructs and returns the Eigenvalue-decomposition of the given matrix.
  • forwardSolve: Solves the lower triangular system L*x=b;
  • inverse: Returns the inverse or pseudo-inverse of matrix A.
  • kron: Computes the Kronecker product of two real matrices.
  • lu: Constructs and returns the LU-decomposition of the given matrix.
  • mult: Inner product of two vectors; Sum(x[i] * y[i]).
  • mult: Linear algebraic matrix-vector multiplication; z = A * y.
  • mult: Linear algebraic matrix-matrix multiplication; C = A x B.
  • multOuter: Outer product of two vectors; Sets A[i,j] = x[i] * y[j].
  • norm1: Returns the one-norm of vector x, which is Sum(abs(x[i])).
  • norm1: Returns the one-norm of matrix A, which is the maximum absolute column sum.
  • norm2: Returns the two-norm (aka euclidean norm) of vector x; equivalent to Sqrt(mult(x,x)).
  • norm2: Returns the two-norm of matrix A, which is the maximum singular value; obtained from SVD.
  • normF: Returns the Frobenius norm of matrix A, which is Sqrt(Sum(A[i]2)).
  • normF: Returns the Frobenius norm of matrix A, which is Sqrt(Sum(A[i,j]2)).
  • normInfinity: Returns the infinity norm of vector x, which is Max(abs(x[i])).
  • normInfinity: Returns the infinity norm of matrix A, which is the maximum absolute row sum.
  • pow: Linear algebraic matrix power; B = Ak <==> B = AA...*A.
  • qr: Constructs and returns the QR-decomposition of the given matrix.
  • rank: Returns the effective numerical rank of matrix A, obtained from Singular Value Decomposition.
  • solve: Solves A*x = b.
  • solve: Solves A*X = B.
  • solveTranspose: Solves X*A = B, which is also A'*X' = B'.
  • svd: Constructs and returns the SingularValue-decomposition of the given matrix.
  • trace: Returns the sum of the diagonal elements of matrix A; Sum(A[i,i]).
  • trapezoidalLower: Modifies the matrix to be a lower trapezoidal matrix.
  • vectorNorm2: Returns the two-norm (aka euclidean norm) of vector X.vectorize();
  • xmultOuter: Outer product of two vectors; Returns a matrix with A[i,j] = x[i] * y[j].
  • xpowSlow: Linear algebraic matrix power; B = Ak <==> B = AA...*A.

Properties´ methods tested on matrices:

  • density: Returns the matrix's fraction of non-zero cells; A.cardinality() / A.size().
  • generate_non_singular!: Modifies the given square matrix A such that it is diagonally dominant by row and column, hence non-singular, hence invertible.
  • diagonal?: A matrix A is diagonal if A[i,j] == 0 whenever i != j.
  • diagonally_dominant_by_column?: A matrix A is diagonally dominant by column if the absolute value of each diagonal element is larger than the sum of the absolute values of the off-diagonal elements in the corresponding column.
  • diagonally_dominant_by_row?: A matrix A is diagonally dominant by row if the absolute value of each diagonal element is larger than the sum of the absolute values of the off-diagonal elements in the corresponding row.
  • identity?: A matrix A is an identity matrix if A[i,i] == 1 and all other cells are zero.
  • lower_bidiagonal?: A matrix A is lower bidiagonal if A[i,j]==0 unless i==j || i==j+1.
  • lower_triangular?: A matrix A is lower triangular if A[i,j]==0 whenever i < j.
  • nonnegative?: A matrix A is non-negative if A[i,j] >= 0 holds for all cells.
  • orthogonal?: A square matrix A is orthogonal if A*transpose(A) = I.
  • positive?: A matrix A is positive if A[i,j] > 0 holds for all cells.
  • singular?: A matrix A is singular if it has no inverse, that is, iff det(A)==0.
  • skew_symmetric?: A square matrix A is skew-symmetric if A = -transpose(A), that is A[i,j] == -A[j,i].
  • square?: A matrix A is square if it has the same number of rows and columns.
  • strictly_lower_triangular?: A matrix A is strictly lower triangular if A[i,j]==0 whenever i <= j.
  • strictly_triangular?: A matrix A is strictly triangular if it is triangular and its diagonal elements all equal 0.
  • strictly_upper_triangular?: A matrix A is strictly upper triangular if A[i,j]==0 whenever i >= j.
  • symmetric?: A matrix A is symmetric if A = tranpose(A), that is A[i,j] == A[j,i].
  • triangular?: A matrix A is triangular iff it is either upper or lower triangular.
  • tridiagonal?: A matrix A is tridiagonal if A[i,j]==0 whenever Math.abs(i-j) > 1.
  • unit_triangular?: A matrix A is unit triangular if it is triangular and its diagonal elements all equal 1.
  • upper_bidiagonal?: A matrix A is upper bidiagonal if A[i,j]==0 unless i==j || i==j-1.
  • upper_triangular?: A matrix A is upper triangular if A[i,j]==0 whenever i > j.
  • zero?: A matrix A is zero if all its cells are zero.
  • lower_bandwidth: The lower bandwidth of a square matrix A is the maximum i-j for which A[i,j] is nonzero and i > j.
  • semi_bandwidth: Returns the semi-bandwidth of the given square matrix A.
  • upper_bandwidth: The upper bandwidth of a square matrix A is the maximum j-i for which A[i,j] is nonzero and j > i.

Descriptive statistics methods imported from Parallel Colt:

  • auto_correlation, correlation, covariance, durbin_watson, frequencies, geometric_mean,
  • harmonic_mean, kurtosis, lag1, max, mean, mean_deviation, median, min, moment, moment3,
  • moment4, pooled_mean, pooled_variance, product, quantile, quantile_inverse,
  • rank_interpolated, rms, sample_covariance, sample_kurtosis, sample_kurtosis_standard_error,
  • sample_skew, sample_skew_standard_error, sample_standard_deviation, sample_variance,
  • sample_weighted_variance, skew, split, standard_deviation, standard_error, sum,
  • sum_of_inversions, sum_of_logarithms, sum_of_powers, sum_of_power_deviations,
  • sum_of_squares, sum_of_squared_deviations, trimmed_mean, variance, weighted_mean,
  • weighted_rms, weighted_sums, winsorized_mean.

Double and Float methods from Parallel Colt:

  • acos, asin, atan, atan2, ceil, cos, exp, floor, greater, IEEEremainder, inv, less, lg,
  • log, log2, rint, sin, sqrt, tan.

Double, Float, Long and Int methods from Parallel Colt:

  • abs, compare, div, divNeg, equals, isEqual (is_equal), isGreater (is_greater),
  • isles (is_less), max, min, minus, mod, mult, multNeg (mult_neg), multSquare (mult_square),
  • neg, plus (add), plusAbs (plus_abs), pow (power), sign, square.

Long and Int methods from Parallel Colt

  • and, dec, factorial, inc, not, or, shiftLeft (shift_left), shiftRightSigned (shift_right_signed), shiftRightUnsigned (shift_right_unsigned), xor.

MDArray installation and download:

  • Install Jruby
  • jruby –S gem install mdarray

MDArray Homepages:

Contributors:

Contributors are welcome.

MDArray History:

  • 30/Dec/2014: Version 0.5.5.2 - Fix for StringArray
  • 16/Nov/2014: Version 0.5.5.1 - Small bug fix
  • 14/Nov/2013: Version 0.5.5 - Support for linear algebra methods
  • 07/Aug/2013: Version 0.5.4 - Support for reading and writing NetCDF-3 files
  • 24/Jun/2013: Version 0.5.3 – Over 90% Performance improvements for methods imported from Parallel Colt and over 40% performance improvements for all other methods (implemented in Ruby);
  • 16/Mai/2013: Version 0.5.0 - All loops transferred to Java with over 50% performance improvements. Descriptive statistics from Parallel Colt;
  • 19/Apr/2013: Version 0.4.3 - Fixes a simple, but fatal bug in 0.4.2. No new features;
  • 17/Apr/2013: Version 0.4.2 - Adds simple statistics and boolean operators;
  • 05/Apr/2013: Version 0.4.0 – Initial release.