Marginally Interesting: jblas: 1.1 Released and Some Examples
I have just released jblas 1.1 (release notes). The main additions are Singular Value Decomposition (both “full” and “sparse”) and a fix for a nasty bug with complex return values. Basically, g77 and gfortran treat complex return values in drastically different ways, leading to some very odd bugs resulting from a confused stack frame.
Unfortunately, I also had to remove support for 32-bit Mac OS X from the precompiled jar file. The reason is that I lost access to the 32-bit machine I originally used to compile ATLAS for Mac, and every other machine I could get my hands on apparently runs 64-bit. I also didn’t quite get the 32-bit version of gcc working. But in case you have a 32-bit machine and need jblas, please contact me and I’ll help you compile the beast.
Examples
To make up for that, here are two examples of how to use jblas. These are from my poster at this year’s MLOSS workshop at ICML.
You can download the slides here.
I’ve also put the source up on github. Note that these sources are not meant to be a good example for coding style or project layout, but just an example of how to use jblas!
Kernel Ridge Regression
The first example is taken from machine learning: let’s learn a noisy sinc data set with Kernel Ridge Regression (KRR) and a Gaussian kernel.
The data set
First, we define the sinc function, which is usually defined as follows (up to scaling of the $x$): $\operatorname{sinc}(x) = \frac{\sin(x)}{x}$ for $x \neq 0$, and $\operatorname{sinc}(0) = 1$.
A first try at coding this in Java using jblas is to just focus on the part where $x \neq 0$.
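A minimal sketch could look like this (the method name sinc is my choice, and I’ll assume this and the following snippets live together in one class):

```java
import org.jblas.DoubleMatrix;
import static org.jblas.MatrixFunctions.sin;

// First attempt: sinc(x) = sin(x) / x, ignoring the x = 0 case for now.
public static DoubleMatrix sinc(DoubleMatrix x) {
    return sin(x).div(x);
}
```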
Note how sin and div “vectorize” over a number of vectors given as a matrix.
Now let’s add the $x = 0$ case.
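One way to handle it without an explicit loop is jblas’s element-wise comparison eq(); a sketch:

```java
import org.jblas.DoubleMatrix;
import static org.jblas.MatrixFunctions.sin;

// sinc(x) = sin(x) / x for x != 0, and 1 for x = 0.
// x.eq(0.0) is 1.0 exactly where x is zero, so dividing by
// x + (x == 0) avoids the division by zero, and adding the
// indicator again puts a 1 at those entries.
public static DoubleMatrix sinc(DoubleMatrix x) {
    DoubleMatrix isZero = x.eq(0.0);
    return sin(x).div(x.add(isZero)).add(isZero);
}
```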
Note how we first “patch” for the case where x is zero, and then again add those points to get the required output.
Next, we draw some data from the sinc function and add noise. Our data model looks like this: $y_i = \operatorname{sinc}(x_i) + \varepsilon_i$, where the $\varepsilon_i$ are i.i.d. Gaussian noise with standard deviation $\sigma$.
The following function generates a data set by returning an array of two DoubleMatrices representing $X$ and $Y$.
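A sketch (the method name sincDataset, the interval $[-4, 4]$, and the noise parameter are my choices):

```java
import org.jblas.DoubleMatrix;
import static org.jblas.DoubleMatrix.rand;
import static org.jblas.DoubleMatrix.randn;

// Draw n points uniformly from [-4, 4] (an assumed interval) and add
// Gaussian noise with standard deviation `noise` to the sinc values.
public static DoubleMatrix[] sincDataset(int n, double noise) {
    DoubleMatrix X = rand(n).mul(8.0).sub(4.0);
    DoubleMatrix Y = sinc(X).add(randn(n).mul(noise));
    return new DoubleMatrix[] { X, Y };
}
```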
The Gaussian kernel
For KRR, we need to compute the whole kernel matrix. The kernel function is defined as $k(x, z) = \exp\left(-\frac{\lVert x - z \rVert^2}{w}\right)$, where $w$ is the kernel width.
You can easily compute the kernel matrix using Geometry.pairwiseSquaredDistances().
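A sketch (the method name gaussianKernel is my choice; pairwiseSquaredDistances() treats columns as points, so the row-wise data matrices are transposed first):

```java
import org.jblas.DoubleMatrix;
import org.jblas.Geometry;
import static org.jblas.MatrixFunctions.exp;

// Gaussian kernel matrix between the rows of X and the rows of Z:
// K(i, j) = exp(-||x_i - z_j||^2 / w).
public static DoubleMatrix gaussianKernel(double w, DoubleMatrix X, DoubleMatrix Z) {
    DoubleMatrix d = Geometry.pairwiseSquaredDistances(X.transpose(), Z.transpose());
    return exp(d.div(-w));
}
```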
Kernel Ridge Regression
KRR learns a “normal” kernel model of the form $f(x) = \sum_{i=1}^n \alpha_i k(x, x_i)$ with $\alpha = (K + \lambda I)^{-1} Y$, where $K$ is the kernel matrix with entries $K_{ij} = k(x_i, x_j)$ and $\lambda$ is the regularization parameter.
With jblas, you would compute the $\alpha$ as follows.
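A sketch, with Solve.solveSymmetric() doing the actual work (the method name learnKRR is my choice; Solve.solvePositive() would work just as well, since $K + \lambda I$ is positive definite):

```java
import org.jblas.DoubleMatrix;
import org.jblas.Solve;

// alpha = (K + lambda * I)^{-1} Y, solved as a symmetric linear system.
public static DoubleMatrix learnKRR(DoubleMatrix X, DoubleMatrix Y,
                                    double w, double lambda) {
    DoubleMatrix K = gaussianKernel(w, X, X);
    DoubleMatrix regularizedK = K.add(DoubleMatrix.eye(X.rows).mul(lambda));
    return Solve.solveSymmetric(regularizedK, Y);
}
```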
The function predictKRR computes predictions on new points stored as rows of the matrix XE.
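A matching sketch, reusing the gaussianKernel() helper from above:

```java
import org.jblas.DoubleMatrix;

// Predict f(x) = sum_i alpha_i k(x, x_i) for every row x of XE.
public static DoubleMatrix predictKRR(DoubleMatrix XE, DoubleMatrix X,
                                      double w, DoubleMatrix alpha) {
    DoubleMatrix K = gaussianKernel(w, XE, X); // kernels between new and training points
    return K.mmul(alpha);
}
```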
Computing the mean-squared error
Finally, in order to assess the quality of our fit, we would like to compute the mean-squared error $\frac{1}{n}\sum_{i=1}^n (y_i - \hat{y}_i)^2$ in jblas.
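A sketch (the method name mse is my choice):

```java
import org.jblas.DoubleMatrix;
import static org.jblas.MatrixFunctions.pow;

// Mean of the squared element-wise differences between Y1 and Y2.
public static double mse(DoubleMatrix Y1, DoubleMatrix Y2) {
    DoubleMatrix diff = Y1.sub(Y2);
    return pow(diff, 2).mean();
}
```

Putting the pieces together: generate a data set with sincDataset(), compute the $\alpha$ with learnKRR(), predict on test points with predictKRR(), and compare the predictions against the true sinc values with mse().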
Conjugate Gradients
As a second example, let’s implement the conjugate gradients algorithm. This is an iterative algorithm for solving linear equations of the form $Ax = b$, where $A$ is symmetric and positive definite.
The pseudo-code looks as follows:
1. $r \gets b - Ax$
2. $p \gets r$
3. repeat
4. $\alpha \gets \frac{r^T r}{p^T A p}$
5. $x \gets x + \alpha p$
6. $r' \gets r - \alpha A p$
7. if $r'$ is sufficiently small, exit loop
8. $\beta \gets \frac{{r'}^T r'}{r^T r}$
9. $p \gets r' + \beta p$
10. $r \gets r'$
11. end repeat
In jblas, the algorithm looks as follows (the numbers in the comments refer to the steps of the algorithm above).
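A sketch (the method name cg and the convergence threshold thresh are my choices):

```java
import org.jblas.DoubleMatrix;

// Solve A x = b for symmetric, positive definite A, starting from the
// initial guess x and iterating until ||r'|| < thresh.
public static DoubleMatrix cg(DoubleMatrix A, DoubleMatrix b,
                              DoubleMatrix x, double thresh) {
    DoubleMatrix r = b.sub(A.mmul(x));              // 1
    DoubleMatrix p = r.dup();                       // 2
    DoubleMatrix r2 = DoubleMatrix.zeros(x.length); // will hold r'
    DoubleMatrix Ap = DoubleMatrix.zeros(x.length); // will hold A * p
    while (true) {                                  // 3
        A.mmuli(p, Ap);
        double alpha = r.dot(r) / p.dot(Ap);        // 4
        x.addi(p.mul(alpha));                       // 5
        r.subi(Ap.mul(alpha), r2);                  // 6
        if (r2.norm2() < thresh)                    // 7
            break;
        double beta = r2.dot(r2) / r.dot(r);        // 8
        r2.addi(p.mul(beta), p);                    // 9
        DoubleMatrix t = r; r = r2; r2 = t;         // 10: swap r and r'
    }                                               // 11
    return x;
}
```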
For better speed, I have tried to reduce the number of matrix allocations and used the in-place variants of the arithmetic operations where possible. These kinds of tweaks can often give some performance boost. For example, Ap is used to hold the result of $Ap$, and so on.
As a side effect, the assignment $r \gets r'$ actually becomes a swap between the matrices stored in r and r2.
Posted by Mikio L. Braun at 2010-08-16 16:55:00 +0200