This repository has been archived by the owner on Aug 15, 2019. It is now read-only.

Add block matmul #1212

Merged 13 commits into tensorflow:master on Aug 9, 2018

Conversation

@aman-tiwari (Contributor) commented Aug 6, 2018

Description

Currently: ~1.4x speedup on a 512x512 matrix-matrix matmul.

In response to tensorflow/tfjs#582


For repository owners only:

Please remember to apply all applicable tags to your pull request.
Tags: FEATURE, BREAKING, BUG, PERF, DEV, DOC, SECURITY

For more info see: https://github.com/tensorflow/tfjs/blob/master/DEVELOPMENT.md


This change is Reviewable

@aman-tiwari (Contributor, Author)
This is the benchmarking code (removed from the PR):

  it('benchmark matmul sq matrix', async done => {
    const backend = tf.ENV.backend as MathBackendCPU;
    const bs = [32, 48, 64, (64 / 2) + 64, 128, (128 / 2) + 128];
    const ns = [64, 128, 192, 256, 239, 398, 512];
    const RUNS = 20;
    for (const n of ns) {
      const a = tf.randomUniform([n, n]) as tf.Tensor2D;
      const b = tf.randomUniform([n, n]) as tf.Tensor2D;
      // Warmup.
      backend.matMulNaive(a, b, false, false).dataSync();

      let res: tf.Tensor|null = null;
      const start = now();
      for (let i = 0; i < RUNS; i++) {
        res = backend.matMulNaive(a, b, false, false);
      }
      res!.dataSync();
      const naiveTime = (now() - start) / RUNS;
      console.log(`N: ${n}\t ${naiveTime.toFixed(2)}ms`);

      for (const blockSize of bs) {
        backend.blockSize = blockSize;
        const a = tf.randomUniform([n, n]) as tf.Tensor2D;
        const b = tf.randomUniform([n, n]) as tf.Tensor2D;
        // Warmup.
        backend.matMul(a, b, false, false).dataSync();

        let res: tf.Tensor|null = null;
        const start = now();
        for (let i = 0; i < RUNS; i++) {
          res = backend.matMul(a, b, false, false);
        }
        res!.dataSync();
        const elapsed = (now() - start) / RUNS;
        const speedup = (naiveTime / elapsed).toFixed(2);
        console.log(
            `mul BS: ${blockSize}\t ${elapsed.toFixed(2)} ms\t speedup: ${
                speedup}x\t diff:${(elapsed - naiveTime).toFixed(2)}ms`);
        await tf.nextFrame();
      }
    }
    done();
  });
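For readers unfamiliar with the technique being benchmarked: a cache-blocked ("tiled") matmul walks the matrices in blockSize x blockSize tiles so each tile of the inputs and output stays cache-resident while its partial products accumulate, instead of streaming whole rows and columns through cache on every inner product. A minimal standalone sketch follows; the function name, flat row-major Float32Array layout, and loop order are illustrative assumptions, not the PR's actual MathBackendCPU code.

```typescript
// Sketch of a cache-blocked matmul for square n x n matrices stored
// flat in row-major order. Hypothetical helper, for illustration only.
function matMulBlocked(
    a: Float32Array, b: Float32Array, n: number,
    blockSize = 48): Float32Array {
  const out = new Float32Array(n * n);  // zero-initialized accumulator
  // Tile all three loops; each (i0, k0, j0) triple processes one
  // blockSize x blockSize tile of a, b, and out.
  for (let i0 = 0; i0 < n; i0 += blockSize) {
    for (let k0 = 0; k0 < n; k0 += blockSize) {
      for (let j0 = 0; j0 < n; j0 += blockSize) {
        const iMax = Math.min(i0 + blockSize, n);
        const kMax = Math.min(k0 + blockSize, n);
        const jMax = Math.min(j0 + blockSize, n);
        for (let i = i0; i < iMax; i++) {
          for (let k = k0; k < kMax; k++) {
            const aik = a[i * n + k];  // hoisted out of the j loop
            for (let j = j0; j < jMax; j++) {
              out[i * n + j] += aik * b[k * n + j];
            }
          }
        }
      }
    }
  }
  return out;
}
```

The benchmark above sweeps `backend.blockSize` over several candidates precisely because the best tile size depends on the cache hierarchy; per the discussion below, 48 came out on top.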

@dsmilkov (Contributor) left a comment

Reviewed 1 of 3 files at r1, 1 of 2 files at r2, 1 of 1 files at r3, 1 of 1 files at r4.
Reviewable status: 0 of 1 approvals obtained

@aman-tiwari (Contributor, Author)

Benchmarking across matrix sizes, we found the best block size for the cache-blocked matrix multiply to be 48. We pay a 0.5-1 ms penalty on small matrices but gain hundreds of milliseconds on large ones.

@dsmilkov dsmilkov changed the title WIP: Add block matmul Add block matmul Aug 9, 2018
@dsmilkov (Contributor) left a comment

Reviewed 1 of 1 files at r5.
Reviewable status: :shipit: complete! 1 of 1 approvals obtained

@dsmilkov dsmilkov merged commit dbb1b81 into tensorflow:master Aug 9, 2018