New log, log2, exp, exp2 and pow implementations #992

MaxGraey · 2019-12-06T04:34:30Z

Status

Benchmark for `Math.pow` [f64]

Results

Firefox 71

old as pow: 190ms
NEW as pow: 31ms
js pow:     52ms

Chrome 79.0.3945.79

old as pow: 235.450927734375ms
NEW as pow: 91.623046875ms
js pow:     146.000732421875ms

Benchmark for `Mathf.pow` [f32]

Firefox 71

old as pow: 106ms
NEW as pow: 16ms
js pow:     26ms

Chrome 79.0.3945.79

old as pow: 136.93701171875ms
NEW as pow: 39.447265625ms
js pow:     51.557861328125ms

UPDATE

Benchmark for Math.pow [f64] using internal loops in AssemblyScript

Chrome 79.0.3945.88

old as pow: 153.734130859375ms
NEW as pow: 21.9619140625ms
js pow: 137.114990234375ms

MaxGraey · 2019-12-12T00:41:57Z

I decided to add new non-standard Math functions: Math.exp2 / Mathf.exp2 and Math.exp10 / Mathf.exp10. They don't need to be called explicitly, but they will be used for special lowering during Math.pow optimization when the first argument is known at compile time, for example:

2 ** y   ->  exp2(y)

~~10 ** y -> exp10(y)~~ (it turned out this has no benefit compared to pow(10, y))

e ** y   ->  exp(y)

// where y is f32 or f64 and result also f32 or f64
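The lowering rule above can be sketched as a small rewrite function. This is only an illustration in plain TypeScript, not the compiler's actual code: `lowerPow` and its string-based output are hypothetical names chosen for this sketch, and the base is passed as `null` when it is not a compile-time constant.

```typescript
// Hypothetical sketch of the constant-base lowering described above.
// `base` is the compile-time constant value of the first argument, or null
// when it is not known at compile time. `exponent` is the source text of
// the second argument.
function lowerPow(base: number | null, exponent: string): string {
  if (base === 2) return `exp2(${exponent})`;       // 2 ** y -> exp2(y)
  if (base === Math.E) return `exp(${exponent})`;   // e ** y -> exp(y)
  // Everything else, including base 10, stays a regular pow call.
  return base === null ? `pow(x, ${exponent})` : `pow(${base}, ${exponent})`;
}
```

For example, `lowerPow(2, "y")` yields `exp2(y)`, while `lowerPow(10, "y")` is left as `pow(10, y)`, matching the struck-out exp10 rule.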

dcodeIO · 2019-12-12T03:13:44Z

Can you explain a bit what the theory behind these improvements is? For instance, what did the old implementation do that made it slow, and what does the new implementation do that makes it fast? What are the algorithms used here and where are they from? Stuff like that :)

MaxGraey · 2019-12-12T03:20:51Z

Basically, it's an adaptation of the new ARM math library: https://github.com/ARM-software/optimized-routines/tree/master/math (MIT). This link is also present in musl's implementation. The new routines use lookup tables and handle special cases more cleverly, simplifying the path with twofold fast arithmetic where possible; some of the LUTs are needed to speed up FMA emulation. All of this significantly speeds up pow/log/exp, but it sometimes increases code size, so I use these routines mostly for ASC_SHRINK_LEVEL == 0, except for Mathf.pow, whose code size incidentally decreases.
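To give a rough idea of what "using lookup tables" means here, below is a minimal sketch of a table-driven exp2 in plain TypeScript. It is not the ARM routine and not the code in this PR: the table size, the name `exp2Table`, and the cubic polynomial are assumptions made for illustration, and the table is filled with `Math.pow` at startup, whereas a real implementation would embed precomputed constants.

```typescript
// Illustrative table-driven exp2 (assumed parameters, not the ARM code).
const TABLE_BITS = 5;
const N = 1 << TABLE_BITS;            // 32 table entries
const TABLE: number[] = [];
for (let i = 0; i < N; i++) TABLE.push(Math.pow(2, i / N)); // 2^(i/N)

function exp2Table(x: number): number {
  // Write x = e + idx/N + r with integer e, 0 <= idx < N, |r| <= 1/(2N).
  const k = Math.round(x * N);
  const r = x - k / N;                // small remainder
  const idx = ((k % N) + N) % N;      // low bits select the table entry
  const e = (k - idx) / N;            // remaining integer exponent
  // 2^r = exp(t) with t = r * ln2, approximated by a short polynomial
  // because |t| is tiny after the table reduction.
  const t = r * Math.LN2;
  const poly = 1 + t * (1 + t * (0.5 + t * (1 / 6)));
  // Scaling by 2^e is exact for results in the normal range.
  return TABLE[idx] * poly * Math.pow(2, e);
}
```

The table removes most of the argument's range, so a very short polynomial is enough for the remainder. In this sketch the cubic gives a relative error of roughly 1e-9; the real routines use carefully fitted coefficients and larger tables to reach full precision.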

std/assembly/util/math.ts

NOTICE

dcodeIO · 2020-01-01T23:03:45Z

Great, thanks!

MaxGraey added 30 commits December 5, 2019 02:18

init (wip)

15a3462

update (wip)

813b1ad

add logf

7b7b2cb

cleanups (wip)

a2272d6

add log2f_lut

b88aeed

cleanups

955d5ec

more

b9440fa

more

72b0da2

add expf_lut

d381309

cleanups

e65c83f

comments

cd98d81

update (wip)

819f0f0

simplify powf

8b52b6c

refactorings

53f6708

more

445cc10

rebuild libm

99ff383

improve (wip)

9744fbc

refactor

f723b73

add pow_lut

ec71a97

pow_lut pass tests!

c50829f

uncomment inline

dca128f

add exp_lut

25bdc7d

cleanup

7d1eb57

improvments

7c711c4

Merge branch 'master' into speedup-log-exp-pow

873e5a8

rebuild rest tests

e9e3bd5

cleanups

abe60f5

refactoring

7174d1f

refactor & comment about ARM license

5cf441e

add log2_lut

cd0c8b2

MaxGraey added 3 commits December 8, 2019 01:46

rebuild

208f7c6

Merge branch 'master' into speedup-log-exp-pow

5426c81

add Math.exp2 / Mathf.exp2 tests

23ba664

MaxGraey marked this pull request as ready for review December 11, 2019 20:33

MaxGraey requested a review from dcodeIO December 11, 2019 20:33

MaxGraey added 3 commits December 12, 2019 02:44

add Mathf.exp10

236a43c

rebuild

914f4df

remove Mathf.exp10

9d85119

MaxGraey added 3 commits December 12, 2019 21:39

better overflows for expf_lut and exp2f_lut

9a0023d

refactorings

d24d2e1

rebuild

cbcbb22

dcodeIO reviewed Dec 13, 2019

std/assembly/util/math.ts (outdated, resolved)

MaxGraey added 4 commits December 13, 2019 16:00

move Arm license to NOTICE

cb600d3

Merge branch 'master' into speedup-log-exp-pow

3c176f6

rebuild

701b638

Merge branch 'master' into speedup-log-exp-pow

4b9347b

dcodeIO reviewed Dec 18, 2019

std/assembly/util/math.ts (outdated, resolved)

dcodeIO reviewed Dec 18, 2019

std/assembly/util/math.ts (outdated, resolved)

dcodeIO reviewed Dec 18, 2019

std/assembly/util/math.ts (outdated, resolved)

dcodeIO reviewed Dec 18, 2019

NOTICE (outdated, resolved)

MaxGraey added 4 commits December 18, 2019 18:24

convert comments. Remove 2 lines from NOTICE

2b1b32e

more

df7913f

Merge branch 'master' into speedup-log-exp-pow

7ca3644

rebuild

81cf26c

dcodeIO merged commit ab1e1dd into AssemblyScript:master Jan 1, 2020

MaxGraey deleted the speedup-log-exp-pow branch January 1, 2020 23:16
