This repository has been archived by the owner on Aug 15, 2019. It is now read-only.

vectorize min/max/logsumexp/nan/reduce_sum #145

Merged 2 commits on Sep 22, 2017

Contributor

@dsmilkov dsmilkov commented Sep 22, 2017

2x improvement for min/max and 4x for LogSumExp.

Old Max on Linux:
[image: max]

Vectorized Max on Linux:
[image: max-vec4]

Old LogSumExp on Linux:
[image: logsumexp]

Vectorized LogSumExp on Linux:
[image: logsumexp-vec4]
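
The shader changes themselves are not shown in this conversation, so the following is only a CPU-side sketch of what a vec4-style reduction might look like, assuming "vectorize" here means consuming four values per step with a scalar loop for the remainder. The function name `logSumExpVec4` is hypothetical and is not part of the deeplearn.js API.

```typescript
// Hypothetical illustration only: a CPU analogue of a vec4-style reduction,
// not the GLSL shader code from this pull request.

// Numerically stable log-sum-exp that reads four values per loop iteration,
// the way a vec4 shader could fetch four values at once.
function logSumExpVec4(values: Float32Array): number {
  const n = values.length;
  if (n === 0) {
    return -Infinity; // log of an empty sum
  }

  // Index where the groups of four end; the tail is handled one by one.
  const vec4End = n - (n % 4);

  // Pass 1: find the maximum, four values at a time.
  let max = -Infinity;
  for (let i = 0; i < vec4End; i += 4) {
    max = Math.max(
        max, values[i], values[i + 1], values[i + 2], values[i + 3]);
  }
  for (let i = vec4End; i < n; i++) {
    max = Math.max(max, values[i]);
  }

  // If the max is +/-Infinity or NaN, subtracting it would produce NaN,
  // so return it directly.
  if (!isFinite(max)) {
    return max;
  }

  // Pass 2: sum exp(x - max), again four values at a time.
  let sum = 0;
  for (let i = 0; i < vec4End; i += 4) {
    sum += Math.exp(values[i] - max) + Math.exp(values[i + 1] - max) +
        Math.exp(values[i + 2] - max) + Math.exp(values[i + 3] - max);
  }
  for (let i = vec4End; i < n; i++) {
    sum += Math.exp(values[i] - max);
  }

  return max + Math.log(sum);
}

// Usage: five inputs exercise both the four-wide loop and the scalar tail.
const result = logSumExpVec4(new Float32Array([1, 2, 3, 4, 5]));
console.log(result);
```

On a GPU the gain would come from fewer texture fetches and loop iterations per output value; this JavaScript loop only mirrors the access pattern and is not expected to reproduce the measured speedups.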



@nsthorat
Contributor

Reviewed 6 of 6 files at r1.
Review status: all files reviewed at latest revision, all discussions resolved.



@nsthorat
Contributor

:lgtm_strong:



@dsmilkov dsmilkov merged commit 50c3805 into master Sep 22, 2017
@dsmilkov dsmilkov deleted the vec4 branch September 22, 2017 22:39
mnottheone pushed a commit to mnottheone/deeplearnjs that referenced this pull request Dec 1, 2018
* vectorize min/max/logsumexp/nan shaders

* vectorize reduce sum