add simd example #13

Open · wants to merge 2 commits into main

Conversation

@attacker0211 (Collaborator) commented Jun 14, 2022

Implements #11

@penzn left a comment

Sorry, came across by accident.

Looks like the SIMD mul technically processes one element; for matrices it is usually easier to write a vector dot product and then build the rest on top of it.

Additionally, floating-point arithmetic is more interesting here, as floating-point operations are more expensive.
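
A minimal sketch of the floating-point case (not part of this PR; it assumes core::arch::wasm32, a build with -C target-feature=+simd128, and the helper name is made up for illustration):

use core::arch::wasm32::*;

// Multiplies two pairs of f64 values with a single f64x2.mul instruction.
fn mul_f64_pairs(a: [f64; 2], b: [f64; 2]) -> [f64; 2] {
    let va: v128 = f64x2(a[0], a[1]);
    let vb: v128 = f64x2(b[0], b[1]);
    let vc = f64x2_mul(va, vb);
    [f64x2_extract_lane::<0>(vc), f64x2_extract_lane::<1>(vc)]
}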

fn mul(a: u64, b: u64) -> u64 {
    let va: v128 = u64x2_splat(a);
    let vb: v128 = u64x2_splat(b);
    let c = u64x2_extract_lane::<1>(i64x2_mul(va, vb));
    c
}

I think this is technically a scalar multiplication - it fills all lanes with the same value and then extracts just one value out of the result.

@attacker0211 (Collaborator, Author)

Thank you so much for your review. I wasn't familiar with SIMD and made a mistake. I would appreciate feedback on the new code.

Comment on lines 15 to 22
fn dot(a: Vec<u64>, b: Vec<u64>) -> u64 {
    assert!(a.len() == b.len());
    let mut sum: u64 = 0;
    for i in 0..a.len() {
        sum += Self::mul(a[i], b[i]);
    }
    sum
}

The dot product is the smallest unit of work in matrix multiplication that can be implemented in SIMD. It usually works by taking N elements from the first array and N from the second (N being the number of lanes), multiplying them via SIMD, and adding the N products to an intermediate vector sum. The intermediate sum is then added up at the end; for input sizes not divisible by N, the remainder needs to be calculated manually.
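
As a rough sketch of that shape (not the PR's code; it assumes core::arch::wasm32 with simd128 enabled and uses wrapping u64 arithmetic):

use core::arch::wasm32::*;

fn dot_simd(a: &[u64], b: &[u64]) -> u64 {
    assert_eq!(a.len(), b.len());
    const LANES: usize = 2; // u64x2 holds two 64-bit lanes

    // Intermediate vector sum: accumulate two products per iteration.
    let mut vsum = u64x2_splat(0);
    let chunks = a.len() / LANES;
    for i in 0..chunks {
        let va = u64x2(a[2 * i], a[2 * i + 1]);
        let vb = u64x2(b[2 * i], b[2 * i + 1]);
        vsum = i64x2_add(vsum, i64x2_mul(va, vb));
    }

    // Add up the intermediate vector sum at the end.
    let mut sum = u64x2_extract_lane::<0>(vsum)
        .wrapping_add(u64x2_extract_lane::<1>(vsum));

    // Input sizes not divisible by the lane count: finish the remainder manually.
    for i in (chunks * LANES)..a.len() {
        sum = sum.wrapping_add(a[i].wrapping_mul(b[i]));
    }
    sum
}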

@attacker0211 (Collaborator, Author)

Updated. Please let me know if there is anything I could do better. I assume the floating-point implementation should be similar (please let me know if it isn't), so I will update the floating-point examples once this one is OK :)

Comment on lines +1 to +4
u64x2-scalar-mul: func(a: u64, b: list<u64>) -> list<u64>
u64x2-dot: func(a: list<u64>, b: list<u64>) -> u64
u64x2-inner: func(a: list<u64>, b: list<u64>) -> list<u64>
u64x2-mat-mul: func(a: list<list<u64>>, b: list<list<u64>>) -> list<list<u64>>
Contributor

Rather than using list, you should use SingleStore-compatible packed 64-bit vectors:

https://docs.singlestore.com/db/v7.8/en/reference/sql-reference/vector-functions/vector-functions.html
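
Purely to illustrate that direction (a hypothetical helper, not part of this PR; it assumes the vectors arrive as little-endian packed byte blobs, which would need to be checked against SingleStore's actual format):

fn unpack_u64(blob: &[u8]) -> Vec<u64> {
    // Each element occupies 8 bytes in the packed representation.
    assert!(blob.len() % 8 == 0, "blob must contain whole 64-bit values");
    blob.chunks_exact(8)
        .map(|c| u64::from_le_bytes(c.try_into().unwrap()))
        .collect()
}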

use core::arch::wasm32::*;

impl simd::Simd for Simd {
    fn u64x2_scalar_mul(a: u64, b: Vec<u64>) -> Vec<u64> {
Contributor

Please add docstrings to each function explaining its purpose.
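
For example, something along these lines (the doc comment style is the point; the body shown is only a hypothetical implementation of u64x2_scalar_mul, assuming the core::arch::wasm32 imports already in the file):

/// Multiplies every element of `b` by the scalar `a`, two u64 lanes at a time.
///
/// `a` is splatted into both lanes of a v128; a trailing element of an
/// odd-length input is handled with plain scalar code.
fn u64x2_scalar_mul(a: u64, b: Vec<u64>) -> Vec<u64> {
    let va = u64x2_splat(a);
    let mut out = Vec::with_capacity(b.len());
    for pair in b.chunks_exact(2) {
        let vc = i64x2_mul(va, u64x2(pair[0], pair[1]));
        out.push(u64x2_extract_lane::<0>(vc));
        out.push(u64x2_extract_lane::<1>(vc));
    }
    if b.len() % 2 == 1 {
        out.push(a.wrapping_mul(b[b.len() - 1]));
    }
    out
}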
