Currently these functions are just using Unaligned Loads, we can make them after by aligning the data and doing aligned loads. - [ ] AddScalerU - [ ] ScaleSrcU - [ ] AddScaleU - [ ] ScaleAddU - [ ] AddU - [ ] AddScaleCopyU - [ ] AddSU - [ ] MulElementWiseU - [ ] SumU - [ ] SumSqU - [ ] SumSqDiffU - [ ] SumAbsU - [ ] SumAbsDiffU - [ ] MaxAbsU - [ ] MaxAbsDiffU - [ ] DotU - [ ] DotSU - [ ] Dist2 - [ ] SdcaL1UpdateU - [ ] SdcaL1UpdateSU Reference for algorithm https://github.com/dotnet/machinelearning/pull/1143 cc @danmosemsft @tannergooding @eerhardt