[arm64] Tune memory copy performance #7559

jkotas · 2017-03-04T17:36:43Z

Check what was done for x86/x64 in dotnet/coreclr#9786

sdmaclea · 2017-08-31T15:07:35Z

@jkotas @jashook Please assign this to me.

The benchmark that was used to optimize dotnet/coreclr#9786 looks flawed to me. All the measurements are made with a fairly small bucket. That trains the branch predictors, so the benchmark rewards excessively branchy code. I think that is why the "optimized" Buffer.cs code is so complicated. I will add some additional tests which will hopefully detect the branch misprediction penalties.
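To illustrate the concern, here is a rough sketch in C (not the actual coreclr benchmark, and not the tests added later) of how a harness could compare a single repeated copy length against a pseudo-random mix of lengths. Every name here (`next_rand`, `fill_sizes`, `run_copies`, `BUF_LEN`) is hypothetical, and `memcpy` merely stands in for the routine being tuned. With a fixed length, the size-dispatch branches inside the copy routine become fully predictable. With mixed lengths they may not, which is the case the comment above suggests the original measurements missed.

```c
#include <assert.h>
#include <stddef.h>
#include <stdint.h>
#include <string.h>
#include <time.h>

#define BUF_LEN 4096 /* assumed upper bound on a single copy in this sketch */

/* Hypothetical helper: xorshift32 PRNG so the size sequence is reproducible. */
static uint32_t next_rand(uint32_t *state) {
    uint32_t x = *state;
    x ^= x << 13;
    x ^= x >> 17;
    x ^= x << 5;
    *state = x;
    return x;
}

/* Fill sizes[] with copy lengths in [1, max_len].
 * randomize == 0: every entry is max_len (branch-predictor friendly).
 * randomize != 0: entries vary pseudo-randomly (harder to predict). */
static void fill_sizes(size_t *sizes, size_t count, size_t max_len, int randomize) {
    uint32_t state = 2463534242u; /* any nonzero seed works for xorshift */
    if (max_len == 0) max_len = 1;
    if (max_len > BUF_LEN) max_len = BUF_LEN;
    for (size_t i = 0; i < count; i++) {
        sizes[i] = randomize ? (size_t)(next_rand(&state) % max_len) + 1 : max_len;
    }
}

/* Perform one memcpy per entry of sizes[], store the elapsed CPU time in
 * *seconds, and return the total number of bytes requested. */
static size_t run_copies(const size_t *sizes, size_t count, double *seconds) {
    static unsigned char src[BUF_LEN], dst[BUF_LEN];
    static volatile unsigned char sink;
    size_t total = 0;
    clock_t start = clock();
    for (size_t i = 0; i < count; i++) {
        assert(sizes[i] >= 1 && sizes[i] <= BUF_LEN);
        memcpy(dst, src, sizes[i]);
        sink = dst[sizes[i] - 1]; /* read back one byte to discourage eliding the copy */
        total += sizes[i];
    }
    *seconds = (double)(clock() - start) / CLOCKS_PER_SEC;
    return total;
}
```

One way to use it is to call `fill_sizes` twice with the same `count` and `max_len`, once with `randomize` set to 0 and once set to 1, then pass each array to `run_copies` and compare time per byte rather than total time, since the randomized run copies fewer bytes overall. A real measurement would need many more iterations, a higher-resolution timer, and the actual routine under test.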

jkotas · 2017-08-31T15:17:37Z

cc @vkvenkat

jkotas assigned sdmaclea Aug 31, 2017

jkotas closed this as completed in dotnet/coreclr#13793 Sep 9, 2017

msftgits transferred this issue from dotnet/coreclr Jan 31, 2020

msftgits added this to the Future milestone Jan 31, 2020

dotnet locked as resolved and limited conversation to collaborators Dec 25, 2020
