Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

v0.5 - MSVC doesn't compile the loop properly #47

Open
cmuratori opened this issue May 18, 2019 · 1 comment
Open

v0.5 - MSVC doesn't compile the loop properly #47

cmuratori opened this issue May 18, 2019 · 1 comment

Comments

@cmuratori
Copy link
Owner

CLANG gets the full speed, MSVC doesn't - I haven't looked into it yet but this is a reminder to see if MSVC can be coerced into not screwing it up.

- Casey

@cmuratori
Copy link
Owner Author

I took a look at this and unfortunately, although I was able to force the instruction ordering by using a ReadWriteBarrier, I'm not sure that was actually a problem at all... the problem appears to be that the register allocator forces a huge amount of xmm1 usage for no apparent reason. That's the only particular thing I can see different between this code and the CLANG code which runs at speed.

- Casey

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant