HaldCLUT cleanups after the dust (#3154) has settled #3282

Merged
merged 4 commits into from
May 14, 2016

Floessie
Collaborator

Here are some cleanups, mainly conceived by @heckflosse, for the HaldCLUT code:

  • Load and store source[RGB] aligned again
  • Switch getClutValue() to vfloat2
  • Optimize the SSE2 version of getClutValue() and omit MMX (by Ingo)

Best,
Flössie

Ingo had some cleanup suggestions in #3154, which I tried to implement with
this commit. Although switching to `vfloat2` is a clever idea, I can see
no further speedup from it.
Ingo has provided a solution for the strange Windows crash with
`_mm_cvtpu16_ps()`: It was not an alignment problem, but the use of
MMX instructions which led to the SEGV.

Ingo's solution now omits MMX instructions altogether and is
nevertheless faster than the `_mm_set_ps()` workaround.
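
For illustration, here is a minimal sketch of one common way to widen four unsigned 16-bit values to floats using only SSE2, so that no `__m64`/MMX register is involved as it is with `_mm_cvtpu16_ps()`. This is not necessarily the code merged here; the helper name `u16x4ToFloat` is hypothetical, and the scalar fallback exists only so the sketch also builds on targets without SSE2.

```cpp
#include <cstdint>

#if defined(__SSE2__) || defined(_M_X64)
#include <emmintrin.h>
#define SKETCH_HAVE_SSE2 1
#endif

// Hypothetical helper (not from the RawTherapee sources):
// converts src[0..3] (unsigned 16-bit) to dst[0..3] (float).
inline void u16x4ToFloat(const std::uint16_t* src, float* dst)
{
#ifdef SKETCH_HAVE_SSE2
    // Load 64 bits (four 16-bit values) into the low half of an XMM register.
    // _mm_loadl_epi64 has no alignment requirement.
    const __m128i packed = _mm_loadl_epi64(reinterpret_cast<const __m128i*>(src));

    // Interleave with zeros: each 16-bit value becomes a zero-extended 32-bit integer.
    const __m128i widened = _mm_unpacklo_epi16(packed, _mm_setzero_si128());

    // Signed 32-bit -> float conversion is safe because every value is <= 65535.
    _mm_storeu_ps(dst, _mm_cvtepi32_ps(widened));
#else
    // Portable fallback when SSE2 is not available.
    for (int i = 0; i < 4; ++i) {
        dst[i] = static_cast<float>(src[i]);
    }
#endif
}
```

Because the whole conversion stays in XMM registers, there is no MMX state to clean up with `_mm_empty()` afterwards.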

Many thanks to @heckflosse!
@Floessie
Collaborator Author

@heckflosse Maybe you have missed my last commit? Should be ready to merge now.

@heckflosse heckflosse merged commit 298a8f4 into Beep6581:master May 14, 2016
@heckflosse
Collaborator

@Floessie Yes, I missed it.

@Floessie Floessie deleted the dedusting-haldclut-fix branch May 17, 2016 18:33