Skip to content

Conversation

@synopse
Copy link
Contributor

@synopse synopse commented Mar 22, 2024

  • in fact, the Station[] array was not 64-bytes aligned, so the L1 CPU cache access was suboptimal
  • timing went from around 4.2s to 4.0s on my laptop (5% improvement)
  • we could expect better scaling too with multiple threads on the Benchmark hardware, because L3 shared cache should be less polluted

Arnaud Bouchez added 2 commits March 22, 2024 10:18
- in fact, the Station[] array was not 64-bytes aligned, so the L1 CPU cache access was suboptimal
- timing went from around 4.2s to 4.0s on my laptop (5% improvement)
- we could expect better scaling too with multiple threads, because L3 shared cache should be less polluted
@gcarreno
Copy link
Collaborator

Hey Arnaud(@synopse),

Can you also update this PR with a correction to the issues I mention on your discussion?

Thanks!!

Cheers,
Gus

@gcarreno
Copy link
Collaborator

Hey Arnaud(@synopse),

Never mind, I shoulda looked at your commit 😄

Merging!!

Cheers,
Gus

@gcarreno gcarreno merged commit 6c83298 into ObjectPascal-Community:main Mar 22, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants