Speed up TT store by checking for matches first #170

Thanar2 · 2014-02-15T19:36:23Z

Pure speed improvement to TranspositionTable::store(). The new code first checks all four TTEntry slots in a cluster for an empty or matching one, avoiding the "replace strategy" code completely in cases where an empty or matching one is found. The loop is unrolled to the constant ClusterSize of 4 for a measurable speed improvement.

The loop that implements the replace strategy only needs to execute 3 times instead of the original 4. Since in the old version, replace and tte started off pointing to the same slot which did nothing in the first iteration.

Standard bench shows approximately 1.4% speed improvement on my machine. A local test of 10,000 games at 10 second time control gave the following results: 1942-1816-6242 [.506].

No functional change.

mcostalba · 2014-02-15T21:14:33Z

Could you please give it a run on fishtest? Thanks.

On Sat, Feb 15, 2014 at 8:36 PM, Fr. Terry Donahue, CC <
notifications@github.com> wrote:

Pure speed improvement to TranspositionTable::store(). The new code first
checks all four TTEntry slots in a cluster for an empty or matching one,
avoiding the "replace strategy" code completely in cases where an empty or
matching one is found. The loop is unrolled to the constant ClusterSize of
4 for a measurable speed improvement.

The loop that implements the replace strategy only needs to execute 3
times instead of the original 4. Since in the old version, replace and tte
started off pointing to the same slot which did nothing in the first
iteration.

Standard bench shows approximately 1.4% speed improvement on my machine. A
local test of 10,000 games at 10 second time control gave the following
results: 1942-1816-6242 [.506].

No functional change.

You can merge this Pull Request by running

git pull https://github.com/Thanar2/Stockfish ttstore

Or view, comment on, or merge it at:

#170
Commit Summary

Speed up TT store by checking for matches first

File Changes

M src/tt.cpphttps://github.com/Speed up TT store by checking for matches first #170/files#diff-0(31)

Patch Links:

https://github.com/mcostalba/Stockfish/pull/170.patch

https://github.com/mcostalba/Stockfish/pull/170.diff

—
Reply to this email directly or view it on GitHubhttps://github.com//pull/170
.

mstembera · 2014-02-15T22:51:32Z

Thanar maybe u will find this interesting.

mstembera/Stockfish@2740799...26ef246

mstembera · 2014-02-15T22:53:08Z

Meant to mention mine failed http://tests.stockfishchess.org/tests/view/523b7ff70ebc59749a54ae48

Thanar2 · 2014-02-15T23:56:33Z

Thanks for the heads up on the previous test. Were you able to measure a speed improvement using bench with your version? I did with mine, using profile-builds, but it is always possible to have different cache alignments magnifying or negating any particular minor speed improvement. I'm also curious whether fishtest uses profile-builds or not. Certainly the optimizations make for faster code, but it may be too small to measure.

mcostalba · 2014-02-16T09:30:31Z

Yes, fishtest uses profile builds. I asked to test on fishtest because
different hardware may behave differently, so fishtest is a good mix.

On Sun, Feb 16, 2014 at 12:56 AM, Fr. Terry Donahue, CC <
notifications@github.com> wrote:

Thanks for the heads up on the previous test. Were you able to measure a
speed improvement using bench with your version? I did with mine, using
profile-builds, but it is always possible to have different cache
alignments magnifying or negating any particular minor speed improvement.
I'm also curious whether fishtest uses profile-builds or not. Certainly the
optimizations make for faster code, but it may be too small to measure.

—
Reply to this email directly or view it on GitHubhttps://github.com//pull/170#issuecomment-35171994
.

mstembera · 2014-02-16T09:51:07Z

profile-builds tangent @marco
Since (at least on windows machines) the binary is not compiled locally there will be cases where the profiled build from the fishtest machine will be a bad match for the local hardware. Worse than a non profile build would have been. I always thought profiled builds were intended for hardware with identical timings?

@Thanar
I was able to measure a speedup using QueryPerformanceCounter under MSVC but it wasn't enough overall to be conclusive using bench.

mcostalba · 2014-02-16T10:02:12Z

I don't understand what's the reason of using profile builds. In which way
this is connected with the patch? What kind of optimization profile builds
enables that default doesn't ?

On Sun, Feb 16, 2014 at 10:51 AM, mstembera notifications@github.comwrote:

profile-builds tangent @marco https://github.com/Marco
Since (at least on windows machines) the binary is not compiled locally
there will be cases where the profiled build from the fishtest machine will
be a bad match for the local hardware. Worse than a non profile build would
have been. I always thought profiled builds were intended for hardware with
identical timings?

@Thanar https://github.com/Thanar
I was able to measure a speedup using QueryPerformanceCounter under MSVC
but it wasn't enough overall to be conclusive using bench.

—
Reply to this email directly or view it on GitHubhttps://github.com//pull/170#issuecomment-35181147
.

mstembera · 2014-02-16T10:09:28Z

I don't know what is the reason for profile builds. You mentioned above that fishtest uses them. I just made a general observation that it seems wrong to me to use them if they are not compiled on the local machine. By tangent I meant not related to this patch in particular.

Speed up TT store by checking for matches first

04556d6

mcostalba force-pushed the master branch from d761bc3 to 82f539b Compare September 2, 2014 12:36

mcostalba force-pushed the master branch from 5d4fbce to 068e2bd Compare September 14, 2014 08:11

mcostalba force-pushed the master branch 3 times, most recently from 167a465 to ef14ba8 Compare October 4, 2014 04:25

mcostalba force-pushed the master branch from a0897d4 to 7b9df3e Compare October 11, 2014 07:19

mcostalba closed this Aug 31, 2015

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Speed up TT store by checking for matches first #170

Speed up TT store by checking for matches first #170

Thanar2 commented Feb 15, 2014

mcostalba commented Feb 15, 2014

No functional change.

mstembera commented Feb 15, 2014

mstembera commented Feb 15, 2014

Thanar2 commented Feb 15, 2014

mcostalba commented Feb 16, 2014

mstembera commented Feb 16, 2014

mcostalba commented Feb 16, 2014

mstembera commented Feb 16, 2014

Speed up TT store by checking for matches first #170

Speed up TT store by checking for matches first #170

Conversation

Thanar2 commented Feb 15, 2014

mcostalba commented Feb 15, 2014

No functional change.

mstembera commented Feb 15, 2014

mstembera commented Feb 15, 2014

Thanar2 commented Feb 15, 2014

mcostalba commented Feb 16, 2014

mstembera commented Feb 16, 2014

mcostalba commented Feb 16, 2014

mstembera commented Feb 16, 2014